2602.01532v1 Feb 02, 2026 cs.AI

PRISM: 페스티나 렌테(Festina Lente) 주도성 -- 주도적 에이전트를 위한 위험 민감형 및 불확실성 인식형 숙고

PRISM: Festina Lente Proactivity -- Risk-Sensitive, Uncertainty-Aware Deliberation for Proactive Agents

X. Tan

Citations: 456

h-index: 13

Xihe Qiu

Citations: 125

h-index: 7

Teqi Hao

Citations: 24

h-index: 2

Chen Zhan

Citations: 4

h-index: 2

Yuxuan Fu

Citations: 3

h-index: 1

주도적 에이전트는 무엇을 말할지뿐만 아니라 개입 여부와 시기도 결정해야 합니다. 현재의 많은 시스템은 취약한 휴리스틱이나 무분별한 장시간 추론에 의존하는데, 이는 이득과 부담 간의 균형을 제어하기 어렵게 만듭니다. 우리는 이 문제를 비용 민감형 선택적 개입으로 정식화하고, 결정 이론적 게이트와 이중 프로세스 추론 아키텍처를 결합한 새로운 프레임워크인 PRISM을 제안합니다. 추론 시, 에이전트는 사용자의 수용 확률(보정됨)이 도움 누락과 오경보의 비대칭적 비용으로부터 도출된 임계값을 초과할 때만 개입합니다. '페스티나 렌테(Festina lente, 라틴어로 '천천히 서둘러라')'에서 영감을 받아, 우리는 수용 확률이 보정되고 비용에 기반한 임계값으로 게이팅을 수행하며, 결정 경계 근처에서만 반사실적 확인을 포함한 자원 집약적인 '느린(Slow) 모드'를 호출하여 모호하고 위험도가 높은 사례에 계산을 집중시킵니다. 학습에는 게이트 정렬 및 스키마 고정 증류(gate-aligned, schema-locked distillation)를 사용합니다. 전체 PRISM 파이프라인을 실행하는 교사 모델이 레이블이 없는 상호작용 기록에 대해 밀도 높고 실행 가능한 감독을 제공하며, 학생 모델은 개입 게이트와 명시적으로 분리된 응답 정책을 학습하여 조정 가능하고 감사 가능한 제어를 가능하게 합니다. ProactiveBench에서 PRISM은 강력한 기준 모델 대비 오경보를 22.78% 줄이고 F1 점수를 20.14% 향상시켰습니다. 이러한 결과는 원칙적인 결정 이론적 게이팅이 선택적 느린 추론 및 정렬된 증류와 결합될 때, 정확하고 계산 효율적이며 제어 가능한 주도적 에이전트를 생성함을 보여줍니다. 재현성을 돕기 위해 코드, 모델 및 리소스를 https://prism-festinalente.github.io/ 에 공개하며, 모든 실험은 오픈 소스인 ProactiveBench 벤치마크를 사용합니다.

Original Abstract

Proactive agents must decide not only what to say but also whether and when to intervene. Many current systems rely on brittle heuristics or indiscriminate long reasoning, which offers little control over the benefit-burden tradeoff. We formulate the problem as cost-sensitive selective intervention and present PRISM, a novel framework that couples a decision-theoretic gate with a dual-process reasoning architecture. At inference time, the agent intervenes only when a calibrated probability of user acceptance exceeds a threshold derived from asymmetric costs of missed help and false alarms. Inspired by festina lente (Latin: "make haste slowly"), we gate by an acceptance-calibrated, cost-derived threshold and invoke a resource-intensive Slow mode with counterfactual checks only near the decision boundary, concentrating computation on ambiguous and high-stakes cases. Training uses gate-aligned, schema-locked distillation: a teacher running the full PRISM pipeline provides dense, executable supervision on unlabeled interaction traces, while the student learns a response policy that is explicitly decoupled from the intervention gate to enable tunable and auditable control. On ProactiveBench, PRISM reduces false alarms by 22.78% and improves F1 by 20.14% over strong baselines. These results show that principled decision-theoretic gating, paired with selective slow reasoning and aligned distillation, yields proactive agents that are precise, computationally efficient, and controllable. To facilitate reproducibility, we release our code, models, and resources at https://prism-festinalente.github.io/; all experiments use the open-source ProactiveBench benchmark.

2 Citations

0 Influential

6.5 Altmetric

34.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!