2606.09071v1 Jun 08, 2026 cs.AI

REFLECT: Intervention-Supported Error Attribution for Silent Failures in LLM Agent Traces

Tung Sum Thomas Kwok
Tung Sum Thomas Kwok
Citations: 30
h-index: 2
Xiaofeng Lin
Xiaofeng Lin
Citations: 76
h-index: 4
Guang Cheng
Guang Cheng
Citations: 16
h-index: 2
Charles Fleming
Charles Fleming
Citations: 23
h-index: 3
Daniel Guo
Daniel Guo
Citations: 1,117
h-index: 3
Sahil Arun Nale
Sahil Arun Nale
Citations: 0
h-index: 0
Yingxun Wang
Yingxun Wang
Citations: 23
h-index: 3

Large language model (LLM) agents now solve complex tasks through long plan-and-execution traces, yet the ability to locate errors in a completed traces still lags far behind, especially in the \emph{silent failure} regime. Existing approaches predict suspect steps via classifiers or LLM judges, or recover correct answers via retry, but none feed the intervention outcome back to \emph{refine the attribution itself}. We propose \methodname, a method that closes this gap by diagnosing a candidate error step, testing it through controlled replay with a diagnosis-specific patch, and using the verified outcome flip as contrastive evidence to refine the final attribution. Across four localization benchmarks spanning multi-hop reasoning across domains, \methodname achieves the highest localization accuracy among same-auditor methods across all four benchmarks, with the largest gains on structured tool-use traces, while providing actionable localization even when ground-truth answers are unavailable.

0 Citations
0 Influential
2 Altmetric
10.0 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!