2605.28732v1 May 27, 2026 cs.CL

MemTrace: Tracing and Attributing Errors in Large Language Model Memory Systems

Guanglin Li
Guanglin Li
Citations: 13
h-index: 2
Yunzhi Yao
Yunzhi Yao
Zhejiang University;Shandong University
Citations: 3,270
h-index: 22
Xinle Deng
Xinle Deng
Citations: 141
h-index: 6
Yuanqiang Yu
Yuanqiang Yu
Citations: 52
h-index: 4
Baohua Dong
Baohua Dong
Citations: 66
h-index: 4
Hangcheng Zhu
Hangcheng Zhu
Citations: 44
h-index: 3
Buqiang Xu
Buqiang Xu
Citations: 15
h-index: 2
Jizhan Fang
Jizhan Fang
Citations: 124
h-index: 4
R. Zhong
R. Zhong
Citations: 28
h-index: 2
Hujin Peng
Hujin Peng
Citations: 66
h-index: 2
Haoliang Cao
Haoliang Cao
Citations: 0
h-index: 0
Yuan Yuan
Yuan Yuan
Citations: 72
h-index: 2
Ziqin Ma
Ziqin Ma
Citations: 0
h-index: 0
Rui'ang Hu
Rui'ang Hu
Citations: 0
h-index: 0
Ningyu Zhang
Ningyu Zhang
Citations: 127
h-index: 2
Xiaobe Lu
Xiaobe Lu
Citations: 37
h-index: 1
Yanzhe Wu
Yanzhe Wu
Citations: 0
h-index: 0
Junjie Guo
Junjie Guo
Citations: 48
h-index: 2

Memory is essential for enabling large language models to support long-horizon reasoning, yet existing memory systems remain unreliable and difficult to debug. Tracing memory's dynamic evolution is crucial to understand how information is synthesized, propagated, or corrupted over time. In this work, we study the new problem of error tracing and attribution in LLM memory systems. We propose a novel framework that transforms memory pipelines into executable memory evolution graphs, enabling fine-grained tracing of operational information flow. We then construct MemTraceBench, a benchmark collected from representative memory systems such as Long-Context, RAG, Mem0, and EverMemOS, to systematically study memory failure modes. We further introduce an automatic attribution method that iteratively traces operation subgraphs to pinpoint the root cause of any failed case. Our analysis reveals that memory failures are systematic, stemming from operation-level issues like information loss and retrieval misalignment. Crucially, we leverage these fine-grained attribution signals to guide downstream prompt optimization, establishing a closed-loop system that automatically corrects faults and boosts end-task performance by up to 7.62%. Code will be released at https://github.com/zjunlp/MemTrace.

0 Citations
0 Influential
45.451858789481 Altmetric
227.3 Score
Original PDF
17

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!