2606.06090v1 Jun 04, 2026 cs.AI

Beyond Semantic Organization: Memory as Execution State Management for Long-Horizon Agents

Hao Wu
Hao Wu
Citations: 66
h-index: 2
Baotong Lu
Baotong Lu
Citations: 175
h-index: 5
Zewen Jin
Zewen Jin
Citations: 42
h-index: 3
Haibin Lai
Haibin Lai
Citations: 11
h-index: 2
Chuyu Han
Chuyu Han
Citations: 0
h-index: 0
Yaoqi Chen
Yaoqi Chen
Citations: 25
h-index: 3
Yuru Feng
Yuru Feng
Citations: 26
h-index: 3
Qianxi Zhang
Qianxi Zhang
Citations: 686
h-index: 7
Menghao Li
Menghao Li
Citations: 68
h-index: 4
Xinjiang Wang
Xinjiang Wang
Citations: 2,437
h-index: 15
Z. Wang
Z. Wang
Citations: 10
h-index: 2
Shusen Xu
Shusen Xu
Citations: 5
h-index: 2
Zengzhong Li
Zengzhong Li
Citations: 368
h-index: 5
Cheng Li
Cheng Li
Citations: 132
h-index: 2
Qi Chen
Qi Chen
Citations: 12
h-index: 2

LLM-based agents increasingly tackle long-horizon tasks with interdependent decisions, where each action reshapes future constraints and intermediate errors can cascade. Existing RAG and agent memory systems organize histories by semantic similarity, retrieving content-relevant entries at decision time. We argue that this design mismatches execution-state dependencies: it fragments decision trajectories and mixes valid and erroneous traces, hindering coherent state reconstruction and error isolation. We propose MAGE (Memory as Agent-Guided Exploration), an active execution-state manager that stores interactions in a hierarchical state tree. The agent derives its state from the active root-to-current path, combining subgoal summaries, recent traces, and hints from prior branches. Four coupled operations maintain the tree: Grow records new traces, Compress summarizes completed subgoals, Maintain validates summaries, and Revise restores a target boundary and resumes on a new branch. This design bounds context growth while preserving state integrity and isolating flawed segments from the active path. Experiments on MemoryArena show that MAGE improves the average task success rate by 7.8--20.4 pp over baselines, while reducing token consumption by 55.1%.

0 Citations
0 Influential
7.5 Altmetric
37.5 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!