2601.10702v1 Jan 15, 2026 cs.CL

맥락적 의도를 활용한 에이전트 기억 시스템

Grounding Agent Memory in Contextual Intent

Priyanka Kargupta

Citations: 115

h-index: 6

Ruozhen Yang

Citations: 69

h-index: 2

Yucheng Jiang

Citations: 6

h-index: 2

Yueqi Jiang

Citations: 6

h-index: 2

Yunyi Zhang

Amazon

Citations: 894

h-index: 15

Jiawei Han

Citations: 56

h-index: 4

장기적인 목표 지향적 상호 작용에서 대규모 언어 모델을 활용하는 것은 여전히 어려운 과제입니다. 유사한 개체와 사실들이 다양한 잠재적 목표와 제약 조건 하에서 반복적으로 등장하며, 이는 기억 시스템이 맥락에 맞지 않는 정보를 검색하도록 유발하기 때문입니다. 본 논문에서는 STITCH (Structured Intent Tracking in Contextual History)라는 에이전트 기반 기억 시스템을 제안합니다. STITCH는 각 추적 단계에 구조화된 검색 키워드인 '맥락적 의도'를 연결하고, 현재 단계의 의도와 일치하는 과거 기록을 검색합니다. 맥락적 의도는 다음과 같은 간결한 신호를 제공하여 반복적인 언급을 명확하게 하고 간섭을 줄입니다. (1) 주제 세그먼트를 정의하는 현재의 잠재적 목표, (2) 행동 유형, (3) 관련 속성을 나타내는 중요한 개체 유형입니다. 추론 과정에서 STITCH는 의도와의 호환성을 기준으로 기억 조각을 필터링하고 우선순위를 지정하여, 의미적으로 유사하지만 맥락적으로 일치하지 않는 과거 기록을 억제합니다. 평가를 위해, 현실적이고 동적인 목표 지향적 추적에서 맥락 인지 검색을 위한 벤치마크인 CAME-Bench를 소개합니다. CAME-Bench와 LongMemEval에서 STITCH는 최첨단 성능을 달성했으며, 가장 강력한 기준 모델보다 35.6% 더 높은 성능을 보였으며, 특히 추적 길이가 증가할수록 성능 향상이 두드러졌습니다. 분석 결과, 의도 기반 색인은 검색 노이즈를 크게 줄이며, 견고한 장기 추론을 위한 의도 기반 기억 시스템을 지원한다는 것을 알 수 있었습니다.

Original Abstract

Deploying large language models in long-horizon, goal-oriented interactions remains challenging because similar entities and facts recur under different latent goals and constraints, causing memory systems to retrieve context-mismatched evidence. We propose STITCH (Structured Intent Tracking in Contextual History), an agentic memory system that indexes each trajectory step with a structured retrieval cue, contextual intent, and retrieves history by matching the current step's intent. Contextual intent provides compact signals that disambiguate repeated mentions and reduce interference: (1) the current latent goal defining a thematic segment, (2) the action type, and (3) the salient entity types anchoring which attributes matter. During inference, STITCH filters and prioritizes memory snippets by intent compatibility, suppressing semantically similar but context-incompatible history. For evaluation, we introduce CAME-Bench, a benchmark for context-aware retrieval in realistic, dynamic, goal-oriented trajectories. Across CAME-Bench and LongMemEval, STITCH achieves state-of-the-art performance, outperforming the strongest baseline by 35.6%, with the largest gains as trajectory length increases. Our analysis shows that intent indexing substantially reduces retrieval noise, supporting intent-aware memory for robust long-horizon reasoning.

2 Citations

1 Influential

7.5 Altmetric

41.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!