2603.18718v1 Mar 19, 2026 cs.AI

MemMA: 다중 에이전트 추론과 실시간 자체 진화를 통한 메모리 주기 조화

MemMA: Coordinating the Memory Cycle through Multi-Agent Reasoning and In-Situ Self-Evolution

Zhiwei Zhang

Citations: 426

h-index: 9

Suhang Wang

Citations: 354

h-index: 10

Min Lin

Citations: 275

h-index: 10

Hui Liu

Citations: 429

h-index: 12

Xianfeng Tang

Citations: 714

h-index: 11

Hanqing Lu

Citations: 218

h-index: 6

Qi He

Citations: 565

h-index: 11

XiangRui Zhang

Citations: 12

h-index: 2

메모리 증강 LLM 에이전트는 장기 상호 작용을 지원하기 위해 외부 메모리 저장소를 사용하지만, 대부분의 기존 시스템은 메모리 생성, 검색 및 활용을 독립적인 하위 루틴으로 처리합니다. 이는 다음과 같은 두 가지 문제를 야기합니다. 첫째, 메모리 주기의 전방 경로에서 전략적 판단력이 부족하여, 메모리 생성 및 검색이 명시적인 전략적 추론보다는 로컬 휴리스틱에 의해 주도됩니다. 둘째, 후방 경로에서 다운스트림 실패가 메모리 저장소의 직접적인 수정으로 이어지는 경우가 드물어, 학습이 부족합니다. 이러한 문제를 해결하기 위해, 우리는 메모리 주기의 전방 및 후방 경로를 조율하는 플러그 앤 플레이 다중 에이전트 프레임워크인 MemMA를 제안합니다. 전방 경로에서 Meta-Thinker는 구조화된 지침을 생성하여 메모리 생성 시 Memory Manager를 안내하고, 반복적인 검색 시 Query Reasoner를 지시합니다. 후방 경로에서 MemMA는 메모리 생성 과정에서 실시간 자체 진화를 도입하여, 탐색 질문-답변 쌍을 생성하고, 현재 메모리를 검증하며, 메모리가 최종화되기 전에 실패를 수정하는 작업을 수행합니다. LoCoMo 데이터셋에 대한 광범위한 실험 결과, MemMA는 다양한 LLM 기반 모델에서 기존 시스템보다 일관되게 우수한 성능을 보이며, 세 가지 서로 다른 저장 백엔드를 플러그 앤 플레이 방식으로 개선할 수 있음을 확인했습니다. MemMA의 코드는 https://github.com/ventr1c/memma 에서 공개적으로 이용할 수 있습니다.

Original Abstract

Memory-augmented LLM agents maintain external memory banks to support long-horizon interaction, yet most existing systems treat construction, retrieval, and utilization as isolated subroutines. This creates two coupled challenges: strategic blindness on the forward path of the memory cycle, where construction and retrieval are driven by local heuristics rather than explicit strategic reasoning, and sparse, delayed supervision on the backward path, where downstream failures rarely translate into direct repairs of the memory bank. To address these challenges, we propose MemMA, a plug-and-play multi-agent framework that coordinates the memory cycle along both the forward and backward paths. On the forward path, a Meta-Thinker produces structured guidance that steers a Memory Manager during construction and directs a Query Reasoner during iterative retrieval. On the backward path, MemMA introduces in-situ self-evolving memory construction, which synthesizes probe QA pairs, verifies the current memory, and converts failures into repair actions before the memory is finalized. Extensive experiments on LoCoMo show that MemMA consistently outperforms existing baselines across multiple LLM backbones and improves three different storage backends in a plug-and-play manner. Our code is publicly available at https://github.com/ventr1c/memma.

3 Citations

0 Influential

26 Altmetric

133.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!