2604.02734v1 Apr 03, 2026 cs.AI

진전과 실현 가능성의 조화: 장기 계획 LLM 에이전트를 위한 신경-기호 양방향 메모리 프레임워크

Aligning Progress and Feasibility: A Neuro-Symbolic Dual Memory Framework for Long-Horizon LLM Agents

Yangtao Chen

Citations: 56

h-index: 4

Bin Wen

Citations: 12

h-index: 2

Ruoxuan Zhang

Citations: 19

h-index: 2

Hongxia Xie

Citations: 3

h-index: 1

Lan-Zhe Guo

Citations: 66

h-index: 3

대규모 언어 모델(LLM)은 임베디드 조작 및 웹 상호 작용과 같은 장기 의사 결정 작업에서 강력한 잠재력을 보여주었습니다. 그러나 에이전트는 복잡한 환경에서 끊임없는 시행착오의 반복에 빠지거나 주요 목표에서 벗어나는 경우가 많습니다. 우리는 이러한 실패의 원인을 전반적인 '진전 왜곡'과 지역적인 '실현 가능성 위반'이라는 두 가지 근본적인 오류로 규정합니다. 기존 방법은 일반적으로 단일 패러다임을 사용하여 이 두 가지 문제를 동시에 해결하려고 시도합니다. 그러나 이 두 가지 과제는 근본적으로 다릅니다. 전자는 모호한 의미론적 계획에 의존하는 반면, 후자는 엄격한 논리적 제약 조건과 상태 검증을 요구합니다. 이러한 단일 패러다임 접근 방식의 고유한 한계는 기존 모델이 장기 과제를 처리하는 데 있어 근본적인 어려움을 야기합니다. 이러한 통찰력을 바탕으로, 우리는 의미론적 진전 지침과 논리적 실현 가능성 검증을 명시적으로 분리하는 신경-기호 양방향 메모리 프레임워크를 제안합니다. 구체적으로, 추론 단계에서 이 프레임워크는 두 가지 메모리 메커니즘을 동시에 호출합니다. 한편, 신경망 기반의 '진전 메모리'는 성공적인 경로에서 의미론적 청사진을 추출하여 전반적인 작업 진행을 안내합니다. 다른 한편, 기호 논리에 기반한 '실현 가능성 메모리'는 실패한 전환에서 합성된 실행 가능한 Python 검증 함수를 사용하여 엄격한 논리적 검증을 수행합니다. 실험 결과, 이 방법은 ALFWorld, WebShop 및 TextCraft에서 기존의 경쟁적인 기준 모델보다 훨씬 뛰어난 성능을 보이며, 동시에 잘못된 액션 비율과 평균 경로 길이를 크게 줄입니다.

Original Abstract

Large language models (LLMs) have demonstrated strong potential in long-horizon decision-making tasks, such as embodied manipulation and web interaction. However, agents frequently struggle with endless trial-and-error loops or deviate from the main objective in complex environments. We attribute these failures to two fundamental errors: global Progress Drift and local Feasibility Violation. Existing methods typically attempt to address both issues simultaneously using a single paradigm. However, these two challenges are fundamentally distinct: the former relies on fuzzy semantic planning, while the latter demands strict logical constraints and state validation. The inherent limitations of such a single-paradigm approach pose a fundamental challenge for existing models in handling long-horizon tasks. Motivated by this insight, we propose a Neuro-Symbolic Dual Memory Framework that explicitly decouples semantic progress guidance from logical feasibility verification. Specifically, during the inference phase, the framework invokes both memory mechanisms synchronously: on one hand, a neural-network-based Progress Memory extracts semantic blueprints from successful trajectories to guide global task advancement; on the other hand, a symbolic-logic-based Feasibility Memory utilizes executable Python verification functions synthesized from failed transitions to perform strict logical validation. Experiments demonstrate that this method significantly outperforms existing competitive baselines on ALFWorld, WebShop, and TextCraft, while drastically reducing the invalid action rate and average trajectory length.

1 Citations

1 Influential

2 Altmetric

13.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!