2602.13320v1 Feb 10, 2026 cs.AI

도구 사용 LLM 에이전트에서의 정보 충실성: 모델 컨텍스트 프로토콜의 마팅게일 분석

Information Fidelity in Tool-Using LLM Agents: A Martingale Analysis of the Model Context Protocol

Cheston Tan

Citations: 102

h-index: 6

R. Wattenhofer

Citations: 3,080

h-index: 24

Flint Xiaofeng Fan

ETH Zürich

Citations: 213

h-index: 8

Y. Ong

Citations: 288

h-index: 9

대규모 언어 모델(LLM)로 구동되는 AI 에이전트가 중요한 의사 결정을 위해 외부 도구를 점점 더 많이 사용함에 따라, 중요한 신뢰성 문제가 발생합니다. 즉, 순차적인 도구 호출 과정에서 오류가 어떻게 전파되는가입니다. 본 연구에서는 모델 컨텍스트 프로토콜(MCP) 에이전트의 오류 누적을 분석하는 최초의 이론적 프레임워크를 제시하고, 누적 왜곡이 선형적으로 증가하며, 확률적으로 $O( ext{√}T)$로 제한되는 편차를 보인다는 것을 증명합니다. 이러한 집중 특성은 예측 가능한 시스템 동작을 보장하며, 지수적인 오류 발생 가능성을 배제합니다. 우리는 이산적인 사실 매칭과 연속적인 의미 유사성을 결합한 하이브리드 왜곡 지표를 개발하고, 순차적인 도구 상호 작용을 통한 오류 전파에 대한 마팅게일 집중 경계를 설정합니다. Qwen2-7B, Llama-3-8B, Mistral-7B 모델에 대한 실험 결과는 우리의 이론적 예측을 검증하며, 실제 왜곡이 선형 추세를 따르며, 편차가 일관되게 $O( ext{√}T)$ 범위 내에 있음을 보여줍니다. 주요 결과는 다음과 같습니다. 의미적 가중치는 왜곡을 80%까지 줄이며, 약 9단계마다 주기적으로 재접지(re-grounding)하는 것이 오류 제어에 충분합니다. 우리는 이러한 집중 보장을 신뢰할 수 있는 에이전트 시스템을 위한 실질적인 배포 원칙으로 변환합니다.

Original Abstract

As AI agents powered by large language models (LLMs) increasingly use external tools for high-stakes decisions, a critical reliability question arises: how do errors propagate across sequential tool calls? We introduce the first theoretical framework for analyzing error accumulation in Model Context Protocol (MCP) agents, proving that cumulative distortion exhibits linear growth and high-probability deviations bounded by $O(\sqrt{T})$. This concentration property ensures predictable system behavior and rules out exponential failure modes. We develop a hybrid distortion metric combining discrete fact matching with continuous semantic similarity, then establish martingale concentration bounds on error propagation through sequential tool interactions. Experiments across Qwen2-7B, Llama-3-8B, and Mistral-7B validate our theoretical predictions, showing empirical distortion tracks the linear trend with deviations consistently within $O(\sqrt{T})$ envelopes. Key findings include: semantic weighting reduces distortion by 80\%, and periodic re-grounding approximately every 9 steps suffices for error control. We translate these concentration guarantees into actionable deployment principles for trustworthy agent systems.

1 Citations

0 Influential

12 Altmetric

61.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!