2602.23258v1 Feb 26, 2026 cs.AI

AgentDropoutV2: 테스트 시간 정제 또는 거부 가지치기를 통한 다중 에이전트 시스템 정보 흐름 최적화

AgentDropoutV2: Optimizing Information Flow in Multi-Agent Systems via Test-Time Rectify-or-Reject Pruning

Xuebo Liu

Citations: 477

h-index: 11

Yutong Wang

Citations: 113

h-index: 4

Siyuan Xiong

Citations: 107

h-index: 4

Miao Zhang

Citations: 181

h-index: 4

Min Zhang

Citations: 24

h-index: 3

Wenkang Zhou

Citations: 1

h-index: 1

Liang Ding

University of Sydney / JD Explore Academy

Citations: 5,343

h-index: 40

다중 에이전트 시스템(MAS)은 복잡한 추론에 뛰어난 성능을 보이지만, 개별 참여자가 생성하는 오류 정보의 연쇄적인 영향으로 인해 어려움을 겪습니다. 기존 솔루션은 종종 경직된 구조 설계 또는 비용이 많이 드는 미세 조정에 의존하여, 시스템의 적용 가능성과 적응성을 제한합니다. 본 논문에서는 재학습 없이 MAS의 정보 흐름을 동적으로 최적화하도록 설계된 테스트 시간 정제 또는 거부 가지치기 프레임워크인 AgentDropoutV2를 제안합니다. 제안하는 접근 방식은 능동적인 방화벽 역할을 하여, 에이전트의 출력을 가로채고, 검색 증강 정류기를 사용하여 오류 지표 풀을 기반으로 반복적으로 오류를 수정합니다. 이 메커니즘은 추출된 오류 패턴을 사전 지식으로 활용하여 잠재적인 오류를 정확하게 식별할 수 있도록 합니다. 수정할 수 없는 출력은 오류 전파를 방지하기 위해 가지치기되며, 백업 전략은 시스템의 무결성을 유지합니다. 광범위한 수학 벤치마크에서 수행된 실험 결과, AgentDropoutV2는 MAS의 작업 성능을 크게 향상시켰으며, 수학 벤치마크에서 평균 6.3%의 정확도 향상을 달성했습니다. 또한, 시스템은 강력한 일반화 및 적응성을 보여주며, 작업 난이도에 따라 정류 노력을 동적으로 조절하고, 문맥 인지 지표를 활용하여 다양한 오류 패턴을 해결합니다. 본 논문의 코드와 데이터셋은 https://github.com/TonySY2/AgentDropoutV2 에서 확인할 수 있습니다.

Original Abstract

While Multi-Agent Systems (MAS) excel in complex reasoning, they suffer from the cascading impact of erroneous information generated by individual participants. Current solutions often resort to rigid structural engineering or expensive fine-tuning, limiting their deployability and adaptability. We propose AgentDropoutV2, a test-time rectify-or-reject pruning framework designed to dynamically optimize MAS information flow without retraining. Our approach acts as an active firewall, intercepting agent outputs and employing a retrieval-augmented rectifier to iteratively correct errors based on a failure-driven indicator pool. This mechanism allows for the precise identification of potential errors using distilled failure patterns as prior knowledge. Irreparable outputs are subsequently pruned to prevent error propagation, while a fallback strategy preserves system integrity. Empirical results on extensive math benchmarks show that AgentDropoutV2 significantly boosts the MAS's task performance, achieving an average accuracy gain of 6.3 percentage points on math benchmarks. Furthermore, the system exhibits robust generalization and adaptivity, dynamically modulating rectification efforts based on task difficulty while leveraging context-aware indicators to resolve a wide spectrum of error patterns. Our code and dataset are released at https://github.com/TonySY2/AgentDropoutV2.

1 Citations

0 Influential

56.661022550876 Altmetric

284.3 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!