2606.13003v1 Jun 11, 2026 cs.AI

The Illusion of Multi-Agent Advantage

Zixuan Ke
Yifei Ming
Prathyusha Jwalapuram
Shafiq Joty
Hehai Lin
Sudong Wang
Chengwei Qin
Chuyuan Li
Giuseppe Carenini
Fangkai Jiao
Nanyang Technological University

Prevailing wisdom holds that Multi-Agent Systems (MAS) are superior to Single-Agent Systems (SAS), citing advantages such as context protection, parallel processing, and distributed decision-making. However, empirical support for this claim rests primarily on comparisons with SAS baselines on benchmarks that prioritize isolated reasoning tasks, which do not adequately assess these advantages. Focusing on automatically generated MAS, which are designed to generalize better than manually designed counterparts, we perform a rigorous, systematic evaluation against a SAS baseline, specifically Chain-of-Thought with Self-Consistency (CoT-SC). Across traditional reasoning datasets and tasks with interactive multi-step workflows (e.g., BrowseComp-Plus), we demonstrate that automatic MAS consistently underperform CoT-SC despite being up to 10x more expensive. To isolate these failures from limitations inherent to task structure, we introduce a diagnostic synthetic dataset tailored for MAS, featuring explicit task decomposition, context separation, and parallelization potential. On this dataset, expert-architected MAS consistently outperform automatically generated architectures in both raw performance and cost-efficiency, demonstrating that existing evaluation frameworks mask critical architectural gaps and inefficiencies of complex MAS by failing to account for the marginal utility of increased computational cost. Critically, systematic deconstruction of the generated MAS architectures reveals that current automated design paradigms produce architectural bloat: superficial complexity that does not translate into functional utility, exposing a fundamental misalignment with multi-agent principles.
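
For readers unfamiliar with the CoT-SC baseline named in the abstract, the following is a minimal sketch of self-consistency as it is commonly described: sample several independent answers and return the majority vote. It is not the authors' implementation. The names `self_consistency`, `make_stub_sampler`, and `accuracy_per_cost` are hypothetical, the stub sampler stands in for a real model call, and "accuracy per unit cost" is only one assumed way to relate performance to spend; the abstract does not specify the metric used in the paper.

```python
from collections import Counter
from typing import Callable, Iterable, List


def self_consistency(sample_answer: Callable[[str], str],
                     question: str,
                     n_samples: int = 5) -> str:
    """Sample n_samples answers and return the most frequent one.

    sample_answer is a hypothetical callable that would wrap one
    chain-of-thought model call and return only the final answer.
    """
    if n_samples < 1:
        raise ValueError("n_samples must be at least 1")
    answers: List[str] = [sample_answer(question) for _ in range(n_samples)]
    # Counter.most_common breaks ties by first occurrence in the samples.
    return Counter(answers).most_common(1)[0][0]


def make_stub_sampler(fixed_answers: Iterable[str]) -> Callable[[str], str]:
    """Hypothetical stand-in for a model: replays a fixed answer sequence."""
    it = iter(fixed_answers)
    return lambda question: next(it)


def accuracy_per_cost(accuracy: float, cost: float) -> float:
    """Assumed efficiency measure: accuracy divided by cost (any fixed unit)."""
    if cost <= 0:
        raise ValueError("cost must be positive")
    return accuracy / cost


if __name__ == "__main__":
    # Illustrative values only; none of these numbers come from the paper.
    sampler = make_stub_sampler(["42", "41", "42", "40", "42"])
    print(self_consistency(sampler, "toy question", n_samples=5))
    print(accuracy_per_cost(0.5, 10.0))
```

Under this assumed measure, a system that costs 10x more must deliver roughly 10x the accuracy to break even, which is the kind of marginal-utility accounting the abstract argues existing evaluations omit.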
