2604.08963v2 Apr 10, 2026 cs.MA

정렬된 에이전트, 편향된 군집: 다중 에이전트 시스템에서 편향 증폭 현상 측정

Aligned Agents, Biased Swarm: Measuring Bias Amplification in Multi-Agent Systems

Keyu Li

Citations: 42

h-index: 3

Dequan Wang

Citations: 17

h-index: 1

Jin Gao

Citations: 223

h-index: 5

다중 에이전트 시스템(MAS)은 점점 더 복잡한 워크플로우에 활용되고 있지만, 특히 편향의 누적과 같은 시스템의 특성에 대한 이해는 여전히 부족합니다. 실제 MAS는 분석하기 너무 복잡하기 때문에, 윤리적 안정성을 평가하려면 먼저 시스템의 기본 작동 원리를 분리해야 합니다. 본 연구에서는 기본적인 MAS 토폴로지와 피드백 루프가 편향에 미치는 영향을 조사하는 기초적인 실증 연구를 수행했습니다. 다중 에이전트 협력이 자연스럽게 편향을 완화한다는 가정과는 달리, 우리는 구조화된 워크플로우가 마치 메아리 방과 같이 작용하여 미미한 확률적 편향을 시스템적 양극화로 증폭시킨다고 가설을 설정했습니다. 이를 평가하기 위해, 개별 모델의 중립성을 우회하여 인구 집단 간의 강제 비교 판단을 통해 편향을 평가하는 오픈형 벤치마크인 Discrim-Eval-Open을 소개합니다. 다양한 구조에서의 편향 전파를 분석한 결과, 복잡한 아키텍처가 종종 편향을 완화하는 대신 악화시키는 경향이 있음을 확인했습니다. 개별 에이전트가 중립적으로 작동하는 경우에도 시스템적 증폭이 관찰되었으며, 순수하게 객관적인 정보를 주입하면 양극화가 극적으로 가속화되는 '트리거 취약성'을 확인했습니다. 우리는 고급 군집 복잡성을 제거하여 기본적인 역학을 연구함으로써, 중요한 기준점을 확립했습니다. 즉, 구조적 복잡성이 윤리적 안정성을 보장하지는 않습니다. 저희 코드는 https://github.com/weizhihao1/MAS-Bias 에서 확인할 수 있습니다.

Original Abstract

While Multi-Agent Systems (MAS) are increasingly deployed for complex workflows, their emergent properties-particularly the accumulation of bias-remain poorly understood. Because real-world MAS are too complex to analyze entirely, evaluating their ethical robustness requires first isolating their foundational mechanics. In this work, we conduct a baseline empirical study investigating how basic MAS topologies and feedback loops influence prejudice. Contrary to the assumption that multi-agent collaboration naturally dilutes bias, we hypothesize that structured workflows act as echo chambers, amplifying minor stochastic biases into systemic polarization. To evaluate this, we introduce Discrim-Eval-Open, an open-ended benchmark that bypasses individual model neutrality through forced comparative judgments across demographic groups. Analyzing bias cascades across various structures reveals that architectural sophistication frequently exacerbates bias rather than mitigating it. We observe systemic amplification even when isolated agents operate neutrally, and identify a 'Trigger Vulnerability' where injecting purely objective context drastically accelerates polarization. By stripping away advanced swarm complexity to study foundational dynamics, we establish a crucial baseline: structural complexity does not guarantee ethical robustness. Our code is available at https://github.com/weizhihao1/MAS-Bias.

0 Citations

0 Influential

31.45879734614 Altmetric

157.3 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!