2605.28104v1 May 27, 2026 cs.AI

Defending LLM-based Multi-Agent Systems Against Cooperative Attacks with Sentence-Level Rectification

Zhi Zheng
Zhi Zheng
Citations: 174
h-index: 8
Yong Chen
Yong Chen
Citations: 11
h-index: 2
Tong Xu
Tong Xu
Citations: 65
h-index: 5
Enhong Chen
Enhong Chen
Citations: 1,115
h-index: 14
Ziwei Zhao
Ziwei Zhao
Citations: 126
h-index: 6
Jielun Zhao
Jielun Zhao
Citations: 0
h-index: 0
Wenjun Xue
Wenjun Xue
Citations: 23
h-index: 1
Yao Luo
Yao Luo
Citations: 2
h-index: 1

Recent years have witnessed the rapid development of Large Language Model-based Multi-Agent Systems (MAS), which excel at collaborative decision-making and complex problem-solving. However, malicious agents in MAS may inject misinformation to mislead other agents and disrupt system performance, giving rise to a new research direction that focuses on attack mechanisms and defense strategies in MAS. Prior studies largely assume malicious agents act independently and investigate the corresponding defense strategies. However, we argue that malicious agents may exhibit collaborative behaviors, enabling more effective attacks through internal information exchange. In this paper, we propose an adaptive cooperative attack framework, where malicious agents autonomously coordinate and dynamically adjust their attack strategies through multi-round interactions. Furthermore, we introduce Sentence-Level Trustworthiness Analysis and Rectification (STAR), a defense framework that identifies and rectifies misleading information at the sentence level within agent communications. Our experiments show that cooperative attacks lead to a significantly larger degradation in task success rate than independent attacks, resulting in a relative drop of 5.34\%. Meanwhile, STAR effectively mitigates both cooperative and independent threats and improves task success rate by an average of 36.76\%. The code is available at https://github.com/smoooom/STAR.

0 Citations
0 Influential
27 Altmetric
135.0 Score
Original PDF
0

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!