2605.25389v1 May 25, 2026 cs.CR

Evo-Attacker: Memory-Augmented Reinforcement Learning for Long-Horizon Tool Attacks on LLM-MAS

Chaozhuo Li
Chaozhuo Li
Citations: 364
h-index: 11
Jinyu Hou
Jinyu Hou
Citations: 61
h-index: 4
Litian Zhang
Litian Zhang
Citations: 410
h-index: 11
Yiming Hei
Yiming Hei
Citations: 542
h-index: 12
Bingyu Yan
Bingyu Yan
Citations: 78
h-index: 2
Xiaoming Zhang
Xiaoming Zhang
Citations: 174
h-index: 6
Ziyi Zhou
Ziyi Zhou
Citations: 275
h-index: 8

While Large Language Model-based Multi-Agent Systems (LLM-MAS) demonstrate remarkable capabilities in solving complex tasks by orchestrating specialized agents and external tools, the implicit trust in tool outputs creates a critical attack surface. Existing tool attacks are limited by domain specificity or fixed and static templates. To address these challenges, we propose Evo-Attacker, which formulates the tool attack as a self-evolving, memory-augmented reinforcement learning process. Evo-Attacker constructs a dynamic attack memory and employs deliberative reasoning to retrieve adversarial patterns and strategize modifying interventions at critical moments. Furthermore, we introduce Attack-Flow GRPO to optimize intermediate reasoning steps via terminal outcomes, addressing the long-horizon credit assignment challenge. Comprehensive experiments demonstrate that Evo-Attacker consistently outperforms baselines, highlighting its generalization and evolutionary capabilities and the urgent need for defensive tool safeguards.

0 Citations
0 Influential
6 Altmetric
30.0 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!