2605.26754v1 May 26, 2026 cs.CR

Cordon-MAS: Defending RAG against Knowledge Poisoning via Information-Flow Control

Hongzhi Wang
Hongzhi Wang
Citations: 6
h-index: 2
Wenpeng Xing
Wenpeng Xing
Citations: 205
h-index: 10
Zhengtao Yu
Zhengtao Yu
Citations: 20
h-index: 2
Xuyang Teng
Xuyang Teng
Citations: 6
h-index: 2
Meng Han
Meng Han
Citations: 78
h-index: 3
Shuguang Xiong
Shuguang Xiong
Citations: 0
h-index: 0
Gaolei Li
Gaolei Li
Citations: 0
h-index: 0

Retrieval-augmented generation (RAG) increasingly underpins high-stakes applications, yet remains vulnerable to Confundo-style poisoning where adversarially optimized documents manipulate generated outputs. Existing defenses assume that detecting poisoned evidence prevents harm. We show this assumption is incorrect: models exhibit a monitoring-control gap -- they can detect contradictions in retrieved evidence yet still act on poisoned claims. We introduce the Cordon Principle -- no agent capable of final synthesis may access untrusted natural-language evidence -- and realize it through CORDON-MAS, a compartmentalized framework that enforces this principle architecturally by separating evidence extraction, cross-source audit, and answer synthesis into agents with asymmetric memory privileges. Across five BEIR datasets, CORDON-MAS reduces attack success rate by 92.4\% relative to undefended RAG. This reframes RAG poisoning from a detection problem to an information-flow control problem.

0 Citations
0 Influential
5 Altmetric
25.0 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!