2605.26754v1 May 26, 2026 cs.CR

Cordon-MAS: Defending RAG against Knowledge Poisoning via Information-Flow Control

Hongzhi Wang

Citations: 6

h-index: 2

Wenpeng Xing

Citations: 205

h-index: 10

Zhengtao Yu

Citations: 20

h-index: 2

Xuyang Teng

Citations: 6

h-index: 2

Meng Han

Citations: 78

h-index: 3

Shuguang Xiong

Citations: 0

h-index: 0

Gaolei Li

Citations: 0

h-index: 0

Retrieval-augmented generation (RAG) increasingly underpins high-stakes applications, yet remains vulnerable to Confundo-style poisoning where adversarially optimized documents manipulate generated outputs. Existing defenses assume that detecting poisoned evidence prevents harm. We show this assumption is incorrect: models exhibit a monitoring-control gap -- they can detect contradictions in retrieved evidence yet still act on poisoned claims. We introduce the Cordon Principle -- no agent capable of final synthesis may access untrusted natural-language evidence -- and realize it through CORDON-MAS, a compartmentalized framework that enforces this principle architecturally by separating evidence extraction, cross-source audit, and answer synthesis into agents with asymmetric memory privileges. Across five BEIR datasets, CORDON-MAS reduces attack success rate by 92.4\% relative to undefended RAG. This reframes RAG poisoning from a detection problem to an information-flow control problem.

0 Citations

0 Influential

5 Altmetric

25.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!