2602.03689v1 Feb 03, 2026 cs.CL

재검색기 재고: 견고한 검색 증강 생성을 위한 경계 인식 증거 선택

Rethinking the Reranker: Boundary-Aware Evidence Selection for Robust Retrieval-Augmented Generation

Pengrui Han

Citations: 147

h-index: 8

Xueqiang Xu

Citations: 124

h-index: 4

Siru Ouyang

Citations: 591

h-index: 9

Ming Zhong

Citations: 175

h-index: 6

Pengcheng Jiang

Citations: 77

h-index: 4

Sai Wang

Citations: 968

h-index: 9

Jiajun Fan

Citations: 78

h-index: 5

Heng Wang

Citations: 14

h-index: 1

Yizhu Jiao

Citations: 1,256

h-index: 15

Chen Huang

Citations: 39

h-index: 2

Peiran Li

Citations: 6

h-index: 1

Jiaxin Huang

Citations: 507

h-index: 8

Ge Liu

Citations: 32

h-index: 4

Heng Ji

Citations: 16

h-index: 2

Jiawei Han

Citations: 226

h-index: 6

Jiashuo Sun

DAMO Academy, Alibaba Group

Citations: 997

h-index: 10

검색 증강 생성(RAG) 시스템은 현실적인 검색 노이즈 환경에서도 여전히 취약하며, 필요한 증거가 상위 K개 결과에 포함되어 있더라도 문제가 발생합니다. 주요 원인은 검색기와 재검색기가 관련성만을 최적화하여, 종종 사소하고 답을 직접적으로 드러내는 구절이나, 질문에 답하기 위해 필요한 핵심 정보를 포함하지 않는 증거를 선택한다는 것입니다. 이러한 증거는 생성기에 적합하지 않을 수 있습니다. 본 논문에서는 재검색기를 생성기의 '황금 적정 영역'을 목표로 하는 경계 인식 증거 선택기로 재정의하는 BAR-RAG을 제안합니다. 즉, 생성기가 너무 쉽게 또는 근본적으로 답할 수 없는 증거가 아닌, 추론에 적합하면서도 충분히 도전적인 증거를 선택합니다. BAR-RAG은 생성기의 피드백을 활용한 강화 학습을 통해 선택기를 훈련하며, 두 단계 파이프라인을 통해 유도된 증거 분포 하에서 생성기를 미세 조정하여 훈련과 추론 간의 분포 불일치를 완화합니다. 지식 집약적인 질문 답변 벤치마크에서의 실험 결과, BAR-RAG은 노이즈가 많은 검색 환경에서 전체 성능을 꾸준히 향상시키며, 강력한 RAG 및 재검색 기반 모델보다 평균 10.3%의 성능 향상을 달성하고, 견고성을 크게 개선합니다. 관련 코드는 https://github.com/GasolSun36/BAR-RAG 에서 공개적으로 이용 가능합니다.

Original Abstract

Retrieval-Augmented Generation (RAG) systems remain brittle under realistic retrieval noise, even when the required evidence appears in the top-K results. A key reason is that retrievers and rerankers optimize solely for relevance, often selecting either trivial, answer-revealing passages or evidence that lacks the critical information required to answer the question, without considering whether the evidence is suitable for the generator. We propose BAR-RAG, which reframes the reranker as a boundary-aware evidence selector that targets the generator's Goldilocks Zone -- evidence that is neither trivially easy nor fundamentally unanswerable for the generator, but is challenging yet sufficient for inference and thus provides the strongest learning signal. BAR-RAG trains the selector with reinforcement learning using generator feedback, and adopts a two-stage pipeline that fine-tunes the generator under the induced evidence distribution to mitigate the distribution mismatch between training and inference. Experiments on knowledge-intensive question answering benchmarks show that BAR-RAG consistently improves end-to-end performance under noisy retrieval, achieving an average gain of 10.3 percent over strong RAG and reranking baselines while substantially improving robustness. Code is publicly avaliable at https://github.com/GasolSun36/BAR-RAG.

1 Citations

0 Influential

34.431471805599 Altmetric

173.2 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!