2604.23783v1 Apr 26, 2026 cs.IR

S2G-RAG: 반복적인 검색 증강 질의응답을 위한 구조화된 충분성 및 격차 판단

S2G-RAG: Structured Sufficiency and Gap Judging for Iterative Retrieval-Augmented QA

Minghan Li

Citations: 32

h-index: 3

Guodong Zhou

Citations: 24

h-index: 3

Chao Zhang

Citations: 6

h-index: 1

Junjie Zou

Citations: 14

h-index: 2

Xinxuan Lv

Citations: 38

h-index: 3

검색 증강 생성(RAG)은 언어 모델을 외부 증거에 기반하도록 하지만, 다단계 질의응답은 반복적인 파이프라인이 다음에 무엇을 검색해야 하는지, 그리고 현재 이용 가능한 증거가 충분한지 여부를 제어해야 하기 때문에 여전히 어렵습니다. 실제로 시스템은 불완전한 증거 체인으로부터 답변을 내리거나, 이후 검색 및 추론을 방해하는 중복되거나 주의를 분산시키는 텍스트를 축적할 수 있습니다. 본 논문에서는 명시적인 제어기인 S2G-Judge를 갖춘 반복적인 프레임워크인 S2G-RAG(Structured Sufficiency and Gap-judging RAG)을 제안합니다. 각 단계에서 S2G-Judge는 현재 증거 메모리가 답변을 지원하는지 예측하고, 그렇지 않은 경우 누락된 정보를 설명하는 구조화된 격차 항목을 출력합니다. 이러한 격차 항목은 다음 검색 쿼리로 매핑되어 안정적인 다단계 검색 경로를 생성합니다. 노이즈 축적을 줄이기 위해 S2G-RAG는 검색된 문서에서 관련 문장의 간결한 집합을 추출하여 문장 수준의 증거 컨텍스트를 유지합니다. TriviaQA, HotpotQA 및 2WikiMultiHopQA에 대한 실험 결과, S2G-RAG는 다단계 질의응답 성능과 다단계 검색 환경에서의 견고성을 향상시키는 것으로 나타났습니다. 또한 S2G-RAG는 검색 엔진을 수정하거나 생성기를 재학습하지 않고도 기존 RAG 파이프라인에 가벼운 구성 요소로 통합될 수 있습니다.

Original Abstract

Retrieval-Augmented Generation (RAG) grounds language models in external evidence, but multi-hop question answering remains difficult because iterative pipelines must control what to retrieve next and when the available evidence is adequate. In practice, systems may answer from incomplete evidence chains, or they may accumulate redundant or distractor-heavy text that interferes with later retrieval and reasoning. We propose S2G-RAG (Structured Sufficiency and Gap-judging RAG), an iterative framework with an explicit controller, S2G-Judge. At each turn, S2G-Judge predicts whether the current evidence memory supports answering and, if not, outputs structured gap items that describe the missing information. These gap items are then mapped into the next retrieval query, producing stable multi-turn retrieval trajectories. To reduce noise accumulation, S2G-RAG maintains a sentence-level Evidence Context by extracting a compact set of relevant sentences from retrieved documents. Experiments on TriviaQA, HotpotQA, and 2WikiMultiHopQA show that S2G-RAG improves multi-hop QA performance and robustness under multi-turn retrieval. Furthermore, S2G-RAG can be integrated into existing RAG pipelines as a lightweight component, without modifying the search engine or retraining the generator.

0 Citations

0 Influential

1.5 Altmetric

7.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!