2604.17337v1 Apr 19, 2026 cs.AI

AutoSearch: 강화 학습을 이용한 효율적인 에이전트 기반 RAG 시스템을 위한 적응적 검색 깊이

AutoSearch: Adaptive Search Depth for Efficient Agentic RAG via Reinforcement Learning

Guojun Yin

Citations: 265

h-index: 8

Wei Lin

Citations: 257

h-index: 8

Xiaohan Wang

Citations: 101

h-index: 6

Jiajun Chai

Citations: 116

h-index: 6

Songjun Tu

Citations: 256

h-index: 9

Qichao Zhang

Citations: 2,837

h-index: 27

Yaocheng Zhang

Citations: 72

h-index: 4

Jingbo Sun

Citations: 44

h-index: 5

W. Chong

Citations: 58

h-index: 5

Dongbin Zhao

Citations: 387

h-index: 10

에이전트 기반의 검색 증강 생성 (RAG) 시스템은 대규모 언어 모델 (LLM)이 외부 검색 도구와의 다단계 상호 작용을 통해 복잡한 문제를 해결하도록 지원합니다. 그러나 이러한 다단계 상호 작용은 종종 불필요한 검색 단계를 포함하여 상당한 계산 비용과 지연 시간을 초래합니다. 기존 연구에서는 비용을 줄이기 위해 검색 깊이 (즉, 검색 단계 수)를 제한했지만, 이는 종종 복잡한 질문에 대한 충분한 탐색을 저해합니다. 이에 대한 해결책으로, 우리는 먼저 검색 깊이가 정확도에 미치는 영향을 조사하고, 질문의 복잡성과 에이전트의 능력을 동시에 고려하여 정확도와 효율성 간의 균형을 이루는 최소 충분한 검색 깊이를 찾습니다. 또한, 우리는 각 검색 단계를 스스로 생성된 중간 답변을 통해 평가하는 강화 학습 (RL) 프레임워크인 AutoSearch를 제안합니다. AutoSearch는 자기 답변 메커니즘을 통해 최소 충분한 검색 깊이를 식별하고, 효율적인 검색을 장려하기 위해 이를 달성하면 보상을 제공하고, 과도한 검색에는 패널티를 부여합니다. 또한, 검색 행동을 안정화하고 복잡한 질문에 대한 답변 품질을 향상시키기 위한 보상 메커니즘이 도입되었습니다. 다양한 벤치마크에 대한 광범위한 실험 결과, AutoSearch는 우수한 정확도-효율성 균형을 달성하며, 과도한 검색을 완화하면서도 검색 품질을 유지하는 것으로 나타났습니다.

Original Abstract

Agentic retrieval-augmented generation (RAG) systems enable large language models (LLMs) to solve complex tasks through multi-step interaction with external retrieval tools. However, such multi-step interaction often involves redundant search steps, incurring substantial computational cost and latency. Prior work limits search depth (i.e., the number of search steps) to reduce cost, but this often leads to underexploration of complex questions. To address this, we first investigate how search depth affects accuracy and find a minimal sufficient search depth that defines an accuracy-efficiency trade-off, jointly determined by question complexity and the agent's capability. Furthermore, we propose AutoSearch, a reinforcement learning (RL) framework that evaluates each search step via self-generated intermediate answers. By a self-answering mechanism, AutoSearch identifies the minimal sufficient search depth and promotes efficient search by rewarding its attainment while penalizing over-searching. In addition, reward mechanisms are introduced to stabilize search behavior and improve answer quality on complex questions. Extensive experiments on multiple benchmarks show that AutoSearch achieves a superior accuracy-efficiency trade-off, alleviating over-searching while preserving search quality.

0 Citations

0 Influential

13.5 Altmetric

67.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!