2605.26902v1 May 26, 2026 cs.IR

ICICLE: Expanding Retrieval with In-Context Documents

Kuan-Yu Chen
Kuan-Yu Chen
Citations: 6
h-index: 1
Eugene Yang
Eugene Yang
Citations: 4
h-index: 2
Zhi Rui Tam
Zhi Rui Tam
Citations: 1,209
h-index: 9
Yuliia Den
Yuliia Den
Citations: 1
h-index: 1
Yung-Yu Shih
Yung-Yu Shih
Citations: 41
h-index: 2
P. Cheng
P. Cheng
Citations: 6
h-index: 2
Yun-Nung Chen
Yun-Nung Chen
Citations: 2
h-index: 1

Generative retrieval (GR) maps queries directly to document identifiers (docids) using parametric knowledge, However, this design makes corpus expansion costly: adding new documents requires updating model parameters to encode new document-docid associations incurs repeated training and catastrophic forgetting of previously indexed documents. In this work, we revisit incremental GR as an in-context retrieval problem, where newly added documents are supplied as inference-time document-docid evidence. We propose ICICLE, an in-context indexing framework that performs source-aware docid generation over both parametric memory and context-provided document-docid pairs. ICICLE combines a `[COPY]`-based routing mechanism, preference-based calibration, and large context adaptation to distinguish context-grounded retrieval from parametric retrieval. Experiments on MS MARCO and NQ320K show that ICICLE improves retrieval of newly introduced documents while preserving seen-document retention without corpus-specific retraining. Our analysis further shows that high-shot degradation is mainly caused by routing failure, highlighting source-selection calibration as a key bottleneck for scaling in-context generative retrieval.

0 Citations
0 Influential
4.5 Altmetric
22.5 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!