2605.29507v1 May 28, 2026 cs.AI

Xetrieval: Mechanistically Explaining Dense Retrieval

Jiaqi Li
Jiaqi Li
Citations: 360
h-index: 7
Zhixi Cai
Zhixi Cai
Citations: 87
h-index: 6
Zilong Zheng
Zilong Zheng
Citations: 420
h-index: 8
Jun Bai
Jun Bai
Beijing Institute for General Artificial Intelligence
Citations: 91
h-index: 5
Yang Liu
Yang Liu
Citations: 11
h-index: 3
Yichi Zhang
Yichi Zhang
Citations: 4
h-index: 1
Taichuan Li
Taichuan Li
Citations: 11
h-index: 1
Zhuofan Chen
Zhuofan Chen
Citations: 25
h-index: 2
Zixiao Jia
Zixiao Jia
Citations: 63
h-index: 1
Wenge Rong
Wenge Rong
Citations: 72
h-index: 3

Explaining why dense retrievers assign high relevance scores remains challenging because retrieval decisions are made through opaque high-dimensional embeddings. Existing explanations often focus on surface signals, such as lexical matches, token alignments, or post-hoc textual rationales, and thus provide limited insight into the latent factors that shape dense retrieval behavior at the embedding level. We propose \textit{Xetrieval}, an embedding-level mechanistic framework for explaining dense retrieval. \textit{Xetrieval} first introduces a lightweight reasoning internalizer that approximates Chain-of-Thought reasoning directly in the embedding space with a single forward pass, enriching sentence embeddings with reasoning-oriented information while avoiding expensive autoregressive generation. It then decomposes these reasoning-enhanced embeddings into sparse, human-interpretable features, each associated with a coherent natural language description. By aggregating sparse feature overlaps across multiple document-side views, \textit{Xetrieval} provides feature-level explanations of individual retrieval decisions. Experiments on diverse retrievers and benchmarks show that \textit{Xetrieval} uncovers coherent interpretable features, yields stronger pair-level intervention effects, and supports task-level feature steering. The project page and source code are available at https://hihiczx.github.io/Xetrieval .

0 Citations
0 Influential
4 Altmetric
20.0 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!