2606.13680v1 Jun 11, 2026 cs.CL

Learning to Reason by Analogy via Retrieval-Augmented Reinforcement Fine-Tuning

Qi Ma
Qi Ma
Citations: 49
h-index: 2
Vicente Ordonez
Vicente Ordonez
Citations: 57
h-index: 4
Zilin Xiao
Zilin Xiao
Rice University
Citations: 63
h-index: 4
Avinash Atreya
Avinash Atreya
Citations: 40
h-index: 1
Xintao Chen
Xintao Chen
Citations: 22
h-index: 1
Hanjie Chen
Hanjie Chen
Citations: 16
h-index: 2
Chunliang Chen
Chunliang Chen
Citations: 16
h-index: 2

Retrieval-augmented generation (RAG) has become a standard mechanism for grounding language models in external knowledge, yet conventional retrieval based on lexical or semantic similarity is poorly suited for complex reasoning tasks: a semantically similar problem may demand an entirely different solution strategy, while a superficially different problem may share the same underlying reasoning pattern. We propose Retrieval-Augmented Reinforcement Fine-Tuning (RA-RFT), a post-training framework that teaches language models to reason by analogy. RA-RFT uses gold-relevance distillation to train a retriever that ranks contexts by expected reasoning benefit rather than semantic overlap, and then fine-tunes the policy model via reinforcement fine-tuning methods with retrieved analogous demonstrations, so the model learns to leverage reasoning traces under verifiable outcome rewards. We further analyze the diversity of retrieved contexts and find that reasoning-aware retrieval surfaces complementary solution strategies that provide distinct reasoning scaffolds for individual problems. Across challenging mathematical reasoning benchmarks, RA-RFT consistently outperforms standard reinforcement fine-tuning methods. For example, it improves AIME 2025 average@32 accuracy by 7.1 and 2.8 points over GRPO for Qwen3-1.7B and Qwen3-4B respectively -- suggesting that reasoning-aware retrieval is a complementary axis of improvement and orthogonal to advances in reward design or training curricula.

0 Citations
0 Influential
2 Altmetric
10.0 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!