G
G. Shih
Total Citations
308
h-index
10
Papers
1
Publications
#1
2604.19060v1
Apr 21, 2026
Reinforcement Learning Improves LLM Accuracy and Reasoning in Disease Classification from Radiology Reports
Accurate disease classification from radiology reports is essential for many applications. While supervised fine-tuning (SFT) of lightweight LLMs improves accuracy, it can degrade reasoning. We propose a two-stage approach: SFT on disease labels followed by Group Relative Policy Optimization (GRPO) to refine predictions by optimizing accuracy and format without reasoning supervision. Across three radiologist-annotated datasets, SFT outperformed baselines and GRPO further improved classification and enhanced reasoning recall and comprehensiveness.
Yi Lin
Yishu Wei
Adam E. Flanders
G. Shih
Yifan Peng
0 Citations