2604.11137v1 Apr 13, 2026 cs.AI

답변에서 논증으로: 톨민 모델 기반 교육과정 및 목표 조건부 학습을 통한 신뢰할 수 있는 임상 진단 추론

From Answers to Arguments: Toward Trustworthy Clinical Diagnostic Reasoning with Toulmin-Guided Curriculum Goal-Conditioned Learning

X. Tan

Citations: 456

h-index: 13

Gengchen Ma

Citations: 19

h-index: 2

Xihe Qiu

Citations: 125

h-index: 7

Chen Zhan

Citations: 4

h-index: 2

Yuyuan Xiong

Citations: 0

h-index: 0

Xiaoyan Jiang

Citations: 5

h-index: 2

대규모 언어 모델(LLM)을 임상 의사 결정 지원에 통합하는 데 있어 가장 큰 장애는 모델의 불투명하고 종종 신뢰할 수 없는 추론 방식입니다. 의료 분야와 같이 중요한 영역에서는 정확한 답변만으로는 충분하지 않으며, 환자 안전을 보장하고 의료 전문가의 책임을 가능하게 하기 위해서는 완전한 투명성이 필수적입니다. 현재 LLM의 심각한 문제점은 "잘못된 추론을 통해 정확한 답변을 생성하는" 경향입니다. 이러한 과정상의 오류는 단순한 학문적 결함이 아니라, 모델이 강력한 이해를 갖추지 못했음을 나타내며, 실제 임상 복잡성에 직면했을 때 광범위한 환각 현상과 예측 불가능한 오류를 유발할 수 있습니다. 본 연구에서는 톨민 모델을 진단 과정에 적용하여 신뢰할 수 있는 임상 논증을 위한 프레임워크를 제시합니다. 우리는 새로운 훈련 파이프라인인 '교육과정 목표 조건부 학습(CGCL)'을 제안하며, 이는 LLM이 톨민 구조를 명시적으로 따르는 진단 논증을 생성하도록 점진적으로 훈련하는 것을 목표로 합니다. CGCL의 점진적인 세 단계 교육 과정은 다음과 같이 견고한 임상 논증을 구축합니다. (1) 사실 추출 및 감별 진단 생성, (2) 핵심 가설을 정당화하고 대안을 반박, (3) 분석을 최종적이고 제한적인 결론으로 종합합니다. 우리는 정량적 평가 프레임워크인 T-Eval을 사용하여 CGCL을 검증했습니다. 실험 결과, 제안하는 방법은 리소스 집약적인 강화 학습(RL) 방법과 유사한 수준의 진단 정확도와 추론 품질을 달성하는 동시에 더욱 안정적이고 효율적인 훈련 파이프라인을 제공하는 것으로 나타났습니다.

Original Abstract

The integration of Large Language Models (LLMs) into clinical decision support is critically obstructed by their opaque and often unreliable reasoning. In the high-stakes domain of healthcare, correct answers alone are insufficient; clinical practice demands full transparency to ensure patient safety and enable professional accountability. A pervasive and dangerous weakness of current LLMs is their tendency to produce "correct answers through flawed reasoning." This issue is far more than a minor academic flaw; such process errors signal a fundamental lack of robust understanding, making the model prone to broader hallucinations and unpredictable failures when faced with real-world clinical complexity. In this paper, we establish a framework for trustworthy clinical argumentation by adapting the Toulmin model to the diagnostic process. We propose a novel training pipeline: Curriculum Goal-Conditioned Learning (CGCL), designed to progressively train LLM to generate diagnostic arguments that explicitly follow this Toulmin structure. CGCL's progressive three-stage curriculum systematically builds a solid clinical argument: (1) extracting facts and generating differential diagnoses; (2) justifying a core hypothesis while rebutting alternatives; and (3) synthesizing the analysis into a final, qualified conclusion. We validate CGCL using T-Eval, a quantitative framework measuring the integrity of the diagnosis reasoning. Experiments show that our method achieves diagnostic accuracy and reasoning quality comparable to resource-intensive Reinforcement Learning (RL) methods, while offering a more stable and efficient training pipeline.

0 Citations

0 Influential

6.5 Altmetric

32.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!