2604.15231v1 Apr 16, 2026 cs.AI

RadAgent: 흉부 컴퓨터 단층 촬영 영상의 단계별 해석을 위한 도구 사용 AI 에이전트

RadAgent: A tool-using AI agent for stepwise interpretation of chest computed tomography

Jean-Benoit Delbrouck

Citations: 2,821

h-index: 22

Julia E. Vogt

Citations: 164

h-index: 5

Christian Bluethgen

Citations: 1,256

h-index: 13

Jiwoong Sohn

Citations: 337

h-index: 6

Kenneth Styppa

Citations: 2

h-index: 1

Mélanie Roschewitz

Citations: 220

h-index: 8

Benjamin Gundersen

Citations: 4

h-index: 1

Nicolas Deperrois

Citations: 108

h-index: 4

B. Menze

Citations: 5

h-index: 1

F. Nooralahzadeh

Citations: 494

h-index: 9

M. Krauthammer

Citations: 103

h-index: 5

Michael Moor

Citations: 5

h-index: 1

Yitian Tao

Citations: 47

h-index: 3

비전-언어 모델(VLM)은 컴퓨터 단층 촬영(CT)과 같은 복잡한 의료 영상의 AI 기반 해석 및 보고 기능을 크게 발전시켰습니다. 그러나 기존 방법은 대부분 임상의를 최종 결과물의 수동적인 관찰자로 한정하며, 검토, 검증 또는 개선을 위한 해석 가능한 추론 과정을 제공하지 않습니다. 이러한 문제를 해결하기 위해, 저희는 CT 보고서를 단계별 및 해석 가능한 방식으로 생성하는 도구 사용 AI 에이전트인 RadAgent를 소개합니다. 생성된 각 보고서에는 중간 결정 및 도구 상호 작용에 대한 완전한 검토 가능한 추론 과정이 함께 제공되어 임상의가 보고된 결과가 어떻게 도출되었는지 확인할 수 있습니다. 실험 결과, RadAgent는 세 가지 측면에서 기존의 3D VLM 모델인 CT-Chat보다 흉부 CT 보고서 생성 성능을 향상시키는 것으로 나타났습니다. 거시 F1 점수가 6.0점(36.4% 상대적 증가), 미시 F1 점수가 5.4점(19.6% 상대적 증가) 향상되었으며, 적대적 조건 하에서의 안정성 또한 24.7점(41.9% 상대적 증가) 향상되었습니다. 또한, RadAgent는 37.0%의 충실도를 달성했으며, 이는 기존의 3D VLM 모델에서는 전혀 나타나지 않는 새로운 기능입니다. RadAgent는 흉부 CT 해석을 명시적이고, 도구 기반이며, 반복적인 추론 과정으로 구조화함으로써, 방사선학 분야에서 투명하고 신뢰할 수 있는 AI 시스템 개발에 한 걸음 더 다가갈 수 있도록 합니다.

Original Abstract

Vision-language models (VLM) have markedly advanced AI-driven interpretation and reporting of complex medical imaging, such as computed tomography (CT). Yet, existing methods largely relegate clinicians to passive observers of final outputs, offering no interpretable reasoning trace for them to inspect, validate, or refine. To address this, we introduce RadAgent, a tool-using AI agent that generates CT reports through a stepwise and interpretable process. Each resulting report is accompanied by a fully inspectable trace of intermediate decisions and tool interactions, allowing clinicians to examine how the reported findings are derived. In our experiments, we observe that RadAgent improves Chest CT report generation over its 3D VLM counterpart, CT-Chat, across three dimensions. Clinical accuracy improves by 6.0 points (36.4% relative) in macro-F1 and 5.4 points (19.6% relative) in micro-F1. Robustness under adversarial conditions improves by 24.7 points (41.9% relative). Furthermore, RadAgent achieves 37.0% in faithfulness, a new capability entirely absent in its 3D VLM counterpart. By structuring the interpretation of chest CT as an explicit, tool-augmented and iterative reasoning trace, RadAgent brings us closer toward transparent and reliable AI for radiology.

0 Citations

0 Influential

11 Altmetric

55.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!