2601.21340v1 Jan 29, 2026 cs.AI

EHR-RAG: 향상된 검색 증강 생성을 통한 장기 구조화 전자의무기록과 대규모 언어 모델의 연결

EHR-RAG: Bridging Long-Horizon Structured Electronic Health Records and Large Language Models via Enhanced Retrieval-Augmented Generation

Lang Cao

Citations: 93

h-index: 5

Yue Guo

Citations: 76

h-index: 3

Qingyu Chen

Citations: 630

h-index: 11

전자의무기록(EHR)은 의료 의사결정의 핵심이 되는 풍부한 종단적 임상 근거를 제공하며, 이는 대규모 언어 모델(LLM) 예측의 근거를 마련하기 위한 검색 증강 생성(RAG)의 활용을 촉진한다. 그러나 장기간의 EHR은 종종 LLM의 컨텍스트 제한을 초과하며, 기존 접근 방식들은 임상적으로 중요한 사건과 시간적 의존성을 누락시키는 단순 잘라내기나 기본적인 검색 전략에 의존하는 경우가 많다. 이러한 문제를 해결하기 위해, 우리는 장기 구조화 EHR 데이터의 정확한 해석을 위해 설계된 검색 증강 프레임워크인 EHR-RAG를 제안한다. EHR-RAG는 종단적 임상 예측 작업에 특화된 세 가지 구성 요소를 도입한다. 첫째, 임상 구조와 시간적 역학을 보존하기 위한 '사건 및 시간 인식 하이브리드 EHR 검색', 둘째, 광범위한 근거 범위를 확보하기 위해 쿼리를 점진적으로 개선하는 '적응형 반복 검색', 셋째, 사실적 근거와 반사실적 근거를 동시에 검색하고 추론하는 '이중 경로 근거 검색 및 추론'이다. 4가지 장기 EHR 예측 작업에 대한 실험 결과, EHR-RAG는 가장 강력한 LLM 기반 베이스라인 모델들을 일관되게 능가하였으며, 평균 Macro-F1 점수를 10.76% 향상시켰다. 전반적으로 본 연구는 실제 구조화된 EHR 데이터에 대한 임상 예측을 발전시키는 데 있어 검색 증강 LLM의 잠재력을 강조한다.

Original Abstract

Electronic Health Records (EHRs) provide rich longitudinal clinical evidence that is central to medical decision-making, motivating the use of retrieval-augmented generation (RAG) to ground large language model (LLM) predictions. However, long-horizon EHRs often exceed LLM context limits, and existing approaches commonly rely on truncation or vanilla retrieval strategies that discard clinically relevant events and temporal dependencies. To address these challenges, we propose EHR-RAG, a retrieval-augmented framework designed for accurate interpretation of long-horizon structured EHR data. EHR-RAG introduces three components tailored to longitudinal clinical prediction tasks: Event- and Time-Aware Hybrid EHR Retrieval to preserve clinical structure and temporal dynamics, Adaptive Iterative Retrieval to progressively refine queries in order to expand broad evidence coverage, and Dual-Path Evidence Retrieval and Reasoning to jointly retrieves and reasons over both factual and counterfactual evidence. Experiments across four long-horizon EHR prediction tasks show that EHR-RAG consistently outperforms the strongest LLM-based baselines, achieving an average Macro-F1 improvement of 10.76%. Overall, our work highlights the potential of retrieval-augmented LLMs to advance clinical prediction on structured EHR data in practice.

3 Citations

0 Influential

5.5 Altmetric

30.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!