2602.09566v1 Feb 10, 2026 cs.LG

ECG-IMN: 12-극 심전도 해석을 위한 해석 가능한 중간층 신경망

ECG-IMN: Interpretable Mesomorphic Neural Networks for 12-Lead Electrocardiogram Interpretation

Hugo L Hammer

Citations: 19

h-index: 3

Vajira Lasantha Thambawita

Citations: 2,674

h-index: 18

Paal Halvorsen

Citations: 237

h-index: 7

J.L. Isaksen

Citations: 3

h-index: 1

J. Kim Kanters

Citations: 94

h-index: 5

심층 학습은 자동 심전도(ECG) 진단에서 전문가 수준의 성능을 달성했지만, 이러한 모델의 '블랙박스' 특성은 임상 환경에서의 활용을 어렵게 만듭니다. 의료 AI에 대한 신뢰는 높은 정확도뿐만 아니라 예측을 이끄는 특정 생리적 특징에 대한 투명성 또한 필요합니다. 기존의 ECG 설명 방법은 일반적으로 사후 근사(예: Grad-CAM 및 SHAP)에 의존하는데, 이는 불안정하고, 계산 비용이 많이 들며, 모델의 실제 의사 결정 프로세스와 일치하지 않을 수 있습니다. 본 연구에서는 고해상도 12-극 ECG 분류에 특화된 해석 가능한 중간층 신경망인 ECG-IMN을 제안합니다. 표준 분류기와 달리, ECG-IMN은 하이퍼네트워크로 작동합니다. 즉, 심층 컨볼루션 백본은 각 입력 샘플에 특정한 엄격하게 선형적인 모델의 파라미터를 생성합니다. 이러한 아키텍처는 고유한 해석 가능성을 보장합니다. 의사 결정 논리가 수학적으로 투명하며, 생성된 가중치(W)는 정확하고 고해상도의 특징 기여도 맵 역할을 합니다. 우리는 잠재적인 특징을 샘플별 가중치로 효과적으로 매핑하는 전환 디코더를 도입하여 시간 및 리드 차원에서 병리적 증거(예: ST-상승, T파 역전)를 정확하게 위치시키는 기능을 제공합니다. 우리는 PTB-XL 데이터셋을 사용하여 분류 작업을 평가했으며, ECG-IMN이 경쟁력 있는 예측 성능(블랙박스 모델과 비교 가능한 AUROC)을 달성하는 동시에 신뢰할 수 있고, 인스턴스별 설명을 제공한다는 것을 보여주었습니다. 저희의 프레임워크는 파라미터 생성과 예측 실행을 명시적으로 분리함으로써, 심층 학습의 능력과 임상적 신뢰성 간의 격차를 해소하고, '화이트박스' 심장 진단으로 나아가는 원칙적인 방법을 제시합니다.

Original Abstract

Deep learning has achieved expert-level performance in automated electrocardiogram (ECG) diagnosis, yet the "black-box" nature of these models hinders their clinical deployment. Trust in medical AI requires not just high accuracy but also transparency regarding the specific physiological features driving predictions. Existing explainability methods for ECGs typically rely on post-hoc approximations (e.g., Grad-CAM and SHAP), which can be unstable, computationally expensive, and unfaithful to the model's actual decision-making process. In this work, we propose the ECG-IMN, an Interpretable Mesomorphic Neural Network tailored for high-resolution 12-lead ECG classification. Unlike standard classifiers, the ECG-IMN functions as a hypernetwork: a deep convolutional backbone generates the parameters of a strictly linear model specific to each input sample. This architecture enforces intrinsic interpretability, as the decision logic is mathematically transparent and the generated weights (W) serve as exact, high-resolution feature attribution maps. We introduce a transition decoder that effectively maps latent features to sample-wise weights, enabling precise localization of pathological evidence (e.g., ST-elevation, T-wave inversion) in both time and lead dimensions. We evaluate our approach on the PTB-XL dataset for classification tasks, demonstrating that the ECG-IMN achieves competitive predictive performance (AUROC comparable to black-box baselines) while providing faithful, instance-specific explanations. By explicitly decoupling parameter generation from prediction execution, our framework bridges the gap between deep learning capability and clinical trustworthiness, offering a principled path toward "white-box" cardiac diagnostics.

0 Citations

0 Influential

9 Altmetric

45.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!