2605.02734v1 May 04, 2026 cs.AI

의료 영상 처리를 위한 계층적 다중 레이블 학습 기반 지연 예측 모델

Coherent Hierarchical Multi-Label Learning to Defer for Medical Imaging

Pramit Saha

Citations: 133

h-index: 7

Joshua Strong

Citations: 136

h-index: 2

Emma Sun

Citations: 2

h-index: 1

H. Higham

Citations: 99

h-index: 5

Alison Noble

Citations: 14

h-index: 2

학습 기반 지연 예측(Learning to Defer, L2D)은 모델이 자율적으로 예측하거나 전문가에게 위임할 수 있도록 하지만, 기존 연구에서는 주로 단순한 레이블 구조를 가정했습니다. 본 연구에서는 의료 영상 워크플로우에서 발견 사항이 임상 분류 체계에 따라 구성되는 맥락에서, 계층적 다중 레이블 결정을 포함하는 첫 번째 L2D 모델을 제시합니다. 이 환경에서, 지연은 레이블 할당이 아닌 위임 행위이므로, 각 레이블에 대한 독립적인 결정으로 처리하면 분류 체계의 모순, 위임 위반, 모델 자체의 주장에 의해 이미 암시된 레이블에 대한 지연 등과 같은 비일관성이 발생할 수 있습니다. 본 연구에서는 선택적 제외(Selective-Exclusion) 방식의 핸드오프 계약 하에서 일관된 계층적 지연을 정의하고, 베이즈 최적의 일관된 지연 규칙을 분석하며, 노드별 베이즈 L2D 모델에서도 행위 불일치가 발생할 수 있음을 보여줍니다. 또한, 이러한 문제를 해결하기 위해 정확한 일관성 투영(exact coherent projection), 일관된 행동 집합에 대한 동적 프로그래밍 디코더, 그리고 계약을 고려한 공동 행동 모델인 분류 체계 기반 신뢰 전파(Taxonomic Belief Propagation, TBP)와 재귀적 정책 최적화(Recursive Policy Optimization, RPO)라는 두 가지 방법을 제안합니다. 실제 의료 영상 데이터와 제어된 전문가 환경에서의 실험 결과, 단순한 이진 관련성 L2D 모델은 상당한 비일관성을 보입니다. 투영 방식은 이러한 비일관성을 완전히 제거하며, 빠르고 효율적인 TBP+RPO 방식은 비일관성을 거의 0으로 줄이면서도 높은 성능을 유지합니다.

Original Abstract

Learning to Defer (L2D) enables a model to predict autonomously or defer to an expert, but prior work largely assumes flat label spaces. We study the first L2D setting with hierarchical multi-label decisions, motivated by medical-imaging workflows in which findings are organised by clinical taxonomies. In this setting, deferral is a delegation action rather than a label assignment, so treating it as an independent per-label decision can produce deferral incoherence, including taxonomic contradictions, delegation violations, and deferrals of labels already implied by the model's own assertions. We formalise coherent hierarchical deferral under a Selective-Exclusion handoff contract, characterise the Bayes-optimal coherent deferral rule, and show that even nodewise Bayes L2D can be action-incoherent. We then propose two remedies: exact coherent projection, a dynamic-programming decoder over the coherent action set, and Taxonomic Belief Propagation (TBP) with Recursive Policy Optimisation (RPO), a contract-aware joint action model trained through the same recursion used at inference. Across real-reader and controlled-expert medical-imaging benchmarks, naive binary-relevance L2D exhibits non-trivial incoherence. Projection removes it exactly, and fast TBP+RPO drives incoherence near zero while retaining strong utility.

0 Citations

0 Influential

3.5 Altmetric

17.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!