2603.12581v1 Mar 13, 2026 eess.IV

다중 척도 구조 기반 잠재 확산 모델을 이용한 다중 모드 MRI 변환

Multiscale Structure-Guided Latent Diffusion for Multimodal MRI Translation

Beijing

Citations: 219

h-index: 7

Peng Cao

Citations: 48

h-index: 4

Osmar R. Zaiane

Citations: 498

h-index: 12

Jinzhu Yang

Citations: 209

h-index: 7

Jianqiang Lin

Citations: 7

h-index: 1

Zhiqiang Shen

Citations: 8

h-index: 2

Xiaoli Liu Northeastern University

Citations: 0

h-index: 0

Shenyang

Citations: 3

h-index: 1

China

Citations: 88

h-index: 3

Key Laboratory of Intelligent Computing in Medical Image

Citations: 0

h-index: 0

National Frontiers Science Center for Industrial Intelligence

Citations: 0

h-index: 0

Systems Optimization

Citations: 0

h-index: 0

U. Alberta

Citations: 1,534

h-index: 20

Edmonton

Citations: 576

h-index: 9

Canada

Citations: 62

h-index: 6

AI Research

Citations: 31

h-index: 2

확산 모델은 다중 모드 자기 공명 영상(MRI) 변환 작업에서 놀라운 발전을 이루었지만, 기존 방법은 여전히 임의의 누락 모드 시나리오를 처리할 때 해부학적 불일치 또는 저하된 텍스처 디테일을 나타내는 경향이 있습니다. 이러한 문제를 해결하기 위해, 우리는 잠재 확산 모델 기반의 다중 모드 MRI 변환 프레임워크인 MSG-LDM을 제안합니다. 제안된 방법은 사용 가능한 모드를 활용하여 완전한 구조 정보를 추론하며, 이를 통해 신뢰할 수 있는 경계 디테일을 유지합니다. 구체적으로, 우리는 잠재 공간에서 스타일-구조 분리 메커니즘을 도입하여, 모드별 스타일 특징과 공유된 구조 표현을 명시적으로 분리하고, 다중 척도 특징 공간에서 저주파 해부학적 레이아웃과 고주파 경계 디테일을 공동으로 모델링합니다. 구조 분리 단계에서, 고주파 구조 정보가 명시적으로 통합되어 특징 표현을 향상시키고, 모델이 모드 불변의 저주파 해부학적 표현을 학습하는 동시에 미세한 구조적 단서를 중점적으로 다루도록 유도합니다. 또한, 모드별 스타일로부터의 간섭을 줄이고 구조 표현의 안정성을 향상시키기 위해, 스타일 일관성 손실 및 구조 인식 손실을 설계했습니다. BraTS2020 및 WMH 데이터 세트에 대한 광범위한 실험 결과, 제안된 방법이 기존의 MRI 합성 접근 방식보다 우수한 성능을 보이며, 특히 완전한 구조 복원 측면에서 뛰어난 성능을 나타냅니다. 소스 코드는 https://github.com/ziyi-start/MSG-LDM 에서 공개적으로 이용할 수 있습니다.

Original Abstract

Although diffusion models have achieved remarkable progress in multi-modal magnetic resonance imaging (MRI) translation tasks, existing methods still tend to suffer from anatomical inconsistencies or degraded texture details when handling arbitrary missing-modality scenarios. To address these issues, we propose a latent diffusion-based multi-modal MRI translation framework, termed MSG-LDM. By leveraging the available modalities, the proposed method infers complete structural information, which preserves reliable boundary details. Specifically, we introduce a style--structure disentanglement mechanism in the latent space, which explicitly separates modality-specific style features from shared structural representations, and jointly models low-frequency anatomical layouts and high-frequency boundary details in a multi-scale feature space. During the structure disentanglement stage, high-frequency structural information is explicitly incorporated to enhance feature representations, guiding the model to focus on fine-grained structural cues while learning modality-invariant low-frequency anatomical representations. Furthermore, to reduce interference from modality-specific styles and improve the stability of structure representations, we design a style consistency loss and a structure-aware loss. Extensive experiments on the BraTS2020 and WMH datasets demonstrate that the proposed method outperforms existing MRI synthesis approaches, particularly in reconstructing complete structures. The source code is publicly available at https://github.com/ziyi-start/MSG-LDM.

0 Citations

0 Influential

30 Altmetric

150.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!