2603.20836v1 Mar 21, 2026 cs.CV

MERIT: 다중 도메인 효율적인 원본 이미지 변환

MERIT: Multi-domain Efficient RAW Image Translation

Ryozo Masukawa

Citations: 83

h-index: 5

Sanggeon Yun

Citations: 237

h-index: 9

Hanning Chen

Citations: 356

h-index: 11

Wenjun Huang

Citations: 158

h-index: 7

Shenghao Fu

Citations: 267

h-index: 8

Yi Jin

Citations: 353

h-index: 7

Yang Ni

Citations: 118

h-index: 6

Ziteng Cui

Citations: 1,104

h-index: 16

Yirui He

Citations: 5

h-index: 2

Yezi Liu

Citations: 64

h-index: 5

Sungheon Jeong

Citations: 87

h-index: 6

William Youngwoo Chung

Citations: 20

h-index: 3

Mohsen Imani

Citations: 344

h-index: 11

다양한 카메라 센서에서 촬영된 원본 이미지는 스펙트럼 응답, 노이즈 특성 및 톤 동작의 차이로 인해 상당한 도메인 간 차이를 보이며, 이는 다운스트림 컴퓨터 비전 작업에 직접 활용하는 것을 어렵게 만듭니다. 기존 방법들은 이러한 문제를 해결하기 위해 각 소스-타겟 쌍에 대한 도메인별 원본-원본 변환기를 학습하지만, 이러한 접근 방식은 여러 종류의 상용 카메라가 관련된 실제 시나리오에는 적용하기 어렵습니다. 본 연구에서는 MERIT을 제안합니다. MERIT은 임의의 카메라 도메인 간 변환을 수행할 수 있는 단일 모델을 활용하는, 최초의 통합된 다중 도메인 원본 이미지 변환 프레임워크입니다. 도메인별 노이즈 불일치를 해결하기 위해, 생성된 이미지의 신호 의존적인 노이즈 통계와 타겟 도메인의 통계를 명시적으로 정렬하는 센서 인식 노이즈 모델링 손실 함수를 제안합니다. 또한, 더 나은 컨텍스트 및 센서 인식 특징 모델링을 위해, 생성기에 조건부 멀티 스케일 대용량 커널 어텐션 모듈을 추가했습니다. 표준화된 평가를 용이하게 하기 위해, 다중 도메인 원본 이미지 변환을 위해 특별히 설계된 최초의 데이터셋인 MDRAW을 소개합니다. MDRAW은 다양한 장면에서 촬영된 5가지 서로 다른 카메라 센서의 페어링된 및 비페어링된 원본 이미지를 포함합니다. 광범위한 실험 결과, MERIT이 기존 모델보다 품질(5.56 dB 향상) 및 확장성(학습 반복 횟수 80% 감소) 측면에서 모두 우수한 성능을 보임을 확인했습니다.

Original Abstract

RAW images captured by different camera sensors exhibit substantial domain shifts due to varying spectral responses, noise characteristics, and tone behaviors, complicating their direct use in downstream computer vision tasks. Prior methods address this problem by training domain-specific RAW-to-RAW translators for each source-target pair, but such approaches do not scale to real-world scenarios involving multiple types of commercial cameras. In this work, we introduce MERIT, the first unified framework for multi-domain RAW image translation, which leverages a single model to perform translations across arbitrary camera domains. To address domain-specific noise discrepancies, we propose a sensor-aware noise modeling loss that explicitly aligns the signal-dependent noise statistics of the generated images with those of the target domain. We further enhance the generator with a conditional multi-scale large kernel attention module for improved context and sensor-aware feature modeling. To facilitate standardized evaluation, we introduce MDRAW, the first dataset tailored for multi-domain RAW image translation, comprising both paired and unpaired RAW captures from five diverse camera sensors across a wide range of scenes. Extensive experiments demonstrate that MERIT outperforms prior models in both quality (5.56 dB improvement) and scalability (80% reduction in training iterations).

0 Citations

0 Influential

8 Altmetric

40.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!