2603.00938v1 Mar 01, 2026 cs.CV

8비트 너머로: HDR-UGC 비디오의 주관적 및 객관적 품질 평가

Seeing Beyond 8bits: Subjective and Objective Quality Assessment of HDR-UGC Videos

Shreshth Saini

Citations: 27

h-index: 2

Bowen Chen

Citations: 78

h-index: 2

N. Birkbeck

Citations: 2,335

h-index: 24

Balu Adsumilli

Citations: 41

h-index: 4

A. Bovik

Citations: 2,080

h-index: 3

Yilin Wang

Citations: 3,176

h-index: 16

고 다이내믹 레인지(HDR) 사용자 생성(UGC) 비디오는 소셜 플랫폼에서 빠르게 확산되고 있지만, 대부분의 시각적 품질 평가(VQA) 시스템은 여전히 표준 다이내믹 레인지(SDR)에 맞춰져 있습니다. HDR은 더 높은 비트 심도, 넓은 색 영역 및 향상된 휘도 범위를 가지므로, 흑색 왜곡, 하이라이트 클리핑, 밴딩 및 노출 깜박임과 같은 왜곡이 UGC 콘텐츠의 결함을 증폭시키고 SDR 모델에 대한 과제를 제시합니다. 연구의 진전을 촉진하기 위해, 우리는 다양한 장면, 촬영 조건 및 압축 설정을 포괄하는 6,500개 이상의 소스에서 수집된 44,000개의 비디오로 구성된 대규모 주관적 데이터셋인 Beyond8Bits를 구축했습니다. 또한 HDR-UGC VQA를 위한 최초의 다중 모드 대규모 언어 모델(MLLM)인 HDR-Q를 소개합니다. 우리는 (i) HDR에 민감한 임베딩을 생성하는 새로운 HDR 인지 시각 인코더와 (ii) 추론을 HDR 힌트에 고정시키는 강화 학습 미세 조정 프레임워크인 HDR-Aware Policy Optimization (HAPO)을 제안합니다. HAPO는 HDR-SDR 대비 KL을 통해 토큰이 HDR 입력에 의존하도록 장려하고, 세밀한 MOS(Mean Opinion Score) 보정의 위해 가우시안 가중 회귀 보상을 GRPO에 추가합니다. Beyond8Bits 데이터셋과 공개 HDR-VQA 벤치마크에서 HDR-Q는 최첨단 성능을 달성했습니다.

Original Abstract

High Dynamic Range (HDR) user-generated (UGC) videos are rapidly proliferating across social platforms, yet most perceptual video quality assessment (VQA) systems remain tailored to Standard Dynamic Range (SDR). HDR has a higher bit depth, wide color gamut, and elevated luminance range, exposing distortions such as near-black crushing, highlight clipping, banding, and exposure flicker that amplify UGC artifacts and challenge SDR models. To catalyze progress, we curate Beyond8Bits, a large-scale subjective dataset of 44K videos from 6.5K sources with over 1.5M crowd ratings, spanning diverse scenes, capture conditions, and compression settings. We further introduce HDR-Q, the first Multimodal Large Language Model (MLLM) for HDR-UGC VQA. We propose (i) a novel HDR-aware vision encoder to produce HDR-sensitive embeddings, and (ii) HDR-Aware Policy Optimization (HAPO), an RL finetuning framework that anchors reasoning to HDR cues. HAPO augments GRPO via an HDR-SDR contrastive KL that encourages token reliance on HDR inputs and a Gaussian weighted regression reward for fine-grained MOS calibration. Across Beyond8Bits and public HDR-VQA benchmarks, HDR-Q delivers state-of-the-art performance.

0 Citations

0 Influential

12 Altmetric

60.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!