2602.16498v1 Feb 18, 2026 cs.LG

빠르고 확장 가능한 분석적 확산 모델

Fast and Scalable Analytical Diffusion

Xinyi Shang

Citations: 22

h-index: 2

Jingyu Lin

Citations: 168

h-index: 6

Peng Sun

Citations: 259

h-index: 6

Zhiqiang Shen

Citations: 10

h-index: 2

분석적 확산 모델은 노이즈 제거 스코어를 경험 베이즈 사후 평균으로 공식화하여 생성 모델링에 대한 수학적으로 투명한 경로를 제공합니다. 그러나 이러한 해석 가능성은 상당한 비용을 초래합니다. 표준 방식은 각 타임스텝에서 전체 데이터 세트를 스캔해야 하므로 데이터 세트 크기에 따라 선형적으로 증가합니다. 본 연구에서는 이러한 확장성 병목 현상을 해결하는 최초의 체계적인 연구를 제시합니다. 우리는 전체 훈련 데이터가 필수라는 기존의 가정에 도전하고, 신호 대 잡음비(SNR)가 증가함에 따라 효과적인 '황금 영역'이 전역 매니폴드에서 로컬 이웃으로 점진적으로 수렴하는 '사후 점진적 집중' 현상을 발견합니다. 이를 활용하여, 훈련이 필요 없는 프레임워크인 Dynamic Time-Aware Golden Subset Diffusion (GoldDiff)을 제안합니다. GoldDiff는 정적 검색 대신, 추론 복잡성을 데이터 세트 크기에서 분리하기 위해 '황금 부분 집합'을 동적으로 식별하는 조잡한-세밀한 메커니즘을 사용합니다. 이론적으로, 우리의 희소 근사치가 정확한 스코어로 수렴함을 보장하는 엄격한 경계를 유도했습니다. 실험적으로, GoldDiff는 AFHQ에서 71배의 속도 향상을 달성했으며, 전체 스캔 기반 모델과 동등하거나 더 나은 성능을 보였습니다. 더욱 중요한 점은, 분석적 확산 모델을 ImageNet-1K에 성공적으로 확장하여 대규모 생성 모델링을 위한 확장 가능하고 훈련이 필요 없는 패러다임을 열었습니다.

Original Abstract

Analytical diffusion models offer a mathematically transparent path to generative modeling by formulating the denoising score as an empirical-Bayes posterior mean. However, this interpretability comes at a prohibitive cost: the standard formulation necessitates a full-dataset scan at every timestep, scaling linearly with dataset size. In this work, we present the first systematic study addressing this scalability bottleneck. We challenge the prevailing assumption that the entire training data is necessary, uncovering the phenomenon of Posterior Progressive Concentration: the effective golden support of the denoising score is not static but shrinks asymptotically from the global manifold to a local neighborhood as the signal-to-noise ratio increases. Capitalizing on this, we propose Dynamic Time-Aware Golden Subset Diffusion (GoldDiff), a training-free framework that decouples inference complexity from dataset size. Instead of static retrieval, GoldDiff uses a coarse-to-fine mechanism to dynamically pinpoint the ''Golden Subset'' for inference. Theoretically, we derive rigorous bounds guaranteeing that our sparse approximation converges to the exact score. Empirically, GoldDiff achieves a $\bf 71 \times$ speedup on AFHQ while matching or achieving even better performance than full-scan baselines. Most notably, we demonstrate the first successful scaling of analytical diffusion to ImageNet-1K, unlocking a scalable, training-free paradigm for large-scale generative modeling.

0 Citations

0 Influential

3 Altmetric

15.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!