2603.12261v1 Mar 12, 2026 cs.LG

잠재 색상 부분 공간: 고차원 혼돈에서의 자기 조직화

The Latent Color Subspace: Emergent Order in High-Dimensional Chaos

Quentin Bouniot

Citations: 135

h-index: 6

Zeynep Akata

Citations: 1,732

h-index: 22

Mateusz Pach

Citations: 50

h-index: 2

Jessica Bader

Citations: 46

h-index: 3

Serge Belongie

Citations: 70

h-index: 3

텍스트-이미지 생성 모델은 빠르게 발전해 왔지만, 생성된 이미지에 대한 미세한 제어를 달성하는 것은 여전히 어렵습니다. 이는 주로 의미 정보가 어떻게 인코딩되는지에 대한 이해 부족 때문입니다. 본 연구에서는 FLUX.1 [Dev]의 변분 오토인코더 잠재 공간에서 색상 표현 방식을 해석하여, Hue(색상), Saturation(채도), Lightness(명도)를 반영하는 구조를 밝혀냈습니다. 개발된 잠재 색상 부분 공간(LCS) 해석이 색상을 예측하고 명시적으로 제어할 수 있음을 입증하고, FLUX에서 완전한 학습 없이 잠재 공간 조작만을 사용하여 색상을 제어하는 방법을 제시합니다. 관련 코드는 https://github.com/ExplainableML/LCS 에서 확인할 수 있습니다.

Original Abstract

Text-to-image generation models have advanced rapidly, yet achieving fine-grained control over generated images remains difficult, largely due to limited understanding of how semantic information is encoded. We develop an interpretation of the color representation in the Variational Autoencoder latent space of FLUX.1 [Dev], revealing a structure reflecting Hue, Saturation, and Lightness. We verify our Latent Color Subspace (LCS) interpretation by demonstrating that it can both predict and explicitly control color, introducing a fully training-free method in FLUX based solely on closed-form latent-space manipulation. Code is available at https://github.com/ExplainableML/LCS.

0 Citations

0 Influential

31 Altmetric

155.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!