2601.05785v1 Jan 09, 2026 cs.CV

불완전한 다중 뷰 다중 레이블 분류를 위한 적응적 분리 표현 학습

Adaptive Disentangled Representation Learning for Incomplete Multi-View Multi-Label Classification

Zhiming Liu

Citations: 32

h-index: 3

Tianxiang Xu

Citations: 14

h-index: 2

Quanjiang Li

Citations: 44

h-index: 3

Tingjin Luo

Citations: 188

h-index: 5

Chenping Hou

Citations: 38

h-index: 3

다중 뷰 다중 레이블 학습은 데이터 획득의 어려움과 비용이 많이 드는 지도 학습 때문에 종종 특징의 결측과 불완전한 레이블이라는 동시에 문제를 겪습니다. 이러한 복잡하지만 매우 실용적인 문제를 해결하고, 기존의 특징 복원, 표현 분리, 레이블 의미 모델링의 한계를 극복하기 위해, 우리는 적응적 분리 표현 학습(Adaptive Disentangled Representation Learning, ADRL) 방법을 제안합니다. ADRL은 이웃 정보 인식을 활용하여 모달리티 간의 특징 수준의 유사성을 전파함으로써 강력한 뷰 보완을 달성하며, 확률적 마스킹 전략을 활용하여 재구성 효과를 강화합니다. ADRL은 레이블 분포 전반에 걸쳐 범주 수준의 연관성을 분산시켜, 상호 의존적인 레이블 프로토타입을 캡처하기 위한 분포 파라미터를 개선합니다. 또한, ADRL은 공유된 표현 간의 일관성을 촉진하고 뷰별 표현과 다른 모달리티 간의 정보 중복을 억제하기 위해 상호 정보 기반의 목적 함수를 사용합니다. 이론적으로, 우리는 이중 채널 네트워크를 훈련하기 위한 추론 가능한 경계를 도출했습니다. 더욱이, ADRL은 레이블 임베딩과 뷰 표현 간의 독립적인 상호 작용을 가능하게 하여 프로토타입별 특징 선택을 수행하며, 각 범주에 대한 가짜 레이블을 생성합니다. 그런 다음, 가짜 레이블 공간의 구조적 특징을 활용하여 뷰 융합 과정에서 판별적인 균형을 맞춥니다. 마지막으로, 공개 데이터 세트 및 실제 응용 프로그램에 대한 광범위한 실험을 통해 ADRL의 우수한 성능을 입증합니다.

Original Abstract

Multi-view multi-label learning frequently suffers from simultaneous feature absence and incomplete annotations, due to challenges in data acquisition and cost-intensive supervision. To tackle the complex yet highly practical problem while overcoming the existing limitations of feature recovery, representation disentanglement, and label semantics modeling, we propose an Adaptive Disentangled Representation Learning method (ADRL). ADRL achieves robust view completion by propagating feature-level affinity across modalities with neighborhood awareness, and reinforces reconstruction effectiveness by leveraging a stochastic masking strategy. Through disseminating category-level association across label distributions, ADRL refines distribution parameters for capturing interdependent label prototypes. Besides, we formulate a mutual-information-based objective to promote consistency among shared representations and suppress information overlap between view-specific representation and other modalities. Theoretically, we derive the tractable bounds to train the dual-channel network. Moreover, ADRL performs prototype-specific feature selection by enabling independent interactions between label embeddings and view representations, accompanied by the generation of pseudo-labels for each category. The structural characteristics of the pseudo-label space are then exploited to guide a discriminative trade-off during view fusion. Finally, extensive experiments on public datasets and real-world applications demonstrate the superior performance of ADRL.

0 Citations

0 Influential

2.5 Altmetric

12.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!