2604.05002v1 Apr 05, 2026 cs.LG

분포 변화 하에서의 약한 지도 학습을 통한 안정적인 예측 모델 훈련

Learning Stable Predictors from Weak Supervision under Distribution Shift

Elias Hossain

Citations: 2

h-index: 1

Niloofar Yousefi

Citations: 483

h-index: 12

Mehrdad Shoeibi

Citations: 99

h-index: 7

Ivan Garibay

Citations: 51

h-index: 2

정확한 레이블이 없을 때 약한 또는 대리 지도 학습이 흔히 사용되지만, 특히 지도 메커니즘 자체가 변경될 경우, 분포 변화에 대한 강건성은 여전히 잘 이해되지 못하고 있습니다. 본 연구에서는 이러한 현상을 '지도 드리프트(supervision drift)'라고 정의하며, 이는 컨텍스트에 따른 P(y | x, c)의 변화를 의미합니다. CRISPR-Cas13d 실험에서 RNA-seq 반응으로부터 유도된 가이드 효능을 추론하는 과정을 통해 이를 연구합니다. 두 가지 인간 세포주와 여러 시간대의 데이터를 사용하여, 약한 레이블 생성 방식을 고정한 상태에서 명시적인 도메인 및 시간 변화를 포함하는 통제된 비독립적이고 동일한 분포(non-IID) 벤치마크를 구축했습니다. 모델은 높은 수준의 동일 도메인 성능(ridge R^2 = 0.356, Spearman rho = 0.442)과 부분적인 세포주 간 전이(rho ~ 0.40)를 달성했습니다. 그러나 모든 모델에서 시간적 전이는 실패했으며, 이는 음의 R^2 값과 거의 0에 가까운 상관관계(예: XGBoost R^2 = -0.155, rho = 0.056)로 나타났습니다. 추가적인 분석 결과는 이러한 패턴을 뒷받침합니다. 특징-레이블 간의 관계는 세포주 간에는 안정적이지만 시간에 따라 급격하게 변하는 것으로 나타났습니다. 이는 모델의 한계보다는 지도 드리프트로 인해 실패가 발생한다는 것을 시사합니다. 이러한 결과는 특징 안정성이 배포 전에 비전이 가능성을 감지하는 간단한 진단 지표임을 강조합니다.

Original Abstract

Learning from weak or proxy supervision is common when ground-truth labels are unavailable, yet robustness under distribution shift remains poorly understood, especially when the supervision mechanism itself changes. We formalize this as supervision drift, defined as changes in P(y | x, c) across contexts, and study it in CRISPR-Cas13d experiments where guide efficacy is inferred indirectly from RNA-seq responses. Using data from two human cell lines and multiple time points, we build a controlled non-IID benchmark with explicit domain and temporal shifts while keeping the weak-label construction fixed. Models achieve strong in-domain performance (ridge R^2 = 0.356, Spearman rho = 0.442) and partial cross-cell-line transfer (rho ~ 0.40). However, temporal transfer fails across all models, with negative R^2 and near-zero correlation (e.g., XGBoost R^2 = -0.155, rho = 0.056). Additional analyses confirm this pattern. Feature-label relationships remain stable across cell lines but change sharply over time, indicating that failures arise from supervision drift rather than model limitations. These findings highlight feature stability as a simple diagnostic for detecting non-transferability before deployment.

0 Citations

0 Influential

6 Altmetric

30.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!