2605.05958v1 May 07, 2026 cs.AI

편향 제거를 위한 이중 로버스트 학습: 시간적 안정성을 고려한 지식 추적

Temporal Smoothness Doubly Robust Learning for Debiased Knowledge Tracing

Wei Chen

Citations: 399

h-index: 8

Ruichu Cai

Citations: 2,099

h-index: 22

Peilin Zhan

Citations: 10

h-index: 1

Weilin Chen

Citations: 104

h-index: 5

Shuyi Pan

Citations: 104

h-index: 4

지식 추적(KT)은 지능형 교육 시스템의 핵심 기술이지만, 선택적으로 기록된 교육 로그에 의존합니다. 과제 추천 및 학생 선택의 무작위성이 결여되어 심각한 선택 편향이 발생합니다. 기존 KT 방법들은 이러한 문제를 간과하고, 표준적인 경험적 위험을 사용하여 관찰된 로그로 학습하여 편향된 숙달도 추정치를 얻고, 후속 추천에서 오류가 누적되는 문제를 야기합니다. 이를 해결하기 위해, 우리는 KT를 위한 이중 로버스트(DR) 프레임워크를 제안합니다. 이 프레임워크는 경향 모델과 오차 보정 모델을 통합하여, 어느 모델이 정확하다면 이론적으로 편향이 없음을 보장합니다. KT의 순차적 설정에서, 추정기의 성능은 분산에 의존적인 확률적 편차에 의해 저해되며, 이는 시간 경과에 따라 누적되어 학습 불안정을 유발하고 성능을 제한합니다. 이러한 문제를 완화하기 위해, 우리는 추정기 분산의 영향을 명시적으로 나타내는 일반화 경계를 도출하고, 시간적 안정성이 이를 제어하는 핵심 요소임을 밝혀냈습니다. 이러한 이론적 통찰력을 바탕으로, 시간적 안정성(Temporal Smoothness)을 고려한 이중 로버스트(TSDR) 프레임워크를 제안합니다. TSDR은 KT 예측 모델과 보정 모델을 동시에 최적화하며, 안정성 제어를 위한 정규화를 적용하여 분산을 줄이는 동시에 DR의 편향 없음 보장을 유지합니다. 여러 실제 데이터셋에 대한 실험 결과는 TSDR이 최첨단 KT 모델들의 성능을 지속적으로 향상시키며, KT에서 원칙적인 편향 보정의 중요성을 강조합니다.

Original Abstract

Knowledge Tracing (KT) is fundamental to intelligent education systems, yet relies on educational logs that are selectively observed. The non-random nature of exercise recommendations and student choices inevitably induces severe selection bias. Most existing KT methods neglect this issue, training on observed logs using standard empirical risk, which yields biased mastery estimates and accumulates errors in subsequent recommendations. To address this, we introduce a doubly robust (DR) formulation for KT that integrates a propensity model with an error imputation model, theoretically guaranteeing unbiasedness if either model is accurate. Beyond unbiasedness, in the sequential setting of KT, we identify that the estimator's performance is compromised by variance-dependent stochastic deviations that accumulate over time, thereby causing training instability and limiting performance. To mitigate this, we derive a generalization bound that explicitly characterizes the impact of estimator variance and identifies temporal smoothness as a key factor in controlling it. Building on these theoretical insights, we propose the Temporal Smoothness Doubly Robust (TSDR) framework. TSDR jointly optimizes the KT predictor and the imputation model with a smoothness regularizer, effectively reducing variance while preserving the unbiasedness guarantee of DR. Experiments on multiple real-world benchmarks demonstrate that TSDR consistently enhances various state-of-the-art KT backbones, underscoring the vital role of principled bias correction in KT.

0 Citations

0 Influential

11 Altmetric

55.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!