2604.05113v1 Apr 06, 2026 cs.IR

CRAB: 생성적 추천 시스템의 편향 완화를 위한 코드북 재조정

CRAB: Codebook Rebalancing for Bias Mitigation in Generative Recommendation

Lalitesh Morishetti

Citations: 47

h-index: 4

Luyi Ma

Citations: 134

h-index: 7

Kaushiki Nag

Citations: 222

h-index: 8

Sushant Kumar

Citations: 260

h-index: 9

Kannan Achan

Citations: 367

h-index: 10

Ziheng Chen

Citations: 29

h-index: 3

Zezhong Fan

Citations: 116

h-index: 3

Jin Huang

Citations: 16

h-index: 3

생성적 추천(GeneRec)은 아이템을 이산적인 의미 토큰으로 표현하고 생성적인 방식으로 아이템을 예측하는 새로운 패러다임을 제시합니다. 다양한 추천 작업에서 강력한 성능을 보이지만, 기존 GeneRec 방식은 심각한 인기 편향을 겪고 있으며, 심지어 이를 더욱 악화시킬 수도 있습니다. 본 연구에서는 이 현상의 근본 원인을 밝히기 위해 종합적인 실증 분석을 수행했으며, 다음과 같은 두 가지 핵심적인 통찰력을 얻었습니다. 1) 불균형적인 토큰화는 과거 아이템 상호 작용에서 비롯된 인기 편향을 유발하고 증폭시킬 수 있습니다. 2) 현재의 학습 절차는 인기 있는 토큰에 불균형적으로 집중하는 반면, 토큰 간의 의미적 관계를 간과하여 인기 편향을 심화시킵니다. 이러한 통찰력을 바탕으로, 우리는 GeneRec의 사후 편향 완화 전략인 CRAB을 제안합니다. CRAB은 의미 토큰 간의 빈도 불균형을 완화하여 인기 편향을 줄입니다. 구체적으로, 잘 학습된 모델을 기반으로, 우리는 먼저 계층적인 의미 구조를 유지하면서 과도하게 인기 있는 토큰을 분할하여 코드북을 재조정합니다. 조정된 코드북을 바탕으로, 우리는 의미적 일관성을 강화하는 트리 구조의 정규화 기법을 추가하여 학습 과정에서 덜 인기 있는 토큰에 대해 더욱 유용한 표현을 학습하도록 유도합니다. 실제 데이터셋에 대한 실험 결과, CRAB은 인기 편향을 효과적으로 완화하여 추천 성능을 크게 향상시켰습니다.

Original Abstract

Generative recommendation (GeneRec) has introduced a new paradigm that represents items as discrete semantic tokens and predicts items in a generative manner. Despite its strong performance across multiple recommendation tasks, existing GeneRec approaches still suffer from severe popularity bias and may even exacerbate it. In this work, we conduct a comprehensive empirical analysis to uncover the root causes of this phenomenon, yielding two core insights: 1) imbalanced tokenization inherits and can further amplify popularity bias from historical item interactions; 2) current training procedures disproportionately favor popular tokens while neglecting semantic relationships among tokens, thereby intensifying popularity bias. Building on these insights, we propose CRAB, a post-hoc debiasing strategy for GeneRec that alleviates popularity bias by mitigating frequency imbalance among semantic tokens. Specifically, given a well-trained model, we first rebalance the codebook by splitting over-popular tokens while preserving their hierarchical semantic structure. Based on the adjusted codebook, we further introduce a tree-structured regularizer to enhance semantic consistency, encouraging more informative representations for unpopular tokens during training. Experiments on real-world datasets demonstrate that CRAB significantly improves recommendation performance by effectively alleviating popularity bias.

1 Citations

0 Influential

5 Altmetric

26.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!