2601.21609v1 Jan 29, 2026 cs.AI

RecNet: 에이전트 기반 추천 시스템을 위한 자가 진화형 선호도 전파

RecNet: Self-Evolving Preference Propagation for Agentic Recommender Systems

Xiaolei Wang

Renmin University of China

Citations: 5,222

h-index: 10

Bingqian Li

Citations: 74

h-index: 3

Junyi Li

Citations: 20

h-index: 3

Sheng Chen

Citations: 76

h-index: 3

W. Zhao

Citations: 2,955

h-index: 29

Ji-Rong Wen

Citations: 1,514

h-index: 19

Long Zhang

Citations: 21

h-index: 3

Weitao Li

Citations: 341

h-index: 4

에이전트 기반 추천 시스템은 거대 언어 모델(LLM)을 활용하여 복잡한 사용자 행동을 모델링하고 개인화된 의사 결정을 지원합니다. 그러나 기존 방법들은 주로 희소하고 노이즈가 많으며 사용자와 아이템 간의 실시간 상호 영향을 반영하지 못하는 명시적인 사용자-아이템 상호작용에 기반하여 선호도 변화를 모델링합니다. 이러한 한계를 해결하기 위해, 우리는 관련된 사용자와 아이템 간에 실시간 선호도 업데이트를 주도적으로 전파하는 자가 진화형 선호도 전파 프레임워크인 RecNet을 제안합니다. RecNet은 두 가지 상호 보완적인 단계로 구성됩니다. 순방향 단계(forward phase)에서는 중앙 집중식 선호도 라우팅 메커니즘이 라우터 에이전트를 활용하여 선호도 업데이트를 통합하고 이를 가장 관련성 높은 에이전트에게 동적으로 전파합니다. 전파된 선호도의 정확하고 개인화된 통합을 보장하기 위해, 우리는 일시적 캐싱을 위한 메시지 버퍼와 과거 경험 및 관심사를 바탕으로 선택적 선호도 수용을 유도하는 최적화 가능한 규칙 기반 필터 메모리를 결합한 개인화된 선호도 수신 메커니즘을 추가로 도입합니다. 역방향 단계(backward phase)에서는 피드백 주도 전파 최적화 메커니즘이 기여도 할당, 그래디언트 분석 및 모듈 수준 최적화를 위해 LLM을 사용하여 다중 에이전트 강화 학습 프레임워크를 시뮬레이션함으로써 전파 전략의 지속적인 자가 진화를 가능하게 합니다. 다양한 시나리오에 대한 광범위한 실험을 통해 추천 시스템을 위한 선호도 전파 모델링에 있어 RecNet의 효과를 입증했습니다.

Original Abstract

Agentic recommender systems leverage Large Language Models (LLMs) to model complex user behaviors and support personalized decision-making. However, existing methods primarily model preference changes based on explicit user-item interactions, which are sparse, noisy, and unable to reflect the real-time, mutual influences among users and items. To address these limitations, we propose RecNet, a self-evolving preference propagation framework that proactively propagates real-time preference updates across related users and items. RecNet consists of two complementary phases. In the forward phase, the centralized preference routing mechanism leverages router agents to integrate preference updates and dynamically propagate them to the most relevant agents. To ensure accurate and personalized integration of propagated preferences, we further introduce a personalized preference reception mechanism, which combines a message buffer for temporary caching and an optimizable, rule-based filter memory to guide selective preference assimilation based on past experience and interests. In the backward phase, the feedback-driven propagation optimization mechanism simulates a multi-agent reinforcement learning framework, using LLMs for credit assignment, gradient analysis, and module-level optimization, enabling continuous self-evolution of propagation strategies. Extensive experiments on various scenarios demonstrate the effectiveness of RecNet in modeling preference propagation for recommender systems.

3 Citations

0 Influential

14.5 Altmetric

75.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!