2603.18573v1 Mar 19, 2026 cs.AI

상호작용: 레퍼런스 없는 대화형 추천을 위한 독립적인 시뮬레이터 훈련

Interplay: Training Independent Simulators for Reference-Free Conversational Recommendation

Hossein A. Rahmani

Citations: 213

h-index: 8

Xiao Fu

Citations: 100

h-index: 4

Feng Xia

Citations: 679

h-index: 13

Jerome Ramos

University College London

Citations: 98

h-index: 6

Xi Wang

Citations: 4

h-index: 1

Shubham Chatterjee

Citations: 153

h-index: 5

Aldo Lipani

Citations: 160

h-index: 5

대화형 추천 시스템(CRS)을 훈련하려면 방대한 대화 데이터가 필요하지만, 이러한 데이터를 대규모로 수집하는 것은 어렵습니다. 이를 해결하기 위해 연구자들은 시뮬레이션된 사용자-추천 시스템 간의 대화를 활용해 왔습니다. 기존의 시뮬레이션 방식은 종종 목표 항목에 대한 사전 지식을 바탕으로 전체 대화를 생성하는 단일 대규모 언어 모델(LLM)을 사용하는데, 이는 미리 정해진 듯하고 인위적인 대화로 이어질 수 있습니다. 우리는 두 개의 독립적인 LLM을 훈련하는 레퍼런스 없는 시뮬레이션 프레임워크를 제안합니다. 하나의 모델은 사용자를, 다른 모델은 대화형 추천 시스템을 나타냅니다. 이 모델들은 사전에 정해진 목표 항목에 대한 접근 없이 실시간으로 상호 작용하지만, 사용자 선호도 요약 정보와 목표 항목 속성을 활용하여 추천 시스템이 대화를 통해 사용자의 실제 선호도를 추론할 수 있도록 합니다. 이 접근 방식은 보다 현실적이고 다양한 대화를 생성하며, 실제 인간-AI 상호 작용과 더욱 유사합니다. 우리의 레퍼런스 없는 시뮬레이터는 기존 방법과 동등하거나 그 이상의 품질을 제공하며, 동시에 대화가 미리 정의된 목표 항목에 제약되지 않도록 고품질의 대화형 추천 데이터를 생성하는 확장 가능한 솔루션을 제공합니다. 우리는 정량적 평가와 사용자 평가를 모두 수행하여 우리의 레퍼런스 없는 접근 방식의 효과를 확인했습니다.

Original Abstract

Training conversational recommender systems (CRS) requires extensive dialogue data, which is challenging to collect at scale. To address this, researchers have used simulated user-recommender conversations. Traditional simulation approaches often utilize a single large language model (LLM) that generates entire conversations with prior knowledge of the target items, leading to scripted and artificial dialogues. We propose a reference-free simulation framework that trains two independent LLMs, one as the user and one as the conversational recommender. These models interact in real-time without access to predetermined target items, but preference summaries and target attributes, enabling the recommender to genuinely infer user preferences through dialogue. This approach produces more realistic and diverse conversations that closely mirror authentic human-AI interactions. Our reference-free simulators match or exceed existing methods in quality, while offering a scalable solution for generating high-quality conversational recommendation data without constraining conversations to pre-defined target items. We conduct both quantitative and human evaluations to confirm the effectiveness of our reference-free approach.

0 Citations

0 Influential

6.5 Altmetric

32.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!