2605.29256v1 May 28, 2026 cs.CL

DynSess: Dynamic Session-Level Evaluation and Optimization Framework for Role-Playing Agents

Ruofan Hu
Ruofan Hu
Citations: 121
h-index: 5
Zhou Zhao
Zhou Zhao
Citations: 60
h-index: 5
Jiji Tang
Jiji Tang
Citations: 587
h-index: 8
Junnan Ren
Junnan Ren
Citations: 66
h-index: 4
Zuyi Bao
Zuyi Bao
Citations: 534
h-index: 8
Weijie Chen
Weijie Chen
Citations: 120
h-index: 5
Tangjie Lv
Tangjie Lv
Citations: 51
h-index: 5
Yan Zhang
Yan Zhang
Citations: 148
h-index: 4
Rongsheng Zhang
Rongsheng Zhang
Citations: 62
h-index: 3

Role-playing with large language models is fundamentally a session-level task, requiring agents to sustain character identity and interaction quality across extended multi-turn conversations. Yet existing evaluation and optimization methods remain largely turn-level, failing to capture long-horizon quality. We propose DynSess, a unified session-level framework for role-playing agents. DynSess-Eval scores complete dialogue sessions via rubrics targeting long-horizon behaviors. Leveraging its session-level rewards, we construct high-quality training trajectories through multi-turn lookahead search and train DynSess-Character with two complementary variants: DSPO (off-policy) and GSRPO (on-policy). Experiments show that DynSess-Eval aligns with human judgments substantially better than prior evaluators, and blind human evaluation further shows that DynSess-Character matches the strongest character model despite using substantially fewer parameters, while maintaining strong role consistency and interactive ability. Our dataset and code will be released to facilitate future research.

0 Citations
0 Influential
4 Altmetric
20.0 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!