2605.29256v1 May 28, 2026 cs.CL

DynSess: Dynamic Session-Level Evaluation and Optimization Framework for Role-Playing Agents

Ruofan Hu

Citations: 121

h-index: 5

Zhou Zhao

Citations: 60

h-index: 5

Jiji Tang

Citations: 587

h-index: 8

Junnan Ren

Citations: 66

h-index: 4

Zuyi Bao

Citations: 534

h-index: 8

Weijie Chen

Citations: 120

h-index: 5

Tangjie Lv

Citations: 51

h-index: 5

Yan Zhang

Citations: 148

h-index: 4

Rongsheng Zhang

Citations: 62

h-index: 3

Role-playing with large language models is fundamentally a session-level task, requiring agents to sustain character identity and interaction quality across extended multi-turn conversations. Yet existing evaluation and optimization methods remain largely turn-level, failing to capture long-horizon quality. We propose DynSess, a unified session-level framework for role-playing agents. DynSess-Eval scores complete dialogue sessions via rubrics targeting long-horizon behaviors. Leveraging its session-level rewards, we construct high-quality training trajectories through multi-turn lookahead search and train DynSess-Character with two complementary variants: DSPO (off-policy) and GSRPO (on-policy). Experiments show that DynSess-Eval aligns with human judgments substantially better than prior evaluators, and blind human evaluation further shows that DynSess-Character matches the strongest character model despite using substantially fewer parameters, while maintaining strong role consistency and interactive ability. Our dataset and code will be released to facilitate future research.

0 Citations

0 Influential

4 Altmetric

20.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!