2606.05793v1 Jun 04, 2026 cs.CL

CollabBench: Benchmarking and Unleashing Collaborative Ability of LLMs with Diverse Players via Proactive Engagement

Aimin Zhou
Aimin Zhou
Citations: 5
h-index: 1
Xiangfeng Wang
Xiangfeng Wang
Citations: 43
h-index: 2
Yuanhao Liu
Yuanhao Liu
Citations: 17
h-index: 3
Liang Dou
Liang Dou
Citations: 31
h-index: 2
Hong Qian
Hong Qian
Citations: 179
h-index: 9
Zihan Zhou
Zihan Zhou
Citations: 550
h-index: 5
Jingwen Yang
Jingwen Yang
Citations: 119
h-index: 4
Hanjie Ge
Hanjie Ge
Citations: 1
h-index: 1
Haotian Shi
Haotian Shi
Citations: 10
h-index: 1
Zongbao Zhang
Zongbao Zhang
Citations: 0
h-index: 0

While LLM-based agents excel at individual tasks, effective collaboration with realistic human partners remains challenging. Most of the existing conversation-level collaborative studies lack grounded interaction and behavioral execution, motivating the need for cooperative game environments that enable contextualized and immersive collaboration. To this end, this paper proposes CollabBench, a benchmark for evaluating and training collaborative agents in cooperative games. CollabBench features a Diverse Player Profile Simulation pipeline to model varied players behaviors, and a Collaborative Agentic Training paradigm that unifies reasoning, communication, and action via agentic rollouts, optimized with a hybrid reward balancing task efficiency and affective adaptation. We further extend classic environments to CWAH-MultiPlayer and Cook-MultiPlayer for systematic evaluation under diverse personalities. Experiments with efficiency and affective metrics show that our trained models outperform base models, achieving 19.5% higher efficiency and 24.4% improved affective performance. Further analysis reveals key collaborative limitations of existing models and offers insights for future collaborative training.

0 Citations
0 Influential
4.5 Altmetric
22.5 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!