2606.06388v1 Jun 04, 2026 cs.AI

Humans' ALMANAC: A Human Collaboration Dataset of Action-Level Mental Model Annotations for Agent Collaboration

Jian Zhao
Bingsheng Yao
Dakuo Wang
T. Li
Chaoran Chen
Tongshuang Wu
Jiaju Chen
Yuxuan Lu (Northeastern University)
Jiayi Su
Songlin Xiao
Zhengyou Zhang
Yun Wang
Yunyao Li

Recent advances in LLM agents have enabled complex cognitive capabilities, such as multi-step reasoning, planning, and tool use, that increasingly position these agents as human collaborators. Effective collaboration, however, requires collaborators to continuously maintain and align mental models of their own reasoning, their partners' intentions, and shared goals throughout the collaborative process. Today's agents rarely develop such capabilities because they are primarily optimized for task completion, and the community lacks authentic human collaboration data with action-level mental model annotations that could guide agents toward process-level collaborative competence. To bridge this gap, we present ALMANAC, a dataset of Action-Level Mental model ANnotations for Agent Collaboration built from the Map Task, a classic dyadic routing task from social science. ALMANAC contains 2,987 collaboration actions, each paired with theory-informed mental model annotations that record the participants' self-reasoning, perceived partner intent, and perceived team goal. We benchmark six LLMs on predicting humans' next-turn behavior and mental models. Our results demonstrate ALMANAC's utility in evaluating models' ability to simulate human collaborative behaviors and infer their underlying mental models.
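
To make the annotation structure concrete, the following is a minimal sketch of how one annotated action and a next-turn prediction example might be represented. The abstract names only the three annotation dimensions (self-reasoning, perceived partner intent, perceived team goal), so every class name, field name, and utterance below is a hypothetical illustration, not the dataset's actual schema or content.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MentalModel:
    """The three annotation dimensions named in the abstract (field names are assumed)."""
    self_reasoning: str
    partner_intent: str
    team_goal: str


@dataclass(frozen=True)
class AnnotatedAction:
    """One collaboration action paired with its mental model annotation (hypothetical layout)."""
    turn: int
    speaker: str
    action: str
    mental_model: MentalModel


def next_turn_examples(dialogue):
    """Pair each non-empty history prefix with the action that immediately follows it."""
    return [(dialogue[:i], dialogue[i]) for i in range(1, len(dialogue))]


# Invented toy dialogue for illustration only; not taken from ALMANAC.
dialogue = [
    AnnotatedAction(1, "giver", "Start at the top left corner.",
                    MentalModel("I should fix a shared starting point.",
                                "Partner is waiting for the first instruction.",
                                "Reproduce the route on the partner's map.")),
    AnnotatedAction(2, "follower", "Okay, I am at the top left.",
                    MentalModel("I found the starting point.",
                                "Partner wants confirmation before continuing.",
                                "Reproduce the route on the partner's map.")),
    AnnotatedAction(3, "giver", "Go down past the lake.",
                    MentalModel("The lake is the next landmark on my map.",
                                "Partner is ready for the next step.",
                                "Reproduce the route on the partner's map.")),
]

examples = next_turn_examples(dialogue)
for history, target in examples:
    print(len(history), target.speaker, target.mental_model.partner_intent)
```

Under this framing, a benchmarked model would receive the history and be scored on how well it predicts both the target's `action` and its `mental_model` fields; the scoring procedure itself is not described in the abstract and is left out of the sketch.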

