2605.07632v1 May 08, 2026 cs.CL

추가 학습은 대규모 언어 모델을 인간과 덜 유사하게 만든다

Post-training makes large language models less human-like

David Broska
David Broska
Citations: 39
h-index: 2
Lance Ying
Lance Ying
Citations: 249
h-index: 9
F. Gunther
F. Gunther
Citations: 44
h-index: 2
Evelina Leivada
Evelina Leivada
Citations: 896
h-index: 15
Christopher Summerfield
Christopher Summerfield
Citations: 136
h-index: 7
Shashank Reddy
Shashank Reddy
Citations: 2
h-index: 1
Taisiia Tikhomirova
Taisiia Tikhomirova
Citations: 3
h-index: 1
A. Jagadish
A. Jagadish
Citations: 431
h-index: 9
Milena Rmuš
Milena Rmuš
Citations: 230
h-index: 3
Kristin Witte
Kristin Witte
Citations: 348
h-index: 7
Marvin Mathony
Marvin Mathony
Citations: 233
h-index: 3
Marcel Binz
Marcel Binz
Citations: 1,930
h-index: 18
Eric Schulz
Eric Schulz
Citations: 243
h-index: 4
Brian Christian
Brian Christian
Citations: 4
h-index: 1
Valentin Kriegmair
Valentin Kriegmair
Citations: 7
h-index: 1
Dirk U Wulff
Dirk U Wulff
Citations: 311
h-index: 4
Ji-An Li
Ji-An Li
Citations: 220
h-index: 2
Xinyu Zhang
Xinyu Zhang
Citations: 16
h-index: 2
Elif Akata
Elif Akata
Citations: 552
h-index: 4
Abdullah Almaatouq
Abdullah Almaatouq
Citations: 6
h-index: 1
Mohammed Alsobay
Mohammed Alsobay
Citations: 432
h-index: 7
Oleksii Ariasov
Oleksii Ariasov
Citations: 1
h-index: 1
Franziska Brandle
Franziska Brandle
Citations: 53
h-index: 1
Jason W. Burton
Jason W. Burton
Citations: 196
h-index: 5
Nuno Busch
Nuno Busch
Citations: 20
h-index: 3
Frederick Callaway
Frederick Callaway
Citations: 976
h-index: 15
Vanessa Cheung
Vanessa Cheung
Citations: 61
h-index: 2
Julian Coda-Forno
Julian Coda-Forno
Citations: 720
h-index: 7
Can Demircan
Can Demircan
Citations: 261
h-index: 6
Vittoria Dentella
Vittoria Dentella
Citations: 249
h-index: 8
Maria K. Eckstein
Maria K. Eckstein
Citations: 1,456
h-index: 14
No'emi 'EltetHo
No'emi 'EltetHo
Citations: 53
h-index: 1
Michael Franke
Michael Franke
Citations: 6
h-index: 1
Thomas L. Griffiths
Thomas L. Griffiths
Citations: 199
h-index: 8
Susanne Haridi
Susanne Haridi
Citations: 218
h-index: 2
Sebastian Hellmann
Sebastian Hellmann
Citations: 130
h-index: 5
Stefan Herytash
Stefan Herytash
Citations: 1
h-index: 1
Linus Hof
Linus Hof
Citations: 1
h-index: 1
Eleanor Holton
Eleanor Holton
Citations: 56
h-index: 3
I. Hoxha
I. Hoxha
Citations: 18
h-index: 3
Zak Hussain
Zak Hussain
Citations: 104
h-index: 3
Elif Kara
Elif Kara
Citations: 1
h-index: 1
Tobias Ludwig
Tobias Ludwig
Citations: 228
h-index: 3
Maximilian Maier
Maximilian Maier
Citations: 3
h-index: 1
Marcelo G. Mattar
Marcelo G. Mattar
Citations: 218
h-index: 3
Mariia Nadverniuk
Mariia Nadverniuk
Citations: 1
h-index: 1
Antonios Nasioulas
Antonios Nasioulas
Citations: 3
h-index: 1
Surabhi S. Nath
Surabhi S. Nath
Citations: 287
h-index: 6
Helen Niemeyer
Helen Niemeyer
Citations: 45
h-index: 2
Kate Nussenbaum
Kate Nussenbaum
Citations: 552
h-index: 12
Sebastian Olschewski
Sebastian Olschewski
Citations: 62
h-index: 3
T. Pachur
T. Pachur
Citations: 16
h-index: 2
Stefano Palminteri
Stefano Palminteri
Citations: 51
h-index: 4
Aliona Petrenco
Aliona Petrenco
Citations: 16
h-index: 2
Camille V. Phaneuf-Hadd
Camille V. Phaneuf-Hadd
Citations: 3
h-index: 1
A. Pirrone
A. Pirrone
Citations: 499
h-index: 12
Manuel Rausch
Manuel Rausch
Citations: 623
h-index: 13
Laura Raveling
Laura Raveling
Citations: 13
h-index: 1
Evan M. Russek
Evan M. Russek
Citations: 244
h-index: 4
Tankred Saanum
Tankred Saanum
Citations: 345
h-index: 7
Kai J Sandbrink
Kai J Sandbrink
Citations: 141
h-index: 5
Louis Schiekiera
Louis Schiekiera
Citations: 38
h-index: 2
Johannes A. Schubert
Johannes A. Schubert
Citations: 235
h-index: 3
Luca M. Schulze Buschoff
Luca M. Schulze Buschoff
Citations: 138
h-index: 4
Nishad Singhi
Nishad Singhi
Citations: 340
h-index: 5
Leah H. Somerville
Leah H. Somerville
Citations: 162
h-index: 4
Mikhail S. Spektor
Mikhail S. Spektor
Citations: 1
h-index: 1
Xinning Sui
Xinning Sui
Citations: 1
h-index: 1
Mirko Thalmann
Mirko Thalmann
Citations: 327
h-index: 5
A. I. Thoma
A. I. Thoma
Citations: 4
h-index: 1
Vuong Truong
Vuong Truong
Citations: 215
h-index: 2
Polina Tsvilodub
Polina Tsvilodub
Citations: 61
h-index: 5
Konstantinos Voudouris
Konstantinos Voudouris
Citations: 326
h-index: 4
Robert C. Wilson
Robert C. Wilson
Citations: 250
h-index: 3
Shuchen Wu
Shuchen Wu
Citations: 226
h-index: 3
Huadong Xiong
Huadong Xiong
Citations: 250
h-index: 3
Songlin Xu
Songlin Xu
Citations: 133
h-index: 6
Jian-Qiao Zhu
Jian-Qiao Zhu
Citations: 322
h-index: 9
Alireza Modirshanechi
Alireza Modirshanechi
Citations: 578
h-index: 12
Robin Na
Robin Na
Citations: 8
h-index: 2

대규모 언어 모델(LLM)은 인간 참가자를 대체하는 데 점점 더 많이 사용되지만, 어떤 모델이 인간의 행동을 가장 잘 반영하는지, 그리고 그 이유는 여전히 불분명합니다. 이를 해결하기 위해, 우리는 행동적 일관성을 대규모로 측정할 수 있는 새로운 데이터셋인 Psych-201을 소개합니다. 우리는 추가 학습, 즉 기본 모델을 유용한 도우미로 만드는 단계가 모델 계열, 크기 및 목표에 관계없이 인간 행동과의 일관성을 지속적으로 감소시킨다는 것을 발견했습니다. 더욱이, 이러한 불일치는 기본 모델이 계속 개선되는 동안에도 최신 모델 세대에서 더욱 심화됩니다. 마지막으로, 인간과 유사한 행동을 유도하기 위해 모델에 참가자별 정보를 조건부로 부여하는 인기 있는 기술인 '페르소나 유도'는 개인 수준의 예측 성능을 향상시키지 못한다는 것을 확인했습니다. 종합적으로 볼 때, 우리의 결과는 현재 LLM을 유용한 도우미로 만들기 위해 사용되는 과정 자체가 인간 행동에 대한 모델의 정확성을 떨어뜨린다는 것을 시사합니다.

Original Abstract

Large language models (LLMs) are increasingly used as surrogates for human participants, but it remains unclear which models best capture human behavior and why. To address this, we introduce Psych-201, a novel dataset that enables us to measure behavioral alignment at scale. We find that post-training -- the stage that turns base models into useful assistants -- consistently reduces alignment with human behavior across model families, sizes, and objectives. Moreover, this misalignment widens in newer model generations even as base models continue to improve. Finally, we find that persona-induction -- a popular technique for eliciting human-like behavior by conditioning models on participant-specific information -- does not improve predictions at the level of individuals. Taken together, our results suggest that the very processes that are currently employed to turn LLMs into useful assistants also make them less accurate models of human behavior.

1 Citations
0 Influential
9 Altmetric
46.0 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!