2601.05376v1 Jan 08, 2026 cs.AI

페르소나의 역설: 임상 언어 모델의 행동적 사전 설정으로서의 의료 페르소나

The Persona Paradox: Medical Personas as Behavioral Priors in Clinical Language Models

Tassallah Abdullahi

Brown University

Citations: 205

h-index: 8

Shrestha Ghosh

Citations: 5

h-index: 1

Hamish S Fraser

Citations: 4

h-index: 1

Daniel León Tramontini

Citations: 4

h-index: 1

A. Abbasi

Citations: 65

h-index: 5

C. Eickhoff

Citations: 49

h-index: 3

Ritambhara Singh

Citations: 226

h-index: 9

G. Bourjeily

Citations: 4

h-index: 1

페르소나 컨디셔닝은 거대언어모델(LLM)을 위한 행동적 사전 설정(behavioral prior)으로 간주될 수 있으며, 종종 전문성을 부여하고 안전성을 단조적으로 향상시킨다고 가정됩니다. 그러나 중대한 임상 의사결정에 미치는 영향은 아직 제대로 규명되지 않았습니다. 본 연구는 전문 직업 역할(예: 응급실 의사, 간호사)과 상호작용 스타일(대담함 vs. 신중함)이 모델과 의료 과제 전반에 걸쳐 행동에 어떤 영향을 미치는지 조사하여, 임상 LLM에서의 페르소나 기반 제어를 체계적으로 평가했습니다. 우리는 과제 정확도, 보정(calibration), 안전 관련 위험 행동을 포착하는 다차원 평가를 사용하여 임상 선별 진료(triage) 및 환자 안전 과제에 대한 성능을 평가했습니다. 그 결과 체계적이고 문맥 의존적이며 비단조적인 효과가 발견되었습니다. 의료 페르소나는 중환자 치료 과제에서 성능을 향상시켜 정확도와 보정에서 최대 약 20%의 이득을 보였으나, 1차 진료 환경에서는 비슷한 폭으로 성능을 저하시켰습니다. 상호작용 스타일은 위험 성향과 민감도를 조절하지만, 이는 모델에 따라 크게 달랐습니다. 종합적인 LLM 심판(LLM-judge) 순위는 안전이 중요한 사례에서 비의료 페르소나보다 의료 페르소나를 선호했으나, 인간 임상의들은 안전 준수에 대해 보통 수준의 일치도(평균 Cohen's κ= 0.43)를 보인 반면 추론 품질에 대한 응답의 95.9%에서 낮은 확신을 나타냈습니다. 본 연구는 페르소나가 안전이나 전문성을 보장하기보다는 문맥에 따른 트레이드오프를 유발하는 행동적 사전 설정으로 기능함을 보여줍니다. 코드는 https://github.com/rsinghlab/Persona_Paradox 에서 확인할 수 있습니다.

Original Abstract

Persona conditioning can be viewed as a behavioral prior for large language models (LLMs) and is often assumed to confer expertise and improve safety in a monotonic manner. However, its effects on high-stakes clinical decision-making remain poorly characterized. We systematically evaluate persona-based control in clinical LLMs, examining how professional roles (e.g., Emergency Department physician, nurse) and interaction styles (bold vs.\ cautious) influence behavior across models and medical tasks. We assess performance on clinical triage and patient-safety tasks using multidimensional evaluations that capture task accuracy, calibration, and safety-relevant risk behavior. We find systematic, context-dependent, and non-monotonic effects: Medical personas improve performance in critical care tasks, yielding gains of up to $\sim+20\%$ in accuracy and calibration, but degrade performance in primary-care settings by comparable margins. Interaction style modulates risk propensity and sensitivity, but it's highly model-dependent. While aggregated LLM-judge rankings favor medical over non-medical personas in safety-critical cases, we found that human clinicians show moderate agreement on safety compliance (average Cohen's $κ= 0.43$) but indicate a low confidence in 95.9\% of their responses on reasoning quality. Our work shows that personas function as behavioral priors that introduce context-dependent trade-offs rather than guarantees of safety or expertise. The code is available at https://github.com/rsinghlab/Persona\_Paradox.

3 Citations

0 Influential

24.5 Altmetric

125.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!