2604.10507v1 Apr 12, 2026 cs.AI

규정 준수를 넘어: 저항 이론에 기반한 동기 부여 추론 프레임워크를 활용한 심리 상담 시뮬레이션의 도전 과제

Beyond Compliance: A Resistance-Informed Motivation Reasoning Framework for Challenging Psychological Client Simulation

Yuxin Hu

Citations: 14

h-index: 2

Danni Liu

Citations: 6

h-index: 1

Bo Liu

Citations: 6

h-index: 1

Hantao Zhao

Citations: 37

h-index: 3

Yan Liu

Citations: 1,005

h-index: 15

Ding Ding

Citations: 41

h-index: 3

Jiahui Jin

Citations: 2

h-index: 1

Jiuxin Cao

Citations: 5

h-index: 1

심리 상담 시뮬레이션은 상담 교육생 및 심리 분야의 대규모 언어 모델(LLM)을 훈련하고 평가하는 확장 가능한 솔루션으로 등장했습니다. 그러나 기존 시뮬레이션은 비현실적인 과도한 순응성을 보이는 경향이 있어, 실제 환경에서 흔히 나타나는 어려운 행동에 대한 대비가 부족할 수 있습니다. 이러한 격차를 해소하기 위해, 본 연구에서는 클라이언트 저항 이론을 기반으로 외부 행동과 근본적인 동기 부여 메커니즘을 통합하여 어려운 클라이언트 행동을 체계적으로 모델링하는 ResistClient를 제안합니다. 이를 위해, 우리는 두 단계로 구성된 훈련 프레임워크인 저항 기반 동기 부여 추론(RIMR)을 제시합니다. 첫째, RIMR은 RPC(저항 중심의 광범위한 심리 대화 데이터셋)를 활용한 지도 학습을 통해 순응 편향을 완화합니다. 둘째, RIMR은 단순히 표면적인 응답 모방을 넘어, 응답 생성 전에 심리적으로 일관된 동기 부여 추론을 모델링하며, 프로세스 기반 강화 학습을 통해 동기 부여의 진정성과 응답의 일관성을 동시에 최적화합니다. 광범위한 자동 평가 및 전문가 평가 결과, ResistClient는 기존 시뮬레이션보다 도전 과제 충실도, 행동의 현실성, 추론의 일관성 측면에서 현저히 우수한 성능을 보입니다. 또한, ResistClient는 어려운 조건에서 심리 분야의 LLM을 평가하는 데 유용하며, 정신 건강 대화 시스템의 새로운 최적화 방향을 제시합니다.

Original Abstract

Psychological client simulators have emerged as a scalable solution for training and evaluating counselor trainees and psychological LLMs. Yet existing simulators exhibit unrealistic over-compliance, leaving counselors underprepared for the challenging behaviors common in real-world practice. To bridge this gap, we present ResistClient, which systematically models challenging client behaviors grounded in Client Resistance Theory by integrating external behaviors with underlying motivational mechanisms. To this end, we propose Resistance-Informed Motivation Reasoning (RIMR), a two-stage training framework. First, RIMR mitigates compliance bias via supervised fine-tuning on RPC, a large-scale resistance-oriented psychological conversation dataset covering diverse client profiles. Second, beyond surface-level response imitation, RIMR models psychologically coherent motivation reasoning before response generation, jointly optimizing motivation authenticity and response consistency via process-supervised reinforcement learning. Extensive automatic and expert evaluations show that ResistClient substantially outperforms existing simulators in challenge fidelity, behavioral plausibility, and reasoning coherence. Moreover, ResistClient facilities evaluation of psychological LLMs under challenging conditions, offering new optimization directions for mental health dialogue systems.

1 Citations

0 Influential

7.5 Altmetric

38.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!