2601.05657v1 Jan 09, 2026 cs.CL

Stephanie2: 단계별 인공지능 소셜 챗에서 인간처럼 사고하고, 기다리고, 의사 결정을 내리는 방법

Stephanie2: Thinking, Waiting, and Making Decisions Like Humans in Step-by-Step AI Social Chat

Hongyuan Lu

Citations: 299

h-index: 10

Haoran Yang

Citations: 22

h-index: 3

Dingkang Yang

Citations: 32

h-index: 3

Wenli Yang

Citations: 9

h-index: 2

Peng Sun

Citations: 63

h-index: 2

Xiaochuan Zhang

Citations: 27

h-index: 2

Jun Xiao

Citations: 243

h-index: 8

Kefan He

Citations: 87

h-index: 2

Wai Lam

Citations: 108

h-index: 3

Yang Liu

Citations: 26

h-index: 3

Xinhua Zeng

Citations: 38

h-index: 4

실시간 메시지 기반의 인간 소셜 챗은 일반적으로 짧은 메시지들의 연속으로 진행됩니다. 기존의 단계별 인공지능 챗 시스템은 하나의 응답을 여러 메시지로 나누어 순차적으로 전송하지만, 능동적인 대기 메커니즘이 부족하고 부자연스러운 메시지 속도를 보입니다. 이러한 문제점을 해결하기 위해, 우리는 새로운 차세대 단계별 의사 결정 대화 에이전트인 Stephanie2를 제안합니다. Stephanie2는 능동적인 대기와 메시지 속도 조절 기능을 통해, 각 단계에서 메시지 전송 여부를 명시적으로 결정하고, 사고 시간과 입력 시간을 합산하여 지연 시간을 모델링함으로써 보다 자연스러운 대화 흐름을 구현합니다. 또한, 인간 및 자동 평가를 위한 가짜 대화 기록을 생성하기 위해, 시간 창 기반의 이중 에이전트 대화 시스템을 도입했습니다. 실험 결과, Stephanie2는 자연스러움과 참여도 측면에서 Stephanie1보다 뛰어난 성능을 보였으며, 역할 식별 튜링 테스트에서 더 높은 합격률을 달성했습니다.

Original Abstract

Instant-messaging human social chat typically progresses through a sequence of short messages. Existing step-by-step AI chatting systems typically split a one-shot generation into multiple messages and send them sequentially, but they lack an active waiting mechanism and exhibit unnatural message pacing. In order to address these issues, we propose Stephanie2, a novel next-generation step-wise decision-making dialogue agent. With active waiting and message-pace adaptation, Stephanie2 explicitly decides at each step whether to send or wait, and models latency as the sum of thinking time and typing time to achieve more natural pacing. We further introduce a time-window-based dual-agent dialogue system to generate pseudo dialogue histories for human and automatic evaluations. Experiments show that Stephanie2 clearly outperforms Stephanie1 on metrics such as naturalness and engagement, and achieves a higher pass rate on human evaluation with the role identification Turing test.

2 Citations

0 Influential

5 Altmetric

27.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!