2603.17832v1 Mar 18, 2026 cs.CL

텍스트-투-스테이지: 장문 서사에서 추출한 공간 배치

Text-to-Stage: Spatial Layouts from Long-form Narratives

Chenxi Whitehouse

Citations: 800

h-index: 13

Sanjeel Parekh

Citations: 194

h-index: 8

Calvin Murdock

Citations: 34

h-index: 4

Yuliang Li

Citations: 111

h-index: 2

W. O. Brimijoin

Citations: 761

h-index: 15

V. Ithapu

Citations: 3,398

h-index: 20

Swarnadeep Saha

University of North Carolina Chapel Hill

Citations: 1,709

h-index: 21

Jefferson Hernandez

Citations: 13

h-index: 2

Ishwarya Ananthabhotla

Citations: 210

h-index: 8

본 연구에서는 언어 모델이 비정형 텍스트로부터 공간 추론 능력을 보여주는지 조사하며, 이는 인간의 능력을 모방하고 다양한 미디어 응용 분야에 도움이 되는 프로세스를 자동화합니다. 구체적으로, 우리는 서사-연극 작업(narrative-to-play task)을 연구합니다. 이는 명시적인 공간, 위치 또는 관계적 단서가 없는 텍스트로부터 연극 무대 배치(장면, 연기자 위치, 움직임, 방 유형)를 추론하는 작업입니다. 우리는 연극 기법에서 영감을 받은 결정적 평가 도구를 소개하고, 마지막으로, 베스트-오브-N 샘플링을 사용한 거부 기반의 지도 학습(SFT)과 검증 가능한 보상을 통한 강화 학습(RL, GRPO)을 결합한 학습 및 추론 방법을 제시합니다. 고전 영어 문학 텍스트만으로 구성된 데이터셋에 대한 실험 결과, 제안하는 방법은 기존 모델에 비해 여러 지표(캐릭터 속성 부여, 공간적 타당성, 움직임 효율성)에서 성능 향상을 보이며, 또한 LLM을 평가자로 사용하고 주관적인 인간 선호도와 일관성을 보이는 것을 확인했습니다.

Original Abstract

In this work, we probe the ability of a language model to demonstrate spatial reasoning from unstructured text, mimicking human capabilities and automating a process that benefits many downstream media applications. Concretely, we study the narrative-to-play task: inferring stage-play layouts (scenes, speaker positions, movements, and room types) from text that lacks explicit spatial, positional, or relational cues. We then introduce a dramaturgy-inspired deterministic evaluation suite and, finally, a training and inference recipe that combines rejection SFT using Best-of-N sampling with RL from verifiable rewards via GRPO. Experiments on a text-only corpus of classical English literature demonstrate improvements over vanilla models across multiple metrics (character attribution, spatial plausibility, and movement economy), as well as alignment with an LLM-as-a-judge and subjective human preferences.

0 Citations

0 Influential

10.5 Altmetric

52.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!