2606.13432v1 Jun 11, 2026 cs.CV

OmniDirector: General Multi-Shot Camera Cloning without Cross-Paired Data

Zijie Meng
Zijie Meng
Citations: 115
h-index: 6
Jiwen Liu
Jiwen Liu
Citations: 90
h-index: 5
Pengfei Wan
Pengfei Wan
Citations: 71
h-index: 3
Shujuan Li
Shujuan Li
Citations: 166
h-index: 5
Zhixue Fang
Zhixue Fang
Citations: 10
h-index: 1
Xiaohan Li
Xiaohan Li
Citations: 46
h-index: 3
Yan Zhou
Yan Zhou
Citations: 92
h-index: 4
Zhimin Zhang
Zhimin Zhang
Citations: 57
h-index: 2
Yawen Luo
Yawen Luo
Citations: 102
h-index: 4
Guoxin Zhang
Guoxin Zhang
Citations: 0
h-index: 0
Yu-Shen Liu
Yu-Shen Liu
Citations: 52
h-index: 3

Cloning camera motion from reference videos is an important task in video generation, as videos provide intuitive and precise control. Existing methods either directly use parametric representations that fail to handle multi-shot generation or synthesize cross-paired data, which suffer from data scarcity, resulting in poor performance in complicated camera motion cloning. To address these issues, we introduce a general camera motion representation that encodes cameras as grid motion videos. This camera grid represents the camera parameters visually and supports the integration of diverse trajectories for multi-shot video generation. Building upon this, we propose OmniDirector, a unified framework trained on a million-scale camera grid-video pairs that coordinates characters, actions, and cameras to provide director-level control for multimodal diffusion transformers. Furthermore, we design a novel hierarchical prompt expansion agent that harmoniously integrates different control signals by systematically describing camera motion and visual content through understanding signal relationships. Extensive experiments demonstrate the superior performance and outstanding controllability of our framework. Project page: https://ymlinfeng.github.io/OmniDirector.github.io/

1 Citations
0 Influential
3 Altmetric
16.0 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!