2603.17834v1 Mar 18, 2026 cs.RO

생성적 제어: 최적화 관점 - 적응적이고 견고한 로봇 제어를 위한 시간 제약 없는 플로우 매칭

Generative Control as Optimization: Time Unconditional Flow Matching for Adaptive and Robust Robotic Control

Yicheng Liu

Citations: 703

h-index: 7

Shaoting Zhu

Citations: 187

h-index: 8

Linzhan Mou

Citations: 342

h-index: 8

R. Huang

Citations: 83

h-index: 3

Hang Zhao

Citations: 166

h-index: 4

Zunzhe Zhang

Citations: 10

h-index: 2

확산 모델과 플로우 매칭은 로봇 모방 학습의 핵심 기술로 자리 잡았지만, 추론 과정이 상태 복잡도에 무관한 고정된 통합 스케줄에 묶여 있어 구조적인 비효율성을 갖습니다. 이러한 방식은 정책이 사소한 움직임과 복잡한 작업에 동일한 계산 자원을 사용하도록 강제합니다. 본 논문에서는 생성적 제어를 최적화(Generative Control as Optimization, GeCO)라는 시간 제약 없는 프레임워크를 소개합니다. GeCO는 행동 생성 과정을 경로 통합에서 반복적인 최적화로 변환합니다. GeCO는 행동 시퀀스 공간에서 전문가의 행동이 안정적인 끌림점으로 형성되는 정적인 속도장을 학습합니다. 그 결과, 테스트 시 추론은 수렴 정도에 따라 계산 자원을 적응적으로 할당하는 과정이 됩니다. 즉, 단순한 상태에서는 빠르게 종료하고, 어려운 상태에서는 더 오래 계산합니다. 또한, 이러한 정적인 기하학 구조는 훈련 없이도 안전성을 확보할 수 있는 고유한 신호를 제공합니다. 최적화된 행동에서의 속도장 크기는 로봇이 예상 범위를 벗어나는(out-of-distribution, OOD) 상황을 견고하게 감지하는 역할을 하며, 정상적인 상태에서는 값이 낮게 유지되지만, 이상 현상이 발생하면 크게 증가합니다. GeCO는 표준 시뮬레이션 벤치마크에서 검증되었으며, pi0 시리즈의 비전-언어-액션(VLA) 모델로의 원활한 확장이 가능함을 보여줍니다. GeCO는 표준 플로우 매칭 모듈을 대체하여 사용될 수 있으며, 안전한 배포를 위한 최적화 기반 메커니즘을 통해 성공률과 효율성을 향상시킵니다. 관련 비디오 및 코드는 https://hrh6666.github.io/GeCO/ 에서 확인할 수 있습니다.

Original Abstract

Diffusion models and flow matching have become a cornerstone of robotic imitation learning, yet they suffer from a structural inefficiency where inference is often bound to a fixed integration schedule that is agnostic to state complexity. This paradigm forces the policy to expend the same computational budget on trivial motions as it does on complex tasks. We introduce Generative Control as Optimization (GeCO), a time-unconditional framework that transforms action synthesis from trajectory integration into iterative optimization. GeCO learns a stationary velocity field in the action-sequence space where expert behaviors form stable attractors. Consequently, test-time inference becomes an adaptive process that allocates computation based on convergence--exiting early for simple states while refining longer for difficult ones. Furthermore, this stationary geometry yields an intrinsic, training-free safety signal, as the field norm at the optimized action serves as a robust out-of-distribution (OOD) detector, remaining low for in-distribution states while significantly increasing for anomalies. We validate GeCO on standard simulation benchmarks and demonstrate seamless scaling to pi0-series Vision-Language-Action (VLA) models. As a plug-and-play replacement for standard flow-matching heads, GeCO improves success rates and efficiency with an optimization-native mechanism for safe deployment. Video and code can be found at https://hrh6666.github.io/GeCO/

0 Citations

0 Influential

4 Altmetric

20.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!