2604.27450v1 Apr 30, 2026 cs.RO

RAY-TOLD: 레이 기반 잠재 동역학을 이용한 TDMPC를 통한 고밀도 동적 장애물 회피

RAY-TOLD: Ray-Based Latent Dynamics for Dense Dynamic Obstacle Avoidance with TDMPC

Seokju Lee

Citations: 43

h-index: 3

Jeonguk Kang

Citations: 33

h-index: 2

Seungho Han

Citations: 95

h-index: 6

고밀도의 동적인 군집은 자율 이동 로봇에게 지속적인 과제를 제시합니다. 모델 예측 경로 적분(MPPI) 제어와 같은 순수 반응 기반 계획 방법은 예측 범위의 한계로 인해 복잡한 시나리오에서 종종 지역 최솟값에 갇히는 문제가 발생합니다. 이러한 격차를 해소하기 위해, 우리는 레이 기반 작업 지향 잠재 동역학(RAY-TOLD)이라는 하이브리드 제어 아키텍처를 제안합니다. RAY-TOLD는 장애물 정보를 잠재 동역학에 통합하고, 물리 기반 MPPI의 견고성과 강화 학습의 장기 예측 기능을 결합합니다. RAY-TOLD는 LiDAR 중심의 잠재 동역학 모델을 사용하여 고차원 센서 데이터를 압축된 상태 표현으로 인코딩하여, 최종 가치 함수와 정책 사전 지식을 학습할 수 있도록 합니다. 우리는 MPPI 후보 집단에 학습된 정책에서 파생된 궤적을 추가하는 정책 혼합 샘플링 전략을 도입하여, 계획자가 목표 지향적인 방향으로 이동하면서도 운동학적 제약 조건을 유지하도록 합니다. 확률적인 환경에서 고밀도 동적 장애물이 존재하는 환경에서 수행한 광범위한 실험 결과, 제안하는 방법이 MPPI 기준 성능을 능가하며 충돌률을 감소시키는 것으로 나타났습니다. 이러한 결과는 단기 물리 기반 탐색과 학습된 장기 의도를 결합하면 탐색의 신뢰성과 안전성을 크게 향상시킬 수 있음을 확인합니다.

Original Abstract

Dense, dynamic crowds pose a persistent challenge for autonomous mobile robots. Purely reactive planning methods, such as Model Predictive Path Integral (MPPI) control, often fail to escape local minima in complex scenarios due to their limited prediction horizon. To bridge this gap, we propose Ray-based Task-Oriented Latent Dynamics (RAY-TOLD), a hybrid control architecture that integrates obstacle information into latent dynamics and utilizes the robustness of physics-based MPPI with the long-horizon foresight of reinforcement learning. RAY-TOLD leverages a LiDAR-centric latent dynamics model to encode high-dimensional sensor data into a compact state representation, enabling the learning of a terminal value function and a policy prior. We introduce a policy mixture sampling strategy that augments the MPPI candidate population with trajectories derived from the learned policy, effectively guiding the planner towards the goal while maintaining kinematic feasibility. Extensive tests in a stochastic environment with high-density dynamic obstacles demonstrate that our method outperforms the MPPI baseline, reducing the collision rate. The results confirm that blending short-horizon physics-based rollouts with learned long-horizon intent significantly enhances navigation reliability and safety.

0 Citations

0 Influential

3 Altmetric

15.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!