2606.16330v1 Jun 15, 2026 cs.AI

Phase-Aware Guidance Injection for Recurrent MAPPO in Assembly-Line Disruption Recovery

Yongcai Wang
Yongcai Wang
Citations: 4
h-index: 1
Yunjun Han
Yunjun Han
Citations: 108
h-index: 6
Fengyi Zhang
Fengyi Zhang
Citations: 36
h-index: 4
Naiqi Wu
Naiqi Wu
Citations: 1
h-index: 1
Xin Huang
Xin Huang
Citations: 613
h-index: 13
Zhikun Tao
Zhikun Tao
Citations: 20
h-index: 2

Disruption recovery in industrial assembly lines requires timely decisions under machine faults, worker absence, and emergency orders. Existing methods either rely on rigid handcrafted recovery logic or learn adaptive policies that do not readily exploit heterogeneous external recovery knowledge at decision time to reduce abnormal recovery time (ART) and preserve on-time delivery (OTD). To address this gap, we propose a phase-aware guidance injection framework that augments a trained recurrent MAPPO (RMAPPO) scheduling policy through logit-level action bias during evaluation. The framework provides a unified decision-time interface for rule-based, replay-based, and online LLM-based guidance, while activating intervention only during abnormal and recovery phases. Experiments on a custom AssemblyLineEnv show that high-quality rule guidance yields the strongest gains, replay-based guidance degrades smoothly under imperfect availability, and online LLM guidance still provides useful intermediate improvements. These results show that decision-time guidance injection can exploit heterogeneous recovery hints without redesigning the actor.

0 Citations
0 Influential
6.5 Altmetric
32.5 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!