2605.28077v1 May 27, 2026 cs.AI

MACReD: A Multi-Agent Collaborative Reasoning Framework for Reaction Diagram Parsing

Yinuo Xu
Yinuo Xu
Citations: 7
h-index: 1
Chuang Tang
Chuang Tang
Citations: 40
h-index: 3
Chen Lin
Chen Lin
Citations: 1,265
h-index: 18
Hao Wang
Hao Wang
Citations: 83
h-index: 5
Xin Li
Xin Li
Citations: 44
h-index: 2
Mingjun Xiao
Mingjun Xiao
Citations: 25
h-index: 2
Enhong Chen
Enhong Chen
Citations: 35
h-index: 3
Jinrui Zhou
Jinrui Zhou
Citations: 132
h-index: 6

Parsing chemical reaction diagrams from scientific literature is challenging due to heterogeneous layouts, intertwined visual elements, and the difficulty of integrating recognition and reasoning. Existing vision-language models advance multimodal understanding but still fail on complex diagrams, struggling to maintain spatial coherence and to integrate multidimensional information during reasoning. To address these issues, we propose MACReD, a hierarchical multi-agent framework that coordinates specialized agents for molecular perception, arrow understanding, text extraction, and reaction reconstruction within a unified VLM-guided architecture. The planning and perception layers use flexible, fine-grained detection to handle visual complexity, while the reasoning layer uses a multigraph fusion mechanism to integrate heterogeneous cues and enforce chemically consistent global reasoning. Experiments on the RxnScribe benchmark show that MACReD achieves state-of-the-art performance, with F1 scores of 75.2% and 84.6% under hard and soft match criteria, outperforming the RxnScribe baseline, which obtains 69.1% and 80.0%, respectively. These results demonstrate the robustness of MACReD across diverse diagram layouts, including multi-step and tree-structured reactions.

0 Citations
0 Influential
9 Altmetric
45.0 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!