2605.28805v1 May 27, 2026 cs.CL

OmniVerifier-M1: Multimodal Meta-Verifier with Explicit Structured Recalibration

Ling Yang
Ling Yang
Citations: 3,474
h-index: 9
Yujiu Yang
Yujiu Yang
Citations: 463
h-index: 8
Youliang Zhang
Youliang Zhang
Citations: 49
h-index: 3
Chufan Shi
Chufan Shi
Citations: 627
h-index: 12
Yizhen Zhang
Yizhen Zhang
Citations: 50
h-index: 5
Xinchen Zhang
Xinchen Zhang
Citations: 340
h-index: 7
Bo Liu
Bo Liu
Citations: 1,579
h-index: 2
Jiale Liu
Jiale Liu
Citations: 2,183
h-index: 9
Junhong Liu
Junhong Liu
Citations: 40
h-index: 1
Zhiheng Li
Zhiheng Li
Citations: 74
h-index: 2

Visual outcomes are increasingly central to multimodal large language models, making reliable and fine-grained verification essential for scaling generalist foundation models. In this work, we investigate multimodal meta-verification, which leverages verifier-generated rationales rather than decision-only signals, and explore how to effectively incorporate meta-verification feedback into multimodal verifier training. We identify two key findings. First, symbolic verifier outputs (e.g., bounding boxes) outperform textual explanations as meta-verification rationales, enabling efficient rule-based reinforcement learning rewards while avoiding reliance on model-based rewards from auxiliary judge models. Second, decoupling reinforcement learning objectives for binary judgment and meta-verification substantially outperforms joint reward optimization, due to intrinsic differences in output structure and learning dynamics. Based on these insights, we train OmniVerifier-M1, a generalist visual verifier leveraging symbolic meta-verification and decoupled reinforcement learning. OmniVerifier-M1 provides robust verification and fine-grained error localization, and further enables M1-TTS, a verifier-driven agentic generation system achieving dynamic region-level self-correction. This approach paves the way for more reliable, interpretable, and fine-grained multimodal verification, supporting safer and more controllable foundation model deployment.

0 Citations
0 Influential
6 Altmetric
30.0 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!