2605.29657v1 May 28, 2026 cs.CV

OccamToken: Efficient VLM Inference with Training-Free and Budget-Adaptive Token Pruning

Tuo An
Jianfei Yang
Kuang Zuo
Gen Li
Guohao Chen
Ting Chen
Shilin Shan
Bofan Lyu

Vision-language models (VLMs) rely on long visual token sequences for visual understanding, making the prefill stage expensive in both computation and memory. Most existing pruning methods follow an absolute-ranking paradigm, assigning importance scores to visual tokens and retaining a fixed top-K subset. In this work, we argue that this paradigm is fundamentally brittle: attention sinks distort token importance rankings, while image redundancy and query-dependent visual evidence make fixed token budgets unreliable across inputs. We propose OccamToken, a training-free framework that replaces absolute token ranking with register-anchored relative evidence testing. Instead of asking which tokens are globally important, OccamToken evaluates whether a visual token provides information beyond a register-based reference. Our key insight is that register tokens naturally absorb low-information attention patterns, making them a stable reference for identifying genuinely informative visual evidence. Based on this principle, OccamToken performs both image-adaptive redundancy pruning and query-adaptive relevance pruning through dynamic thresholds derived from register attention. Across LLaVA-NeXT, LLaVA-v1.5, and Qwen3-VL, OccamToken consistently improves the accuracy-efficiency trade-off without additional training. Notably, on LLaVA-NeXT, it reduces 2,880 visual tokens to approximately 40 while preserving over 93% of the original accuracy, enabling stable visual token compression even in the extreme 1.4% retention regime.
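The abstract does not give the exact scoring rule, so the following is only a minimal sketch of what a register-anchored relative test might look like. It assumes each visual token and each register token already has a scalar attention mass, and that the dynamic threshold is the mean register attention scaled by a factor. The function name `register_anchored_prune`, the `margin` parameter, and the mean-based threshold are all hypothetical illustrations, not the authors' implementation.

```python
from statistics import mean


def register_anchored_prune(visual_attn, register_attn, margin=1.0):
    """Keep visual tokens whose attention exceeds a register-derived threshold.

    visual_attn:   list of floats, attention mass received by each visual token.
    register_attn: list of floats, attention mass received by each register token.
    margin:        hypothetical scale factor applied to the threshold.

    Assumption: the threshold is margin * mean(register_attn). The paper may
    derive its threshold differently; this only illustrates the idea of testing
    tokens against a register reference rather than keeping a fixed top-K.
    """
    threshold = margin * mean(register_attn)
    # A token survives only if it carries more attention than the reference.
    keep = [i for i, a in enumerate(visual_attn) if a > threshold]
    return keep, threshold


# Two inputs with the same register reference keep different numbers of
# tokens, which is the budget-adaptive behaviour the abstract describes.
registers = [0.05, 0.07]
keep_a, _ = register_anchored_prune([0.02, 0.30, 0.01, 0.25, 0.03], registers)
keep_b, _ = register_anchored_prune([0.02, 0.03, 0.01, 0.04, 0.30], registers)
print(keep_a)  # [1, 3]
print(keep_b)  # [4]

# Retention ratio quoted in the abstract: about 40 of 2,880 tokens.
print(round(100 * 40 / 2880, 1))  # 1.4
```

Unlike a fixed top-K rule, the number of retained tokens here depends on how many tokens clear the register reference for a given input. The same test could in principle be applied once to image-side attention and once to query-side attention to mirror the two pruning stages the abstract mentions, though that pairing is also an assumption.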

