2606.16633v1 Jun 15, 2026 cs.CV

DCP-Prune: Ultra-Low Token Pruning with Distribution Consistency Preservation

Zirui Li
Zirui Li
Univerisity of Manchester
Citations: 2
h-index: 1
Ming-Ming Cheng
Ming-Ming Cheng
Citations: 51
h-index: 3
Xifeng Xue
Xifeng Xue
Citations: 4
h-index: 1
Xiaokang Wang
Xiaokang Wang
Citations: 24
h-index: 3
Guolei Sun
Guolei Sun
Citations: 127
h-index: 7

Recent vision token pruning methods effectively preserve model performance under moderate token budgets but become unstable under ultra-low token budget. Our analysis shows that as the pruning budget decreases, accuracy degradation is often accompanied by larger feature distribution shifts. Critically, the degree of this distribution shift strongly correlates with performance degradation. To better characterize this phenomenon, we introduce a lightweight distribution consistency metric to estimate the distribution shift between retained and full tokens. Motivated by these observations, we propose a two-stage pruning framework consisting of Anchor-Context Graph Recovery (ACGR) and Text-Aware Token Cluster Selection (TATCS). Specifically, ACGR transfers contextual information before token removal, while TATCS dynamically re-selects representative tokens when severe distribution shift is detected. Extensive experiments demonstrate that our method achieves superior and more stable performance under ultra-low token budget. Notably, it retains 92.1% of the upper-bound average performance on LLaVA-1.5-7B with only 16 visual tokens.

0 Citations
0 Influential
3.5 Altmetric
17.5 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!