2606.11836v1 Jun 10, 2026 cs.SD

Towards Data-free and Training-free Compression for Speech Foundation Models Using Parameter Clustering

Youjun Chen
Youjun Chen
Citations: 54
h-index: 5
Huimeng Wang
Huimeng Wang
Citations: 79
h-index: 5
Mengzhe Geng
Mengzhe Geng
Citations: 1,097
h-index: 20
Xunying Liu
Xunying Liu
Citations: 439
h-index: 8
Haoning Xu
Haoning Xu
Citations: 84
h-index: 5
Zhaoqing Li
Zhaoqing Li
Citations: 97
h-index: 5
Chengxi Deng
Chengxi Deng
Citations: 39
h-index: 2

This paper presents a novel data-free and training-free compression approach for speech foundation models using channelwise clustering via k-means. More fine-grained, mixed sparsity pruning by layer-level varying number of parameter clusters is also explored. Experiments conducted on the LibriSpeech dataset suggest that when operating with pruning sparsity of 50% on HuBERT-large, consistent WER reductions of 27.73%/18.61% absolute (34.37%/21.91% relative) over the magnitude-based pruning were obtained on the test-clean and test-other subsets before fine-tuning and 0.19%/0.79% absolute (3.36%/4.62% relative) after fine-tuning with only 3 epochs. Similar WER reductions of 2.86%/5.02% absolute (59.21%/55.29% relative) were observed against magnitudebased pruning on Whisper-large-v3 at 10% sparsity, all with no significant WER increase relative to the uncompressed baseline.

0 Citations
0 Influential
10 Altmetric
50.0 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!