2606.11836v1 Jun 10, 2026 cs.SD

Towards Data-free and Training-free Compression for Speech Foundation Models Using Parameter Clustering

Youjun Chen

Citations: 54

h-index: 5

Huimeng Wang

Citations: 79

h-index: 5

Mengzhe Geng

Citations: 1,097

h-index: 20

Xunying Liu

Citations: 439

h-index: 8

Haoning Xu

Citations: 84

h-index: 5

Zhaoqing Li

Citations: 97

h-index: 5

Chengxi Deng

Citations: 39

h-index: 2

This paper presents a novel data-free and training-free compression approach for speech foundation models using channelwise clustering via k-means. More fine-grained, mixed sparsity pruning by layer-level varying number of parameter clusters is also explored. Experiments conducted on the LibriSpeech dataset suggest that when operating with pruning sparsity of 50% on HuBERT-large, consistent WER reductions of 27.73%/18.61% absolute (34.37%/21.91% relative) over the magnitude-based pruning were obtained on the test-clean and test-other subsets before fine-tuning and 0.19%/0.79% absolute (3.36%/4.62% relative) after fine-tuning with only 3 epochs. Similar WER reductions of 2.86%/5.02% absolute (59.21%/55.29% relative) were observed against magnitudebased pruning on Whisper-large-v3 at 10% sparsity, all with no significant WER increase relative to the uncompressed baseline.

0 Citations

0 Influential

10 Altmetric

50.0 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!