2606.13253v1 Jun 11, 2026 cs.SD

Towards Personalized Federated Learning for Dysarthric Speech Recognition

Tao Zhong
Tao Zhong
Citations: 2
h-index: 1
Mengzhe Geng
Mengzhe Geng
Citations: 1,097
h-index: 20
Shujie Hu
Shujie Hu
Citations: 598
h-index: 13
Xunying Liu
Xunying Liu
Citations: 439
h-index: 8
Jiajun Deng
Jiajun Deng
Citations: 398
h-index: 11

Speech recognition is challenging for dysarthric speakers. While federated learning (FL)-based ASR can be an effective tool for protecting privacy, it suffers from heterogeneity issues caused by speaker variability. Forcing all speakers to share the same model components can be suboptimal under such heterogeneity, making personalization a promising direction; however, related research on dysarthric speech remains limited. To this end, this paper explores two aggregation strategies to achieve personalization, including the parameter-based averaging strategy and the embedding-based averaging strategy. Experiments on UASpeech and TORGO show that the proposed methods outperform the baseline regularized FedAvg by statistically significant WER reductions of up to 0.99% absolute (3.15% relative) on UASpeech and 0.56% absolute (4.73% relative) on TORGO, respectively.

0 Citations
0 Influential
10 Altmetric
50.0 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!