2606.16137v1 Jun 15, 2026 cs.CL

XAI-Grounded Explanation Generation for Speech Deepfake Detection with Training-Free Multimodal Large Language Models

Yupei Li
Yupei Li
Citations: 98
h-index: 6
Berrak Sisman
Berrak Sisman
Citations: 2,477
h-index: 26
Bjorn W. Schuller
Bjorn W. Schuller
Citations: 126
h-index: 5
Chenxi Wang
Chenxi Wang
Citations: 183
h-index: 6
Qiyang Sun
Qiyang Sun
Citations: 119
h-index: 6
Xiao-lan Wu
Xiao-lan Wu
Citations: 47
h-index: 4

Speech deepfake detection (SDD) systems require trustworthy explanations for reliable decision-making. Existing explanation ways mainly fall into two categories. Traditional explainable AI (XAI), such as gradient-based attribution, produces low-level attribution signals tightly coupled with model decisions, and harder to be understood by human than natural language explanations. Meanwhile, large language model (LLM)-based explanation generation often produces generic and ungrounded descriptions due to the lack of heuristic evidence and task-specific supervision, stemming from limited grounded explanation datasets for SDD. We therefore propose a training-free explanation framework that integrates XAI evidence with multimodal LLMs to generate grounded and specific explanations. Using the PartialSpoof dataset, we construct a grounded explanation dataset and show that methods with XAI increase inside accuracy by over 45\%, verified through human evaluation and faithfulness checks.

0 Citations
0 Influential
13 Altmetric
65.0 Score
Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

Log in to request an AI analysis.

댓글

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!