2602.05406v1 Feb 05, 2026 cs.SD

자동 음성 인식 시스템 구축을 위한 지원: 아칸어 음성 장애 데이터셋

Enabling Automatic Disordered Speech Recognition: An Impaired Speech Dataset in the Akan Language

I. Wiafe

Citations: 713

h-index: 14

A. Ekpezu

Citations: 122

h-index: 7

Sumaya Ahmed Salihs

Citations: 3

h-index: 1

E. D. Atsakpo

Citations: 24

h-index: 3

Fiifi Baffoe Payin Winful

Citations: 23

h-index: 3

J. Abdulai

Citations: 1,437

h-index: 19

음성 장애 데이터의 부족은 포괄적인 음성 기술 개발, 특히 아칸어와 같은 자원 부족 언어 분야의 발전을 저해합니다. 이러한 문제를 해결하기 위해, 본 연구에서는 아칸어 원어민의 음성 장애 샘플로 구성된 큐레이션된 데이터셋을 제시합니다. 이 데이터셋은 50.01시간 분량의 오디오 녹음으로 구성되며, 발음 장애, 뇌성마비, 구개열, 뇌졸중으로 인한 음성 장애의 네 가지 유형을 포함합니다. 녹음은 통제된 환경에서 이루어졌으며, 참가자들은 미리 선택된 이미지를 자신의 말로 설명했습니다. 결과적으로 생성된 데이터셋은 오디오 녹음, 전사본, 그리고 화자의 인구통계, 장애 유형, 녹음 환경, 사용 장치에 대한 메타데이터를 포함합니다. 이 데이터셋은 자원 부족 환경에서 자동 음성 인식 시스템 및 보조 음성 기술 연구를 지원하기 위해 설계되었습니다.

Original Abstract

The lack of impaired speech data hinders advancements in the development of inclusive speech technologies, particularly in low-resource languages such as Akan. To address this gap, this study presents a curated corpus of speech samples from native Akan speakers with speech impairment. The dataset comprises of 50.01 hours of audio recordings cutting across four classes of impaired speech namely stammering, cerebral palsy, cleft palate, and stroke induced speech disorder. Recordings were done in controlled supervised environments were participants described pre-selected images in their own words. The resulting dataset is a collection of audio recordings, transcriptions, and associated metadata on speaker demographics, class of impairment, recording environment and device. The dataset is intended to support research in low-resource automatic disordered speech recognition systems and assistive speech technology.

1 Citations

0 Influential

9.5 Altmetric

48.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!