AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR

Author:

Olatunji Tobi [1,2], Afonja Tejumade [3,4,2], Yadavalli Aditya [5,2], Emezue Chris Chinenye [6,7,2], Singh Sahib [8,2], Dossou Bonaventure F. P. [6,7,9,10,2], Osuchukwu Joanne [1], Osei Salomey [11,2], Tonja Atnafu Lambebo [9,12,13,2], Etori Naome [14,2], Mbataku Clinton [4,2]

Affiliation:

1. Intron Health

2. Masakhane NLP

3. CISPA Helmholtz Center for Information Security

4. AI Saturdays Lagos

5. Karya Inc

6. Mila Quebec AI Institute

7. Lanfrica

8. Ford Motor Company

9. Lelapa AI

10. McGill University

11. University of Deusto

12. Instituto Politécnico Nacional

13. University of Colorado, Colorado Springs

14. University of Minnesota

Abstract

Africa has a very poor doctor-to-patient ratio. At very busy clinics, doctors could see 30+ patients per day, a heavy patient burden compared with developed countries, but productivity tools such as clinical automatic speech recognition (ASR) are lacking for these overworked clinicians. In developed nations, by contrast, clinical ASR is mature, even ubiquitous, and clinician-reported performance of commercial clinical ASR systems is generally satisfactory. Furthermore, the recent performance of general domain ASR is approaching human accuracy. However, several gaps exist. Several publications have highlighted racial bias in speech-to-text algorithms, and performance on minority accents lags significantly. To our knowledge, there is no publicly available research or benchmark on accented African clinical ASR, and speech data is non-existent for the majority of African accents. We release AfriSpeech, 200 hours of Pan-African English speech comprising 67,577 clips from 2,463 unique speakers across 120 indigenous accents from 13 countries for clinical and general domain ASR, together with a benchmark test set and publicly available pre-trained models with SOTA performance on the AfriSpeech benchmark.

Publisher

MIT Press

Subject

Artificial Intelligence, Computer Science Applications, Linguistics and Language, Human-Computer Interaction, Communication

