Classifying Autism From Crowdsourced Semistructured Speech Recordings: Machine Learning Model Comparison Study

Published: 2022-04-14 Issue: 2 Volume: 5 Page: e35406
ISSN: 2561-6722
Container-title: JMIR Pediatrics and Parenting
Language: en
Short-container-title: JMIR Pediatr Parent

Author:

Chi Nathan A, Washington Peter, Kline Aaron, Husic Arman, Hou Cathy, He Chloe, Dunlap Kaitlyn, Wall Dennis P

Abstract

Background: Autism spectrum disorder (ASD) is a neurodevelopmental disorder that results in altered behavior, social development, and communication patterns. In recent years, autism prevalence has tripled, with 1 in 44 children now affected. Because traditional diagnosis is a lengthy, labor-intensive process that requires the work of trained physicians, significant attention has been given to developing systems that detect autism automatically. We work toward this goal by analyzing audio data, as prosody abnormalities are a signal of autism: affected children display speech idiosyncrasies such as echolalia, monotonous intonation, atypical pitch, and irregular linguistic stress patterns.

Objective: We aimed to test the ability of machine learning approaches to aid in the detection of autism in self-recorded speech audio captured from children with ASD and neurotypical (NT) children in their home environments.

Methods: We considered three methods to detect autism in child speech: (1) random forests trained on extracted audio features (including Mel-frequency cepstral coefficients); (2) convolutional neural networks trained on spectrograms; and (3) fine-tuned wav2vec 2.0, a state-of-the-art transformer-based speech recognition model. We trained our classifiers on our novel data set of cellphone-recorded child speech audio curated from the Guess What? mobile game, an app designed to crowdsource videos of children with ASD and NT children in a natural home environment.

Results: The random forest classifier achieved 70% accuracy, the fine-tuned wav2vec 2.0 model achieved 77% accuracy, and the convolutional neural network achieved 79% accuracy when classifying children's audio as either ASD or NT. We used 5-fold cross-validation to evaluate model performance.

Conclusions: Our models were able to predict autism status when trained on a varied selection of home audio clips with inconsistent recording qualities, which may be more representative of real-world conditions. The results demonstrate that machine learning methods offer promise in detecting autism automatically from speech without specialized equipment.
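The abstract reports accuracies estimated with 5-fold cross-validation. As a rough illustration of that evaluation scheme only, the sketch below uses the Python standard library. Everything in it is an assumption made for the example: the feature vectors are synthetic stand-ins for extracted audio features, the nearest-centroid classifier is a deliberately simple substitute for the random forest, convolutional network, and wav2vec 2.0 models, and the function names are hypothetical. It is not the authors' pipeline.

```python
import random


def k_fold_indices(n_samples, k=5, seed=0):
    """Shuffle sample indices and deal them into k near-equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def fit_centroids(train_x, train_y):
    """Return the mean feature vector for each class label."""
    centroids = {}
    for label in sorted(set(train_y)):
        rows = [x for x, y in zip(train_x, train_y) if y == label]
        centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]
    return centroids


def predict(centroids, x):
    """Predict the label whose centroid is closest (squared Euclidean)."""
    def sq_dist(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(centroids, key=lambda label: sq_dist(centroids[label]))


def cross_validated_accuracy(features, labels, k=5, seed=0):
    """Train on k-1 folds, test on the held-out fold, average accuracy."""
    scores = []
    for held_out in k_fold_indices(len(features), k, seed):
        held = set(held_out)
        train_x = [features[i] for i in range(len(features)) if i not in held]
        train_y = [labels[i] for i in range(len(labels)) if i not in held]
        model = fit_centroids(train_x, train_y)
        correct = sum(predict(model, features[i]) == labels[i] for i in held_out)
        scores.append(correct / len(held_out))
    return sum(scores) / k, scores


if __name__ == "__main__":
    # Synthetic 3-dimensional "features": NOT real speech data.
    rng = random.Random(42)
    feats = [[rng.gauss(1.0, 1.0) for _ in range(3)] for _ in range(50)]
    feats += [[rng.gauss(-1.0, 1.0) for _ in range(3)] for _ in range(50)]
    labs = ["ASD"] * 50 + ["NT"] * 50
    mean_acc, per_fold = cross_validated_accuracy(feats, labs, k=5)
    print("per-fold accuracy:", per_fold)
    print("mean accuracy:", mean_acc)
```

Each sample is held out exactly once, so the mean of the per-fold accuracies estimates performance on unseen clips. With real recordings one would probably also want to keep all clips from the same child in the same fold; the abstract does not say how the folds were formed, so that detail is left out here.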

Publisher

JMIR Publications Inc.

Subject

Computer Science Applications,Health Informatics,Biomedical Engineering,Pediatrics, Perinatology and Child Health

References: 64 articles.

1. Autism spectrum disorder: definition, epidemiology, causes, and clinical evaluation

2. Prevalence and Characteristics of Autism Spectrum Disorder Among Children Aged 8 Years — Autism and Developmental Disabilities Monitoring Network, 11 Sites, United States, 2018

3. Identification, Evaluation, and Management of Children With Autism Spectrum Disorder

4. Identification and Quantification of Gaps in Access to Autism Resources in the United States: An Infodemiological Study

5. Timeliness of Autism Spectrum Disorder Diagnosis and Use of Services Among U.S. Elementary School–Aged Children

Cited by 27 articles.

1. Brain functional gradient and structure features in adolescent and adult autism spectrum disorders;Human Brain Mapping;2024-07-22

2. Multimodal deep learning for dementia classification using text and audio;Scientific Reports;2024-06-16

3. Voice as a Biomarker of Pediatric Health: A Scoping Review;Children;2024-06-04

4. A Perspective on Crowdsourcing and Human-in-the-Loop Workflows in Precision Health;Journal of Medical Internet Research;2024-04-11

5. Utilizing Constructed Neural Networks for Autism Screening;Applied Sciences;2024-04-05