Author:
Dakka M. A., Nguyen T. V., Hall J. M. M., Diakiw S. M., VerMilyea M., Linke R., Perugini M., Perugini D.
Abstract
The detection and removal of poor-quality data in a training set is crucial to achieving high-performing AI models. In healthcare, data can be inherently poor-quality due to uncertainty or subjectivity, but, as is often the case, data-privacy requirements restrict AI practitioners from accessing raw training data, making manual visual verification of private patient data impossible. Here we describe a novel method for the automated identification of poor-quality data, called Untrainable Data Cleansing. This method is shown to have numerous benefits, including protection of private patient data; improvement in AI generalizability; and reduction in the time, cost, and data needed for training; all while offering a truer reporting of AI performance itself. Additionally, results show that Untrainable Data Cleansing could be useful as a triage tool to identify difficult clinical cases that may warrant in-depth evaluation or additional testing to support a diagnosis.
Publisher
Springer Science and Business Media LLC
Cited by
11 articles.