Strengthening deep-learning models for intracranial hemorrhage detection: strongly annotated computed tomography images and model ensembles-Reference-Cited by-同舟云学术

Strengthening deep-learning models for intracranial hemorrhage detection: strongly annotated computed tomography images and model ensembles

Published:2023-12-29 Issue: Volume:14 Page:
ISSN:1664-2295
Container-title:Frontiers in Neurology
language:
Short-container-title:Front. Neurol.

Author:

Kang Dong-Wan,Park Gi-Hun,Ryu Wi-Sun,Schellingerhout Dawid,Kim Museong,Kim Yong Soo,Park Chan-Young,Lee Keon-Joo,Han Moon-Ku,Jeong Han-Gil,Kim Dong-Eog

Abstract

Background and purposeMultiple attempts at intracranial hemorrhage (ICH) detection using deep-learning techniques have been plagued by clinical failures. We aimed to compare the performance of a deep-learning algorithm for ICH detection trained on strongly and weakly annotated datasets, and to assess whether a weighted ensemble model that integrates separate models trained using datasets with different ICH improves performance.MethodsWe used brain CT scans from the Radiological Society of North America (27,861 CT scans, 3,528 ICHs) and AI-Hub (53,045 CT scans, 7,013 ICHs) for training. DenseNet121, InceptionResNetV2, MobileNetV2, and VGG19 were trained on strongly and weakly annotated datasets and compared using independent external test datasets. We then developed a weighted ensemble model combining separate models trained on all ICH, subdural hemorrhage (SDH), subarachnoid hemorrhage (SAH), and small-lesion ICH cases. The final weighted ensemble model was compared to four well-known deep-learning models. After external testing, six neurologists reviewed 91 ICH cases difficult for AI and humans.ResultsInceptionResNetV2, MobileNetV2, and VGG19 models outperformed when trained on strongly annotated datasets. A weighted ensemble model combining models trained on SDH, SAH, and small-lesion ICH had a higher AUC, compared with a model trained on all ICH cases only. This model outperformed four deep-learning models (AUC [95% C.I.]: Ensemble model, 0.953[0.938–0.965]; InceptionResNetV2, 0.852[0.828–0.873]; DenseNet121, 0.875[0.852–0.895]; VGG19, 0.796[0.770–0.821]; MobileNetV2, 0.650[0.620–0.680]; p < 0.0001). In addition, the case review showed that a better understanding and management of difficult cases may facilitate clinical use of ICH detection algorithms.ConclusionWe propose a weighted ensemble model for ICH detection, trained on large-scale, strongly annotated CT scans, as no model can capture all aspects of complex tasks.

Funder

National Research Foundation

Publisher

Frontiers Media SA

Subject

Neurology (clinical),Neurology

Reference34 articles.

1. Accuracy of automated computer-aided diagnosis for stroke imaging: a critical evaluation of current evidence;Wardlaw;Stroke,2022

2. Missed diagnosis of subarachnoid hemorrhage in the emergency department;Vermeulen;Stroke,2007

3. Expert-level detection of acute intracranial hemorrhage on head computed tomography using deep learning;Kuo;Proc Natl Acad Sci U S A,2019

4. Detection and classification of intracranial haemorrhage on CT images using a novel deep-learning algorithm;Lee;Sci Rep,2020

5. Intracranial hemorrhage detection in head CT using double-branch convolutional neural network, support vector machine, and random Forest;Sage;Appl Sci,2020