The Use of Machine Learning Models and Radiomics for Segmentation and Classification of Adnexal Masses on Ultrasound: A multi-cohort retrospective study

Author:

Barcroft Jennifer F, Linton-Reid Kristofer, Landolfo Chiara, Al Memar Maya, Parker Nina, Kyriacou Chris, Munaretto Maria, Fantauzzi Martina, Cooper Nina, Yazbek Joseph, Bharwani Nishat, Lee Sa Ra, Kim Ju Hee, Timmerman Dirk, Posma Joram M., Savelli Luca, Saso Srdjan, Aboagye Eric O., Bourne Tom

Abstract

Background

Ovarian cancer remains the deadliest of all gynaecological cancers. Ultrasound-based models exist to support the classification of adnexal masses, but they depend on human assessment of ultrasound features. We therefore aimed to develop an end-to-end machine learning (ML) model capable of automating the classification of adnexal masses.

Methods

In this retrospective study, transvaginal ultrasound scan images were extracted and segmented from Imperial College Healthcare, UK (ICH development dataset; n=577 masses; 1444 images) and Morgagni-Pierantoni Hospital, Italy (MPH external dataset; n=184 masses; 476 images). Clinical data including age, CA-125 and diagnosis (ultrasound subjective assessment (SA) or histology) were collected. A segmentation and classification model was developed by comparing several candidate models built on convolutional neural networks and on traditional radiomics features. Segmentation performance was measured with the Dice surface coefficient; classification performance was measured with the area under the ROC curve (AUC), F1-score and recall.

Findings

Median age was 45 years (IQR 35-60) in the ICH dataset and 48 years (IQR 38-57) in the MPH dataset; 23·1% and 31·5% of cases, respectively, were malignant. The best segmentation model achieved a Dice surface coefficient of 0·85 ± 0·01, 0·88 ± 0·01 and 0·85 ± 0·01 in the ICH training, ICH validation and MPH test sets, respectively. The best classification model achieved a recall of 1·00 and an F1-score of 0·88 (AUC 0·93), 0·94 (AUC 0·89) and 0·83 (AUC 0·90) in the ICH training, ICH validation and MPH test sets, respectively.

Interpretation

The ML model provides an end-to-end method of adnexal mass segmentation and classification, with a predictive performance (AUC 0·90) comparable to the published performance of expert subjective assessment (SA, the gold standard) and of current risk models. Further prospective evaluation of the classification performance of the ML model against existing methods is required.

Funding

Medical Research Council, Imperial STRATiGRAD PhD programme and Imperial Health Charity.

Research in Context

Evidence before this study

Adnexal masses are common, affecting up to 18% of postmenopausal women. Ultrasound is the primary imaging modality for the assessment of adnexal masses. Accurate classification of adnexal masses is fundamental to inform appropriate management. However, all existing classification methods are subjective and rely upon ultrasound expertise.

Various models have been developed using ultrasound features and serological markers to support the classification of adnexal masses, such as the Risk of Malignancy Index (RMI), the International Ovarian Tumour Analysis (IOTA) Simple Rules (SR), the IOTA Assessment of Different NEoplasias in the adneXa (ADNEX) model, and the American College of Radiology (ACR) Ovarian-Adnexal Reporting and Data System Ultrasound (ORADS-US). Despite these modelling efforts, expert subjective assessment remains the gold standard method of classifying adnexal masses.

The use of machine learning (ML) within clinical imaging is a rapidly evolving field because of its potential to overcome the subjectivity of image assessment and interpretation. Studies (n=17) evaluating the use of ML in the classification of adnexal masses on ultrasound were summarised in a recent meta-analysis by Xu et al, 2022. No studies used a radiomics-based approach to the classification of adnexal masses, and most were not externally validated in a test set, which calls their generalisability into question. The largest study to date (Gao et al, 2022) used a deep learning (DL) based approach and was externally validated, yet its performance (F1-score 0·551) was not comparable to that of existing classification approaches.

Added value of this study

We have developed an end-to-end ML model (ODS) using DL and radiomics-based approaches, capable of identification (automated segmentation) and classification of adnexal masses with a high detection rate for malignancy. The ODS model had a performance comparable to the published performance of existing adnexal mass classification methods and does not rely upon ultrasound experience.

Implications of all the available evidence

ODS is a high-performing, end-to-end model capable of classifying adnexal masses and requires limited ultrasound operator experience. The ODS model is potentially generalisable, having shown consistent performance in both the validation (internal) and test (external) sets, which highlights the potential clinical value of a radiomics-based model in the classification of adnexal masses on ultrasound. The ODS model could function as a scalable triage tool to identify high-risk adnexal masses requiring further ultrasound assessment by an expert.
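
The evaluation metrics named in the Methods (Dice coefficient, recall and F1-score) can be illustrated with a minimal Python sketch. It is not the authors' code: the function names are hypothetical, the masks and labels are toy data, and the Dice function assumes the standard overlap form 2|A∩B| / (|A| + |B|), since the abstract does not define its "Dice surface coefficient" precisely. The last helper rearranges F1 = 2PR / (P + R) to show what precision a reported recall and F1-score would imply, assuming both are computed at the same decision threshold.

```python
from typing import Sequence, Tuple


def dice_coefficient(pred: Sequence[int], truth: Sequence[int]) -> float:
    """Overlap Dice score for two flattened binary masks (1 = mass, 0 = background)."""
    if len(pred) != len(truth):
        raise ValueError("masks must contain the same number of pixels")
    intersection = sum(p & t for p, t in zip(pred, truth))
    total = sum(pred) + sum(truth)
    # Assumption: two empty masks are treated as perfect agreement.
    return 1.0 if total == 0 else 2.0 * intersection / total


def recall_precision_f1(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float, float]:
    """Recall, precision and F1 for binary labels (1 = malignant, 0 = benign)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    recall = tp / (tp + fn) if tp + fn else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return recall, precision, f1


def implied_precision(f1: float, recall: float) -> float:
    """Solve F1 = 2PR / (P + R) for P, giving P = F1 * R / (2R - F1)."""
    denominator = 2 * recall - f1
    if denominator <= 0:
        raise ValueError("no valid precision exists for this F1/recall pair")
    return f1 * recall / denominator


if __name__ == "__main__":
    # Toy 4-pixel masks: one pixel overlaps, three pixels are marked in total.
    print(dice_coefficient([1, 1, 0, 0], [1, 0, 0, 0]))
    # Toy labels: both malignant cases are found, with one false positive.
    print(recall_precision_f1([1, 1, 0, 0, 0], [1, 1, 1, 0, 0]))
    # Precision implied by a recall of 1.00 and an F1-score of 0.88.
    print(round(implied_precision(0.88, 1.00), 3))
```

Read this way, a recall of 1·00 paired with an F1-score below 1 means that every malignant mass was flagged at the cost of some false positives, which fits the triage role proposed for the model.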

Publisher

Cold Spring Harbor Laboratory
