A comparison of 12 machine learning models developed to predict ploidy, using a morphokinetic meta-dataset of 8147 embryos

Author:

Bamford Thomas1ORCID,Easter Christina1,Montgomery Sue2,Smith Rachel2,Dhillon-Smith Rima K1ORCID,Barrie Amy2ORCID,Campbell Alison2ORCID,Coomarasamy Arri1ORCID

Affiliation:

1. Tommy's National Centre for Miscarriage Research, Institute of Metabolism and Systems Research, College of Medical and Dental Sciences, University of Birmingham , Edgbaston, UK

2. Care Fertility Headquarters , Nottingham, UK

Abstract

AbstractSTUDY QUESTIONAre machine learning methods superior to traditional statistics in predicting blastocyst ploidy status using morphokinetic and clinical biodata?SUMMARY ANSWERMixed effects logistic regression performed better than all machine learning methods for ploidy prediction using our dataset of 8147 embryos.WHAT IS KNOWN ALREADYMorphokinetic timings have been demonstrated to be delayed in aneuploid embryos. Machine learning and statistical models are increasingly being built, however, until now they have been limited by data insufficiency.STUDY DESIGN, SIZE, DURATIONThis is a multicentre cohort study. Data were obtained from 8147 biopsied blastocysts from 1725 patients, treated from 2012 to 2020.PARTICIPANTS/MATERIALS, SETTING, METHODSAll embryos were cultured in a time-lapse system at nine IVF clinics in the UK. A total of 3004 euploid embryos and 5023 aneuploid embryos were included in the final verified dataset. We developed a total of 12 models using four different approaches: mixed effects multivariable logistic regression, random forest classifiers, extreme gradient boosting, and deep learning. For each of the four algorithms, two models were created, the first consisting of 22 covariates using 8027 embryos (Dataset 1) and the second, a dataset of 2373 embryos and 26 covariates (Dataset 2). Four final models were created by switching the target outcome from euploid to aneuploid for each algorithm (Dataset 1). Models were validated using internal–external cross-validation and external validation.MAIN RESULTS AND THE ROLE OF CHANCEAll morphokinetic variables were significantly delayed in aneuploid embryos. The likelihood of euploidy was significantly increased the more expanded the blastocyst (P < 0.001) and the better the trophectoderm grade (P < 0.01). Univariable analysis showed no association with ploidy status for morula or cleavage stage fragmentation, morula grade, fertilization method, sperm concentration, or progressive motility. Male age did not correlate with the percentage of euploid embryos when stratified for female age. Multinucleation at the two-cell or four-cell stage was not associated with ploidy status. The best-performing model was logistic regression built using the larger dataset with 22 predictors (F1 score 0.59 for predicting euploidy; F1 score 0.77 for predicting aneuploidy; AUC 0.71; 95% CI 0.67–0.73). The best-performing models using the algorithms from random forest, extreme gradient boosting, and deep learning achieved an AUC of 0.68, 0.63, and 0.63, respectively. When using only morphokinetic predictors the AUC was 0.61 for predicting ploidy status, whereas a model incorporating only embryo grading was unable to discriminate aneuploid embryos (AUC = 0.52). The ploidy prediction model’s performance improved with increasing age of the egg provider.LIMITATIONS, REASONS FOR CAUTIONThe models have not been validated in a prospective study design or yet been used to determine whether they improve clinical outcomesWIDER IMPLICATIONS OF THE FINDINGSThis model may aid decision-making, particularly where pre-implantation genetic testing for aneuploidy is not permitted or for prioritizing embryos for biopsy.STUDY FUNDING/COMPETING INTEREST(S)No specific funding was sought for this study; university funds supported the first author. A.Ca. is a minor shareholder of participating centres.TRIAL REGISTRATION NUMBERN/A.

Publisher

Oxford University Press (OUP)

Subject

Obstetrics and Gynecology,Rehabilitation,Reproductive Medicine

Reference54 articles.

1. Interpretable, not black-box, artificial intelligence should be used for embryo selection;Afnan;Hum Reprod Open,2021

2. Time-lapse systems for embryo incubation and assessment in assisted reproduction;Armstrong;Cochrane Database Syst Rev,2019

3. Random Forest

4. Morphological and morphokinetic associations with aneuploidy: a systematic review and meta-analysis;Bamford;Hum Reprod Update,2022

5. P165 Design, implementation and results of a group-wide, embryo morphokinetic annotation quality assurance scheme across ten fertility clinics. Fertility 2021 Barriers and breakthroughs 6–10th January 2021 Online;Barrie;Hum Fertil,2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3