Potential and limitations of machine meta-learning (ensemble) methods for predicting COVID-19 mortality in a large inhospital Brazilian dataset

Author:

de Paiva Bruno Barbosa MirandaORCID,Pereira Polianna DelfinoORCID,de Andrade Claudio Moisés ValienseORCID,Gomes Virginia Mara ReisORCID,Souza-Silva Maira Viana RegoORCID,Martins Karina Paula Medeiros PradoORCID,Sales Thaís Lorenna SouzaORCID,de Carvalho Rafael Lima RodriguesORCID,Pires Magda CarvalhoORCID,Ramos Lucas Emanuel FerreiraORCID,Silva Rafael TavaresORCID,de Freitas Martins Vieira AlessandraORCID,Nunes Aline Gabrielle SousaORCID,de Oliveira Jorge AlziraORCID,de Oliveira Maurílio AmandaORCID,Scotton Ana Luiza Bahia AlvesORCID,da Silva Carla Thais Candida AlvesORCID,Cimini Christiane Corrêa RodriguesORCID,Ponce DanielaORCID,Pereira Elayne CrestaniORCID,Manenti Euler Roberto FernandesORCID,Rodrigues Fernanda d’AthaydeORCID,Anschau FernandoORCID,Botoni Fernando AntônioORCID,Bartolazzi FredericoORCID,Grizende Genna Maira SantosORCID,Noal Helena CarolinaORCID,Duani HelenaORCID,Gomes Isabela MoraesORCID,Costa Jamille Hemétrio Salles MartinsORCID,di Sabatino Santos Guimarães JúliaORCID,Tupinambás Julia TeixeiraORCID,Rugolo Juliana MachadoORCID,Batista Joanna d’Arc LyraORCID,de Alvarenga Joice CoutinhoORCID,Chatkin José MiguelORCID,Ruschel Karen BrasilORCID,Zandoná Liege BarellaORCID,Pinheiro Lílian SantosORCID,Menezes Luanna Silva MonteiroORCID,de Oliveira Lucas Moyses Carvalho,Kopittke LucianeORCID,Assis Luisa ArgoloORCID,Marques Luiza MargotoORCID,Raposo Magda CesarORCID,Floriani Maiara AnschauORCID,Bicalho Maria Aparecida CamargosORCID,Nogueira Matheus Carvalho AlvesORCID,de Oliveira Neimy RamosORCID,Ziegelmann Patricia KlarmannORCID,Paraiso Pedro GibsonORCID,de Lima Martelli Petrônio JoséORCID,Senger RobertaORCID,Menezes Rochele MosmannORCID,Francisco Saionara CristinaORCID,Araújo Silvia FerreiraORCID,Kurtz TatianaORCID,Fereguetti Tatiani OliveiraORCID,de Oliveira Thainara ConceiçãoORCID,Ribeiro Yara Cristina Neves Marques BarbosaORCID,Ramires Yuri CarlottoORCID,Lima Maria Clara Pontello BarbosaORCID,Carneiro MarceloORCID,Bezerra Adriana Falangola BenjaminORCID,Schwarzbold Alexandre VargasORCID,de Moura Costa André SoaresORCID,Farace Barbara LopesORCID,Silveira Daniel VitorioORCID,de Almeida Cenci Evelin PaolaORCID,Lucas Fernanda BarbosaORCID,Aranha Fernando GraçaORCID,Bastos Gisele Alsina NaderORCID,Vietta Giovanna GrunewaldORCID,Nascimento Guilherme FagundesORCID,Vianna Heloisa ReniersORCID,Guimarães Henrique CerqueiraORCID,de Morais Julia Drumond ParreirasORCID,Moreira Leila BeltramiORCID,de Oliveira Leonardo SeixasORCID,de Deus Sousa LucasORCID,de Souza Viana LucianoORCID,de Souza Cabral Máderson AlvaresORCID,Ferreira Maria Angélica PiresORCID,de Godoy Mariana FrizzoORCID,de Figueiredo Meire PereiraORCID,Guimarães-Junior Milton HenriquesORCID,de Paula de Sordi Mônica AparecidaORCID,da Cunha Severino Sampaio NatáliaORCID,Assaf Pedro LedicORCID,Lutkmeier RaquelORCID,Valacio Reginaldo AparecidoORCID,Finger Renan GoulartORCID,de Freitas RufinoORCID,Guimarães Silvana Mangeon MeirellesORCID,Oliveira Talita FischerORCID,Diniz Thulio Henrique OliveiraORCID,Gonçalves Marcos AndréORCID,Marcolino Milena SorianoORCID

Abstract

AbstractThe majority of early prediction scores and methods to predict COVID-19 mortality are bound by methodological flaws and technological limitations (e.g., the use of a single prediction model). Our aim is to provide a thorough comparative study that tackles those methodological issues, considering multiple techniques to build mortality prediction models, including modern machine learning (neural) algorithms and traditional statistical techniques, as well as meta-learning (ensemble) approaches. This study used a dataset from a multicenter cohort of 10,897 adult Brazilian COVID-19 patients, admitted from March/2020 to November/2021, including patients [median age 60 (interquartile range 48–71), 46% women]. We also proposed new original population-based meta-features that have not been devised in the literature. Stacking has shown to achieve the best results reported in the literature for the death prediction task, improving over previous state-of-the-art by more than 46% in Recall for predicting death, with AUROC 0.826 and MacroF1 of 65.4%. The newly proposed meta-features were highly discriminative of death, but fell short in producing large improvements in final prediction performance, demonstrating that we are possibly on the limits of the prediction capabilities that can be achieved with the current set of ML techniques and (meta-)features. Finally, we investigated how the trained models perform on different hospitals, showing that there are indeed large differences in classifier performance between different hospitals, further making the case that errors are produced by factors that cannot be modeled with the current predictors.

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

Cited by 4 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3