Investigation of Super Learner Methodology on HIV-1 Small Sample: Application on Jaguar Trial Data

Author:

Houssaïni Allal12,Assoumou Lambert12,Marcelin Anne Geneviève123,Molina Jean Michel4,Calvez Vincent123,Flandre Philippe123

Affiliation:

1. INSERM, UMR-S 943, 56 Boulevard Vincent Auriol, BP 335, 75625 Paris Cedex 13, France

2. UPMC Univ Paris 06, UMR S943, Paris, France

3. Service de Virologie, Hôpital Pitié-Salpêtrière, AP-HP, Paris, France

4. Service des Maladies Infectieuses, Hôpital Saint Louis, AP-HP, Paris, France

Abstract

Background. Many statistical models have been tested to predict phenotypic or virological response from genotypic data. A statistical framework called Super Learner has been introduced either to compare different methods/learners (discrete Super Learner) or to combine them in a Super Learner prediction method.Methods. The Jaguar trial is used to apply the Super Learner framework. The Jaguar study is an “add-on” trial comparing the efficacy of adding didanosine to an on-going failing regimen. Our aim was also to investigate the impact on the use of different cross-validation strategies and different loss functions. Four different repartitions between training set and validations set were tested through two loss functions. Six statistical methods were compared. We assess performance by evaluatingR2values and accuracy by calculating the rates of patients being correctly classified.Results. Our results indicated that the more recent Super Learner methodology of building a new predictor based on a weighted combination of different methods/learners provided good performance. A simple linear model provided similar results to those of this new predictor. Slight discrepancy arises between the two loss functions investigated, and slight difference arises also between results based on cross-validated risks and results from full dataset. The Super Learner methodology and linear model provided around 80% of patients correctly classified. The difference between the lower and higher rates is around 10 percent. The number of mutations retained in different learners also varys from one to 41.Conclusions. The more recent Super Learner methodology combining the prediction of many learners provided good performance on our small dataset.

Funder

Sidaction

Publisher

Hindawi Limited

Subject

Infectious Diseases,Public Health, Environmental and Occupational Health,Dermatology,Immunology and Allergy

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3