Semisupervised transfer learning for evaluation of model classification performance

Author:

Wang Linshanshan1ORCID,Wang Xuan2,Liao Katherine P3,Cai Tianxi4

Affiliation:

1. Department of Biostatistics, Harvard T.H. Chan School of Public Health , Boston, MA 02115 , United States

2. Division of Biostatistics, Department of Population Health Sciences, University of Utah , Salt Lake City, UT 84108 , United States

3. Division of Rheumatology, Brigham and Women’s Hospital , Boston, MA 02115 , United States

4. Department of Biomedical Informatics, Harvard Medical School , Boston, MA 02115 , United States

Abstract

ABSTRACT In many modern machine learning applications, changes in covariate distributions and difficulty in acquiring outcome information have posed challenges to robust model training and evaluation. Numerous transfer learning methods have been developed to robustly adapt the model itself to some unlabeled target populations using existing labeled data in a source population. However, there is a paucity of literature on transferring performance metrics, especially receiver operating characteristic (ROC) parameters, of a trained model. In this paper, we aim to evaluate the performance of a trained binary classifier on unlabeled target population based on ROC analysis. We proposed Semisupervised Transfer lEarning of Accuracy Measures (STEAM), an efficient three-step estimation procedure that employs (1) double-index modeling to construct calibrated density ratio weights and (2) robust imputation to leverage the large amount of unlabeled data to improve estimation efficiency. We establish the consistency and asymptotic normality of the proposed estimator under the correct specification of either the density ratio model or the outcome model. We also correct for potential overfitting bias in the estimators in finite samples with cross-validation. We compare our proposed estimators to existing methods and show reductions in bias and gains in efficiency through simulations. We illustrate the practical utility of the proposed method on evaluating prediction performance of a phenotyping model for rheumatoid arthritis (RA) on a temporally evolving EHR cohort.

Funder

National Institutes of Health

Publisher

Oxford University Press (OUP)

Reference36 articles.

1. Assessing accuracy of a continuous screening test in the presence of verification bias;Alonzo;Journal of the Royal Statistical Society: Series C (Applied Statistics),2005

2. Semi-supervised triply robust inductive transfer learning;Cai,2022

3. Portability of an algorithm to identify rheumatoid arthritis in electronic health records;Carroll;Journal of the American Medical Informatics Association,2012

4. Robust covariate shift regression;Chen,2016

5. Estimating average treatment effects with a double-index propensity score;Cheng;Biometrics,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3