A New Tool for Holistic Residency Application Review: Using Natural Language Processing of Applicant Experiences to Predict Interview Invitation

Author:

Mahtani Arun Umesh1ORCID,Reinstein Ilan2,Marin Marina3,Burk-Rafel Jesse4

Affiliation:

1. A.U. Mahtaniis a resident, Richmond University Medical Center/Mount Sinai, Staten Island, New York; ORCID:.

2. I. Reinsteinis a data science engineer, Institute for Innovations in Medical Education, NYU Grossman School of Medicine, New York, New York.

3. M. Marinis director, Division of Academic Analytics, Institute for Innovations in Medical Education, NYU Grossman School of Medicine, New York, New York.

4. J. Burk-Rafelis assistant professor of medicine and assistant director, Precision and Translational Medical Education, Institute for Innovations in Medical Education, NYU Grossman School of Medicine, New York, New York; ORCID:.

Abstract

Problem Reviewing residency application narrative components is time intensive and has contributed to nearly half of applications not receiving holistic review. The authors developed a natural language processing (NLP)–based tool to automate review of applicants’ narrative experience entries and predict interview invitation. Approach Experience entries (n = 188,500) were extracted from 6,403 residency applications across 3 application cycles (2017–2019) at 1 internal medicine program, combined at the applicant level, and paired with the interview invitation decision (n = 1,224 invitations). NLP identified important words (or word pairs) with term frequency-inverse document frequency, which were used to predict interview invitation using logistic regression with L1 regularization. Terms remaining in the model were analyzed thematically. Logistic regression models were also built using structured application data and a combination of NLP and structured data. Model performance was evaluated on never-before-seen data using area under the receiver operating characteristic and precision–recall curves (AUROC, AUPRC). Outcomes The NLP model had an AUROC of 0.80 (vs chance decision of 0.50) and AUPRC of 0.49 (vs chance decision of 0.19), showing moderate predictive strength. Phrases indicating active leadership, research, or work in social justice and health disparities were associated with interview invitation. The model’s detection of these key selection factors demonstrated face validity. Adding structured data to the model significantly improved prediction (AUROC 0.92, AUPRC 0.73), as expected given reliance on such metrics for interview invitation. Next Steps This model represents a first step in using NLP-based artificial intelligence tools to promote holistic residency application review. The authors are assessing the practical utility of using this model to identify applicants screened out using traditional metrics. Generalizability must be determined through model retraining and evaluation at other programs. Work is ongoing to thwart model “gaming,” improve prediction, and remove unwanted biases introduced during model training.

Publisher

Ovid Technologies (Wolters Kluwer Health)

Subject

Education,General Medicine

Cited by 5 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3