Multicenter, Head-to-Head, Real-World Validation Study of Seven Automated Artificial Intelligence Diabetic Retinopathy Screening Systems

Author:

Lee Aaron Y.123ORCID,Yanagihara Ryan T.1,Lee Cecilia S.12,Blazes Marian1,Jung Hoon C.12,Chee Yewlin E.1,Gencarella Michael D.1,Gee Harry4,Maa April Y.56,Cockerham Glenn C.78,Lynch Mary59,Boyko Edward J.1011ORCID

Affiliation:

1. Department of Ophthalmology, University of Washington School of Medicine, Seattle, WA

2. Department of Ophthalmology, Puget Sound Veteran Affairs, Seattle, WA

3. eScience Institute, University of Washington, Seattle, WA

4. Office of Information and Technology, Clinical Imaging, Seattle, WA

5. Department of Ophthalmology, Emory University School of Medicine, Atlanta, GA

6. Regional Telehealth Services, Veterans Affairs Southeast Network Veterans Integrated Service Networks (VISN) 7, Duluth, GA

7. Veterans Health Administration, Specialty Care Services, Washington, DC

8. Ophthalmology Service, Stanford University School of Medicine, Palo Alto, CA

9. Ophthalmology Section, Atlanta Veterans Affairs Medical Center, Atlanta, GA

10. Seattle Epidemiologic Research and Information Center, Department of Veterans Affairs Medical Center, Seattle, WA

11. Department of Medicine, University of Washington, Seattle, WA

Abstract

OBJECTIVE With rising global prevalence of diabetic retinopathy (DR), automated DR screening is needed for primary care settings. Two automated artificial intelligence (AI)–based DR screening algorithms have U.S. Food and Drug Administration (FDA) approval. Several others are under consideration while in clinical use in other countries, but their real-world performance has not been evaluated systematically. We compared the performance of seven automated AI-based DR screening algorithms (including one FDA-approved algorithm) against human graders when analyzing real-world retinal imaging data. RESEARCH DESIGN AND METHODS This was a multicenter, noninterventional device validation study evaluating a total of 311,604 retinal images from 23,724 veterans who presented for teleretinal DR screening at the Veterans Affairs (VA) Puget Sound Health Care System (HCS) or Atlanta VA HCS from 2006 to 2018. Five companies provided seven algorithms, including one with FDA approval, that independently analyzed all scans, regardless of image quality. The sensitivity/specificity of each algorithm when classifying images as referable DR or not were compared with original VA teleretinal grades and a regraded arbitrated data set. Value per encounter was estimated. RESULTS Although high negative predictive values (82.72–93.69%) were observed, sensitivities varied widely (50.98–85.90%). Most algorithms performed no better than humans against the arbitrated data set, but two achieved higher sensitivities, and one yielded comparable sensitivity (80.47%, P = 0.441) and specificity (81.28%, P = 0.195). Notably, one had lower sensitivity (74.42%) for proliferative DR (P = 9.77 × 10−4) than the VA teleretinal graders. Value per encounter varied at $15.14–$18.06 for ophthalmologists and $7.74–$9.24 for optometrists. CONCLUSIONS The DR screening algorithms showed significant performance differences. These results argue for rigorous testing of all such algorithms on real-world data before clinical implementation.

Funder

National Eye Institute

Research to Prevent Blindness

Publisher

American Diabetes Association

Subject

Advanced and Specialized Nursing,Endocrinology, Diabetes and Metabolism,Internal Medicine

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3