Using a Deep Learning Model to Address Interobserver Variability in the Evaluation of Ulcerative Colitis (UC) Severity

Author:

Kim Jeong-Heon123,Choe A Reum4ORCID,Park Yehyun4,Song Eun-Mi4,Byun Ju-Ran4,Cho Min-Sun5,Yoo Youngeun5ORCID,Lee Rena6,Kim Jin-Sung123,Ahn So-Hyun7ORCID,Jung Sung-Ae4

Affiliation:

1. Department of Medicine, Yonsei University College of Medicine, Seoul 03722, Republic of Korea

2. Medical Physics and Biomedical Engineering Lab (MPBEL), Yonsei University College of Medicine, Seoul 03722, Republic of Korea

3. Department of Radiation Oncology, Yonsei Cancer Center, Heavy Ion Therapy Research Institute, Yonsei University College of Medicine, Seoul 03722, Republic of Korea

4. Department of Internal Medicine, Ewha Womans University College of Medicine, Seoul 03760, Republic of Korea

5. Department of Pathology, Ewha Womans University College of Medicine, Seoul 03760, Republic of Korea

6. Department of Bioengineering, Ewha Womans University College of Medicine, Seoul 03760, Republic of Korea

7. Ewha Medical Research Institute, Ewha Womans University College of Medicine, Seoul 03760, Republic of Korea

Abstract

The use of endoscopic images for the accurate assessment of ulcerative colitis (UC) severity is crucial to determining appropriate treatment. However, experts may interpret these images differently, leading to inconsistent diagnoses. This study aims to address the issue by introducing a standardization method based on deep learning. We collected 254 rectal endoscopic images from 115 patients with UC, and five experts in endoscopic image interpretation assigned classification labels based on the Ulcerative Colitis Endoscopic Index of Severity (UCEIS) scoring system. Interobserver variance analysis of the five experts yielded an intraclass correlation coefficient of 0.8431 for UCEIS scores and a kappa coefficient of 0.4916 when the UCEIS scores were transformed into UC severity measures. To establish a consensus, we created a model that considered only the images and labels on which more than half of the experts agreed. This consensus model achieved an accuracy of 0.94 when tested with 50 images. Compared with models trained from individual expert labels, the consensus model demonstrated the most reliable prediction results.

Funder

National Research Foundation of Korea

Ministry of Education

Ministry of Science and IC

Publisher

MDPI AG

Subject

Medicine (miscellaneous)

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3