BRSET: A Brazilian Multilabel Ophthalmological Dataset of Retina Fundus Photos

Author:

Nakayama Luis FilipeORCID,Restrepo David,Matos JoãoORCID,Ribeiro Lucas Zago,Malerbi Fernando Korn,Celi Leo Anthony,Regatieri Caio Saito

Abstract

AbstractIntroductionThe Brazilian Multilabel Ophthalmological Dataset (BRSET) addresses the scarcity of publicly available ophthalmological datasets in Latin America. BRSET comprises 16,266 color fundus retinal photos from 8,524 Brazilian patients, aiming to enhance data representativeness, serving as a research and teaching tool. It contains sociodemographic information, enabling investigations into differential model performance across demographic groups.MethodsData from three São Paulo outpatient centers yielded demographic and medical information from electronic records, including nationality, age, sex, clinical history, insulin use, and duration of diabetes diagnosis. A retinal specialist labeled images for anatomical features (optic disc, blood vessels, macula), quality control (focus, illumination, image field, artifacts), and pathologies (e.g., diabetic retinopathy). Diabetic retinopathy was graded using International Clinic Diabetic Retinopathy and Scottish Diabetic Retinopathy Grading. Validation used Dino V2 Base for feature extraction, with 70% training and 30% testing subsets. Support Vector Machines (SVM) and Logistic Regression (LR) were employed with weighted training. Performance metrics included area under the receiver operating curve (AUC) and Macro F1-score.ResultsBRSET comprises 65.1% Canon CR2 and 34.9% Nikon NF5050 images. 61.8% of the patients are female, and the average age is 57.6 years. Diabetic retinopathy affected 15.8% of patients, across a spectrum of disease severity. Anatomically, 20.2% showed abnormal optic discs, 4.9% abnormal blood vessels, and 28.8% abnormal macula. Models were trained on BRSET in three prediction tasks: “diabetes diagnosis”; “sex classification”; and “diabetic retinopathy diagnosis”.DiscussionBRSET is the first multilabel ophthalmological dataset in Brazil and Latin America. It provides an opportunity for investigating model biases by evaluating performance across demographic groups. The model performance of three prediction tasks demonstrates the value of the dataset for external validation and for teaching medical computer vision to learners in Latin America using locally relevant data sources.Author SummaryIn low-resource settings, access to open medical datasets is crucial for research. Regions such as Latin America often face underrepresentation, resulting in health biases and inequities. To face the scarcity of diverse ophthalmological datasets in these areas, especially in Brazil and Latin America, we introduce the Brazilian Multilabel Ophthalmological Dataset (BRSET) as a means to alleviate biases in medical AI research. Comprising 16,266 color fundus retinal photos from 8,524 Brazilian patients, BRSET integrates sociodemographic information, empowering researchers to investigate biases across demographic groups and diseases. BRSET was extracted from São Paulo outpatient centers, and includes demographics, clinical history, and retinal images labeled for anatomical features, quality control, and pathologies like diabetic retinopathy. Validation was performed in a set of selected prediction tasks, such as diabetes diagnosis, sex classification, and diabetic retinopathy diagnosis. BRSET’s inclusion of sociodemographic data and experiment metrics underscores its potential efficacy across diverse classification objectives and patient groups, providing crucial insights for medical AI in underrepresented regions.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3