Evaluation of a natural language processing tool for extracting gender, weight, ethnicity, and race in the US food and drug administration adverse event reporting system-Reference-Cited by-同舟云学术

Evaluation of a natural language processing tool for extracting gender, weight, ethnicity, and race in the US food and drug administration adverse event reporting system

Published:2022-11-14 Issue: Volume:2 Page:
ISSN:2674-0869
Container-title:Frontiers in Drug Safety and Regulation
language:
Short-container-title:Front. Drug Saf. Regul.

Author:

Dang Vivian,Wu Eileen,Kortepeter Cindy M.,Phan Michael,Zhang Rongmei,Ma Yong,Muñoz Monica A.

Abstract

The US Food and Drug Administration Adverse Event Reporting System (FAERS) contains over 24 million individual case safety reports (ICSRs). In this research project, we evaluated a natural language processing (NLP) tool’s ability to extract four demographic variables (gender, weight, ethnicity, race) from ICSR narratives. Specificity of the NLP algorithm was over 94% for all demographics, while sensitivity varied between the demographics: 98.6% (gender), 45.5% (weight), 100% (ethnicity), and 85.3% (race). Among ICSRs missing weight, ethnicity, and race in the structured field, few cases had this information in the narrative (>95% missing); consequently, the positive predictive value (PPV) for these three demographics had wide 95% confidence intervals. After NLP implementation, the total number of ICSRs missing gender was reduced by 33% (i.e., NLP identified 472 thousand reports having a gender value in the narrative that was not in the structured field), while the total number of ICSRs missing weight, ethnicity, or race was reduced by less than 4%. This study demonstrated that the implementation of an NLP tool can provide meaningful improvements in the availability of gender information for pharmacovigilance activities conducted with FAERS data. In contrast, NLP tools targeting the extraction of weight, ethnicity, or race from free-text fields have minimal impact largely because the information was infrequently provided by the reporter. Further gains in completeness of these fields must originate from increases in provision of demographic information from the reporter rather than informatic solutions.

Publisher

Frontiers Media SA

Reference26 articles.

1. Drug dosing in obese adults;Barras;Aust. Prescr.,2017

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Subgroup disproportionality analysis of dementia-related adverse events with sacubitril/valsartan across geographical regions;Scientific Reports;2024-09-03

2. Integrating clinical pharmacology and artificial intelligence: potential benefits, challenges, and role of clinical pharmacologists;Expert Review of Clinical Pharmacology;2024-02-15

3. Editorial: Computational methods and systems to support decision making in pharmacovigilance;Frontiers in Drug Safety and Regulation;2023-04-21