DeepADEMiner: A Deep Learning Pharmacovigilance Pipeline for Extraction and Normalization of Adverse Drug Effect Mentions on Twitter-Reference-Cited by-同舟云学术

DeepADEMiner: A Deep Learning Pharmacovigilance Pipeline for Extraction and Normalization of Adverse Drug Effect Mentions on Twitter

Published:2020-12-16 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Magge Arjun,Tutubalina Elena,Miftahutdinov Zulfat,Alimova Ilseyar,Dirkson Anne,Verberne Suzan,Weissenbacher Davy,Gonzalez-Hernandez Graciela

Abstract

Objective: Research on pharmacovigilance from social media data has focused on mining adverse drug effects (ADEs) using annotated datasets, with publications generally focusing on one of three tasks: (i) ADE classification, (ii) named entity recognition (NER) for identifying the span of an ADE mentions, and (iii) ADE mention normalization to standardized vocabularies. While the common goal of such systems is to detect ADE signals that can be used to inform public policy, it has been impeded largely by limited end-to-end solutions to the three tasks for large-scale analysis of social media reports for different drugs. Materials and Methods: We present a dataset for training and evaluation of ADE pipelines where the ADE distribution is closer to the average `natural balance' with ADEs present in about 7% of the Tweets. The deep learning architecture involves an ADE extraction pipeline with individual components for all three tasks. Results: The system presented achieved a classification performance of F1 = 0.63, span detection performance of F1 = 0.44 and an end-to-end entity resolution performance of F1 = 0.34 on the presented dataset. Discussion: The performance of the models continue to highlight multiple challenges when deploying pharmacovigilance systems that use social media data. We discuss the implications of such models in the downstream tasks of signal detection and suggest future enhancements. Conclusion: Mining ADEs from Twitter posts using a pipeline architecture requires the different components to be trained and tuned based on input data imbalance in order to ensure optimal performance on the end-to-end resolution task.

Publisher

Cold Spring Harbor Laboratory

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Methods and Annotated Data Sets Used to Predict the Gender and Age of Twitter Users: Scoping Review (Preprint);2023-04-05

2. How do others cope? Extracting coping strategies for adverse drug events from social media;Journal of Biomedical Informatics;2023-03

3. Scoping Review of Methods and Annotated Datasets Used to Predict Gender and Age of Twitter Users;2022-12-06

4. Mining Medication-Effect Relations from Twitter Data Using Pre-trained Transformer Language Model;Communications in Computer and Information Science;2021