Abstract
The increasing significance of Adverse Drug Events (ADEs) extracted from social media, such as Twitter data, has led to the development of various end-to-end resolution methodologies. Despite recent advancements, a substantial gap remains in normalizing ADE entities drawn from social media, where symptoms are expressed informally and in diverse ways; closing this gap is crucial for accurate ADE identification and reporting. To address this challenge, we introduce a novel end-to-end solution called CONORM: Context-Aware Entity Normalization. CONORM is a two-step pipeline. The first component is a transformer encoder fine-tuned for entity recognition. The second component is a context-aware entity normalization algorithm, which uses a dynamic context refining mechanism to adjust entity embeddings so that ADE mentions align with their respective concepts in medical terminology. An integral feature of CONORM is its compatibility with vector databases, which enables efficient querying and scalable parallel processing. Evaluated on the SMM4H 2023 ADE normalization shared task dataset, CONORM achieved an F1-score of 50.20% overall and 39.40% for out-of-distribution samples. These results improve performance by 18.00% and 19.90% over the median shared task results, 7.60% and 10.20% over the best model in the shared task, and 5.00% and 3.10% over the existing state-of-the-art ADE mining algorithm. CONORM's ability to provide context-aware entity normalization paves the way for enhanced end-to-end ADE resolution methods. Our findings and methodologies shed light on potential advancements in the broader realm of pharmacovigilance using social media data. The model architectures are publicly available at https://github.com/anthonyyazdani/CONORM.
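As a rough illustration of the normalization step described in the abstract, the sketch below adjusts a mention embedding with a context embedding and then retrieves the nearest concept by cosine similarity, which is the kind of lookup a vector database performs. It is not the CONORM implementation: the fixed blending weight `alpha` stands in for the dynamic context refining mechanism, which the abstract does not specify, and the function names, concept labels, and three-dimensional vectors are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def refine(mention_vec, context_vec, alpha=0.7):
    """Blend the mention embedding with its context embedding.

    Assumption: a fixed weight `alpha` is used here as a simple stand-in
    for a learned, dynamic refinement of the entity embedding.
    """
    return [alpha * m + (1 - alpha) * c for m, c in zip(mention_vec, context_vec)]


def normalize(query_vec, concept_index):
    """Return the label of the concept whose vector is most similar to the query."""
    return max(concept_index, key=lambda label: cosine(query_vec, concept_index[label]))


# Toy concept vectors (made up); a real system would store terminology
# embeddings in a vector database and query it for nearest neighbours.
concept_index = {
    "CONCEPT_INSOMNIA": [1.0, 0.0, 0.0],
    "CONCEPT_SOMNOLENCE": [0.0, 1.0, 0.0],
}

mention = [0.4, 0.5, 0.1]  # hypothetical embedding of an ambiguous sleep-related mention
context = [0.9, 0.1, 0.0]  # hypothetical embedding of the surrounding post

without_context = normalize(mention, concept_index)
with_context = normalize(refine(mention, context), concept_index)
```

In this toy setup the mention alone sits closer to `CONCEPT_SOMNOLENCE`, while the context-adjusted vector sits closer to `CONCEPT_INSOMNIA`, which is the effect a context-aware normalizer aims for on informal text.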
Publisher
Cold Spring Harbor Laboratory