Overview of the 8<sup>th</sup>Social Media Mining for Health Applications (#SMM4H) Shared Tasks at the AMIA 2023 Annual Symposium-Reference-Cited by-同舟云学术

Overview of the 8^thSocial Media Mining for Health Applications (#SMM4H) Shared Tasks at the AMIA 2023 Annual Symposium

Published:2023-11-08 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Klein Ari Z.,Banda Juan M.,Guo Yuting,Schmidt Ana Lucia,Xu Dongfang,Flores Amaro Jesus Ivan,Rodriguez-Esteban Raul,Sarker Abeed,Gonzalez-Hernandez Graciela

Abstract

ABSTRACTThe aim of the Social Media Mining for Health Applications (#SMM4H) shared tasks is to take a community-driven approach to address the natural language processing and machine learning challenges inherent to utilizing social media data for health informatics. The eighth iteration of the #SMM4H shared tasks was hosted at the AMIA 2023 Annual Symposium and consisted of five tasks that represented various social media platforms (Twitter and Reddit), languages (English and Spanish), methods (binary classification, multi-class classification, extraction, and normalization), and topics (COVID-19, therapies, social anxiety disorder, and adverse drug events). In total, 29 teams registered, representing 18 countries. In this paper, we present the annotated corpora, a technical summary of the systems, and the performance results. In general, the top-performing systems used deep neural network architectures based on pre-trained transformer models. In particular, the top-performing systems for the classification tasks were based on single models that were pre-trained on social media corpora. To facilitate future work, the datasets—a total of 61,353 posts—will remain available by request, and the CodaLab sites will remain active for a post-evaluation phase.

Publisher

Cold Spring Harbor Laboratory

Reference39 articles.

1. Auxier B , Anderson M. Social media use in 2021. Pew Research Center 7 April 2021. https://www.pewresearch.org/internet/2021/04/07/social-media-use-in-2021/ (accessed 20 October 2023).

2. Dixon SJ . Number of global social network users 2017-2027. Statista 29 August 2023. https://www.statista.com/statistics/278414/number-of-worldwide-social-network-users/ (accessed 20 October 2023).

3. Automatically identifying self-reports of COVID-19 diagnosis on Twitter: an annotated data set, deep neural network classifiers, and a large-scale cohort;J Med Internet Res,2023

4. An aspect-level sentiment analysis dataset for therapies on Twitter;Data Brief,2023

5. DeepADEMiner: a deep learning pharmacovigilance pipeline for extraction and normalization of adverse drug event mentions on Twitter;J Am Med Inform Assoc,2021

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. BERT-based language model for accurate drug adverse event extraction from social media: implementation, evaluation, and contributions to pharmacovigilance practices;Frontiers in Public Health;2024-04-23

2. Potential of artificial intelligence in injury prevention research and practice;Injury Prevention;2024-02-02

3. Shayona@SMM4H’23: COVID-19 Self diagnosis classification using BERT and LightGBM models;2024-01-04

4. MANTIS at #SMM4H 2023: Leveraging Hybrid and Ensemble Models for Detection of Social Anxiety Disorder on Reddit;2023-12-05

5. ThaparUni at #SMM4H 2023: Synergistic Ensemble of RoBERTa, XLNet, and ERNIE 2.0 for Enhanced Textual Analysis¹;2023-11-13