A Comparison of ChatGPT and Fine-Tuned Open Pre-Trained Transformers (OPT) Against Widely Used Sentiment Analysis Tools: Sentiment Analysis of COVID-19 Survey Data-Reference-Cited by-同舟云学术

A Comparison of ChatGPT and Fine-Tuned Open Pre-Trained Transformers (OPT) Against Widely Used Sentiment Analysis Tools: Sentiment Analysis of COVID-19 Survey Data

Published:2024-01-25 Issue: Volume:11 Page:e50150
ISSN:2368-7959
Container-title:JMIR Mental Health
language:en
Short-container-title:JMIR Ment Health

Author:

Lossio-Ventura Juan Antonio^ORCID,Weger Rachel^ORCID,Lee Angela Y^ORCID,Guinee Emily P^ORCID,Chung Joyce^ORCID,Atlas Lauren^ORCID,Linos Eleni^ORCID,Pereira Francisco^ORCID

Abstract

Background Health care providers and health-related researchers face significant challenges when applying sentiment analysis tools to health-related free-text survey data. Most state-of-the-art applications were developed in domains such as social media, and their performance in the health care context remains relatively unknown. Moreover, existing studies indicate that these tools often lack accuracy and produce inconsistent results. Objective This study aims to address the lack of comparative analysis on sentiment analysis tools applied to health-related free-text survey data in the context of COVID-19. The objective was to automatically predict sentence sentiment for 2 independent COVID-19 survey data sets from the National Institutes of Health and Stanford University. Methods Gold standard labels were created for a subset of each data set using a panel of human raters. We compared 8 state-of-the-art sentiment analysis tools on both data sets to evaluate variability and disagreement across tools. In addition, few-shot learning was explored by fine-tuning Open Pre-Trained Transformers (OPT; a large language model [LLM] with publicly available weights) using a small annotated subset and zero-shot learning using ChatGPT (an LLM without available weights). Results The comparison of sentiment analysis tools revealed high variability and disagreement across the evaluated tools when applied to health-related survey data. OPT and ChatGPT demonstrated superior performance, outperforming all other sentiment analysis tools. Moreover, ChatGPT outperformed OPT, exhibited higher accuracy by 6% and higher F-measure by 4% to 7%. Conclusions This study demonstrates the effectiveness of LLMs, particularly the few-shot learning and zero-shot learning approaches, in the sentiment analysis of health-related survey data. These results have implications for saving human labor and improving efficiency in sentiment analysis tasks, contributing to advancements in the field of automated sentiment analysis.

Publisher

JMIR Publications Inc.

Subject

Psychiatry and Mental health

Reference110 articles.

1. Techniques and applications for sentiment analysis

2. Sentiment analysis algorithms and applications: A survey

3. A comprehensive survey on sentiment analysis: Approaches, challenges and trends

4. What social media told us in the time of COVID-19: a scoping review

5. Sentiment analysis and its applications in fighting COVID-19 and infectious diseases: A systematic review

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Large Language Models in Biomedical and Health Informatics: A Review with Bibliometric Analysis;Journal of Healthcare Informatics Research;2024-09-14

2. Exploring the potential of using ChatGPT for rhetorical move-step analysis: The impact of prompt refinement, few-shot learning, and fine-tuning;Journal of English for Academic Purposes;2024-09

3. Large Language Models Can Enable Inductive Thematic Analysis of a Social Media Corpus in a Single Prompt: Human Validation Study;JMIR Infodemiology;2024-08-29

4. Comparing ChatGPT's correction and feedback comments with that of educators in the context of primary students' short essays written in English and Greek;Education and Information Technologies;2024-07-27

5. Mental Health Applications of Generative AI and Large Language Modeling in the United States;International Journal of Environmental Research and Public Health;2024-07-12