Abstract
AbstractEmpirical insights into promising commercial sentiment analysis solutions that go beyond the claims of their vendors are rare. Moreover, due to the constant evolution in the field, previous studies are far from reflecting the current situation. The goal of this article is to evaluate and compare current solutions using two experimental studies. In the first part of the study, based on tweets about airline service quality, we test the solutions of six vendors with different market power, such as Amazon, Google, IBM, Microsoft, Lexalytics, and MeaningCloud, and report their measures of accuracy, precision, recall, (macro)F1, time performance, and service level agreements (SLA). Furthermore, we compare two of the services in depth with multiple data sets and over time. The services tested here are Google Cloud Natural Language API and MeaningCloud Sentiment Analysis API. For evaluating the results over time, we use the same data set as in November 2020. In addition, further topic-specific and general Twitter data sets are used. The experiments show that the IBM Watson NLU and Google Cloud Natural Language API solutions may be preferred when negative text detection is the primary concern. When tested in July 2022, the Google Cloud Natural Language API was still the clear winner compared to the MeaningCloud Sentiment Analysis API, but only on the airline service quality data set; on the other data sets, both services provided specific benefits and drawbacks. Furthermore, we detected changes in the sentiment classification over time with both services. Our results motivate that an independent, critical, and longitudinal experimental analysis of sentiment analysis services can provide interesting insights into their overall reliability and particular classification accuracy beyond marketing claims to critically compare solutions based on real data and analyze potential weaknesses and margins of error before making an investment.
Funder
Technische Hochschule Wildau
Publisher
Springer Science and Business Media LLC
Subject
Computer Science Applications,Computer Networks and Communications,Computer Graphics and Computer-Aided Design,Computational Theory and Mathematics,Artificial Intelligence,General Computer Science
Reference64 articles.
1. Liu B. Sentiment analysis: mining opinions, sentiments, and emotions (studies in natural language processing). Cambridge: Cambridge University Press; 2015.
2. Wiegand M, Balahur A, Roth B, Klakow D, Montoyo A. A survey on the role of negation in sentiment analysis. In: Proceedings of the workshop on negation and speculation in natural language processing, pp. 60–68. Uppsala: University of Antwerp (2010).
3. Lau R, Liao S, Wong KF, Chiu D. Web 2.0 environmental scanning and adaptive decision support for business mergers and acquisitions. Manag Inf Syst Quart. 2012;36:1239–68.
4. Hu T, Tripathi A. The effect of social media on market liquidity. ICIS 2015 Proceedings (2015).
5. Jiang C, Wang J, Tang Q, Lyu X. Investigating the effects of dimension-specific sentiments on product sales: the perspective of sentiment preferences. J Assoc Inf Syst. 2021. https://doi.org/10.17705/1jais.00668.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Discursos contrarios a la educación sexual en España;Revista ICONO 14. Revista científica de Comunicación y Tecnologías emergentes;2024-07-23
2. Sentiment Analysis of User Reactions to Meta's Threads Launch and Twitter's X Renaming;Advances in Business Information Systems and Analytics;2024-02-23