Threshold-based Naïve Bayes classifier

Author:

Romano Maurizio,Contu Giulia,Mola Francesco,Conversano ClaudioORCID

Abstract

AbstractThe Threshold-based Naïve Bayes (Tb-NB) classifier is introduced as a (simple) improved version of the original Naïve Bayes classifier. Tb-NB extracts the sentiment from a Natural Language text corpus and allows the user not only to predict how much a sentence is positive (negative) but also to quantify a sentiment with a numeric value. It is based on the estimation of a single threshold value that concurs to define a decision rule that classifies a text into a positive (negative) opinion based on its content. One of the main advantage deriving from Tb-NB is the possibility to utilize its results as the input of post-hoc analysis aimed at observing how the quality associated to the different dimensions of a product or a service or, in a mirrored fashion, the different dimensions of customer satisfaction evolve in time or change with respect to different locations. The effectiveness of Tb-NB is evaluated analyzing data concerning the tourism industry and, specifically, hotel guests’ reviews from all hotels located in the Sardinian region and available on Booking.com. Moreover, Tb-NB is compared with other popular classifiers used in sentiment analysis in terms of model accuracy, resistance to noise and computational efficiency.

Funder

Ministry of University (IT) - Prog. Dipartimenti di Eccellenza

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computer Science Applications,Statistics and Probability

Reference41 articles.

1. Arndt J (1967) Role of product-related conversations in the diffusion of a new product. J Market Res 4(3):291–295. https://doi.org/10.2307/3149462

2. Bachtiar FA, Paulina W, Rusydi AN (2020) Text mining for aspect based sentiment analysis on customer review: a case study in the hotel industry. In: Serdült U, Loshchilov A, Mahmudy WF, Nurwasito H (eds) Proceedings of the 5th international workshop on innovations in information and communication science and technology (canceled by authorities due to SARS-CoV-2), CEUR workshop proceedings, vol 2627, pp 105–112, Malang, Indonesia, CEUR-WS.org

3. Boyd D, Crawford K (2012) Critical questions for big data: provocations for a cultural, technological, and scholarly phenomenon. Inf Commun Soc 15(5):662–679. https://doi.org/10.1080/1369118X.2012.678878

4. Brownlee J (2017) Deep learning for natural language processing: develop deep learning models for your natural language problems. In: Machine learning mastery, 1.7 edition

5. Buttle FA (1998) Word of mouth: understanding and managing referral marketing. J Strateg Market 6(3):241–254. https://doi.org/10.1080/096525498346658

Cited by 6 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Semi-supervised topic representation through sentiment analysis and semantic networks;Big Data Research;2024-08

2. Inventory Classification and Management System Using Machine Learning and Analytical Dashboard;Advances in Business Information Systems and Analytics;2024-02-23

3. SMARTS: SeMi-Supervised Clustering for Assessment of Reviews Using Topic and Sentiment;Studies in Classification, Data Analysis, and Knowledge Organization;2024

4. Improving Prediction of Polarity in Tourism Domain using Convolutional Neural Network;International Journal of Combinatorial Optimization Problems and Informatics;2023-12-31

5. Iterative threshold-based Naïve bayes classifier;Statistical Methods & Applications;2023-09-05

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3