Affiliation:
1. Department of Computer Science, Federal University of Lavras, PO Box 3037, 37.200-000, Lavras, Brazil
Abstract
Abstract
Sentiment analysis has been the main focus of plenty of research efforts, particularly justified by its commercial significance, both for consumers and businesses. Thus, many methods have been proposed so far, and the most prominent have been compared in terms of effectiveness. Nonetheless, the literature is deficient when it comes to assessing the efficiency of these methods for processing large volumes of data. In this study, we performed an experimental assessment of the efficiency of 22 methods in total, whose implementations were available. We also proposed and assessed an environment for distributed processing methods for sentiment analysis, using the Apache Spark platform, named BigFeel. In this environment, the existing methods, outlined to run in a non-distributed way, can be adapted, without altering their source code, to run in a distributed manner. The experimental results reveal that (i) few methods are efficient in their native form, (ii) the methods improve their efficiency after having been integrated into BigFeel, (iii) some of them, which were unfeasible to process a large dataset, became viable when deployed in a computer cluster and (iv) some methods can only handle small datasets, even in a distributed manner.
Funder
Brazilian National Council for Scientific and Technological Development
Foundation for Research of the State of Minas Gerais
Publisher
Oxford University Press (OUP)
Reference57 articles.
1. Aspect and Entity Extraction for Opinion Mining
2. Techniques and applications for sentiment analysis;Feldman;Commun. ACM,2013
3. New avenues in opinion mining and sentiment analysis;Cambria;IEEE Intell. Syst.,2013
4. Mining big data: current status, and forecast to the future;Fan;ACM SIGKDD Explor. Newsl.,2013
5. Lexicon-based sentiment analysis: Comparative evaluation of six sentiment lexicons;Khoo;J. Inf. Sci.,2017
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献