Author:
Khan Muhammad Taimoor,Durrani Mehr,Ali Armughan,Inayat Irum,Khalid Shehzad,Khan Kamran Habib
Abstract
Abstract
There is huge amount of content produced online by amateur authors, covering a large variety of topics. Sentiment analysis (SA) extracts and aggregates users’ sentiments towards a target entity. Machine learning (ML) techniques are frequently used as the natural language data is in abundance and has definite patterns. ML techniques adapt to domain specific solution at high accuracy depending upon the feature set used. The lexicon-based techniques, using external dictionary, are independent of data to prevent overfitting but they miss context too in specialized domains. Corpus-based statistical techniques require large data to stabilize. Complex network based techniques are highly resourceful, preserving order, proximity, context and relationships. Recent applications developed incorporate the platform specific structural information i.e. meta-data. New sub-domains are introduced as influence analysis, bias analysis, and data leakage analysis. The nature of data is also evolving where transcribed customer-agent phone conversation are also used for sentiment analysis. This paper reviews sentiment analysis techniques and highlight the need to address natural language processing (NLP) specific open challenges. Without resolving the complex NLP challenges, ML techniques cannot make considerable advancements. The open issues and challenges in the area are discussed, stressing on the need of standard datasets and evaluation methodology. It also emphasized on the need of better language models that could capture context and proximity.
Funder
full fee waiver awarded by editor-in-chief
Publisher
Springer Science and Business Media LLC
Subject
Applied Mathematics,Computer Science Applications,Modeling and Simulation
Reference85 articles.
1. Akkaya Cem, Janyce Wiebe, Rada Mihalcea (2009) Subjectivity word sense disambiguation. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing EMNLP
2. Ana C, Arlindo LO (2007) Semi-supervised single-label text categorization using centroid-based classifiers. ACM 844–851
3. Aoyama M (2002) A business-driven web service creation methodology, saint-w. IEEE
4. Bar-Haim R, Dinur E, Feldman R, Fresko M, Goldstein G (2011) Identifying and following expert investors in stock microblogs, In: Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP-2011)
5. Basu A et al (2003) Support vector machines for text categorization. In: Proceedings of the IEEE Hawaii International conference on system sciences
Cited by
40 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献