Modeling and Mining Domain Shared Knowledge for Sentiment Analysis
-
Published:2017-09-15
Issue:2
Volume:36
Page:1-36
-
ISSN:1046-8188
-
Container-title:ACM Transactions on Information Systems
-
language:en
-
Short-container-title:ACM Trans. Inf. Syst.
Author:
Zhou Guang-You (1),
Huang Jimmy Xiangji (2)
Affiliation:
1. Central China Normal University, Hubei, P. R. China
2. Information Retrieval and Knowledge Management Research Lab, York University, Ontario, Canada
Abstract
Sentiment classification aims to automatically predict the sentiment polarity (e.g., positive or negative) of user-generated sentiment data (e.g., reviews, blogs). In real applications, such data can span so many different domains that it is difficult to label training data for all of them. Therefore, in this article we study the task of sentiment classification adaptation: a system is trained to label reviews from a source domain but is meant to be used on a different target domain. One of the biggest challenges for this task is handling cases where the data distributions of the source and target domains differ significantly. However, we observe that some domain shared knowledge may exist among certain input dimensions of different domains. In this article, we present a novel method for modeling and mining the domain shared knowledge from different sentiment review domains via a joint non-negative matrix factorization-based framework. In the proposed framework, we attempt to learn both the domain shared knowledge and the domain-specific information from different sentiment review domains under several regularization constraints. The advantage of the proposed method is that it promotes correspondence in the topic space between the source domain and the target domain, which can significantly reduce the data distribution gap across the two domains. We conduct extensive experiments on two real-world balanced data sets of Amazon product reviews for sentence-level and document-level binary sentiment classification. Experimental results show that the proposed approach significantly outperforms several strong baselines and achieves accuracy competitive with the best-known methods for sentiment classification adaptation.
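The general idea of a joint non-negative matrix factorization with shared and domain-specific factors can be illustrated with a minimal sketch. This is not the authors' formulation: the abstract does not give the objective or the regularization constraints, so the sketch below assumes a plain squared-error objective with no regularizers, standard multiplicative updates, and hypothetical names throughout (`joint_nmf`, `W_c` for the shared term-topic basis, `W_s` and `W_t` for the source-specific and target-specific bases, `H_s` and `H_t` for the document representations).

```python
import numpy as np

def joint_nmf(X_src, X_tgt, k_shared=2, k_specific=2, n_iter=100, eps=1e-9, seed=0):
    """Illustrative joint NMF (an assumption, not the paper's model).

    Approximates X_src ~ [W_c, W_s] @ H_s and X_tgt ~ [W_c, W_t] @ H_t,
    where rows are terms, columns are documents, and W_c is shared.
    """
    rng = np.random.default_rng(seed)
    m = X_src.shape[0]
    W_c = rng.random((m, k_shared))      # shared topics
    W_s = rng.random((m, k_specific))    # source-only topics
    W_t = rng.random((m, k_specific))    # target-only topics
    H_s = rng.random((k_shared + k_specific, X_src.shape[1]))
    H_t = rng.random((k_shared + k_specific, X_tgt.shape[1]))

    def loss():
        r_s = X_src - np.hstack([W_c, W_s]) @ H_s
        r_t = X_tgt - np.hstack([W_c, W_t]) @ H_t
        return float((r_s ** 2).sum() + (r_t ** 2).sum())

    losses = [loss()]
    for _ in range(n_iter):
        U_s = np.hstack([W_c, W_s])
        U_t = np.hstack([W_c, W_t])
        # Multiplicative updates for the document representations.
        H_s *= (U_s.T @ X_src) / (U_s.T @ U_s @ H_s + eps)
        H_t *= (U_t.T @ X_tgt) / (U_t.T @ U_t @ H_t + eps)

        R_s, R_t = U_s @ H_s, U_t @ H_t          # current reconstructions
        Hc_s, Hp_s = H_s[:k_shared], H_s[k_shared:]
        Hc_t, Hp_t = H_t[:k_shared], H_t[k_shared:]
        # The shared basis receives evidence from both domains.
        W_c_new = W_c * (X_src @ Hc_s.T + X_tgt @ Hc_t.T) / (
            R_s @ Hc_s.T + R_t @ Hc_t.T + eps)
        # Each specific basis only sees its own domain.
        W_s *= (X_src @ Hp_s.T) / (R_s @ Hp_s.T + eps)
        W_t *= (X_tgt @ Hp_t.T) / (R_t @ Hp_t.T + eps)
        W_c = W_c_new
        losses.append(loss())
    return W_c, W_s, W_t, H_s, H_t, losses

# Toy usage with random non-negative "term-document" matrices.
X_src = np.random.default_rng(1).random((30, 20))
X_tgt = np.random.default_rng(2).random((30, 25))
W_c, W_s, W_t, H_s, H_t, losses = joint_nmf(X_src, X_tgt)
print(f"reconstruction error: {losses[0]:.2f} -> {losses[-1]:.2f}")
```

In a sketch like this, the shared rows of `H_s` and `H_t` live in the same topic space because they are expressed against the same basis `W_c`, which is one way a classifier trained on source documents could be applied to target documents.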
Funder
York Research Chairs (YRC) program
Fundamental Research Funds for the Central Universities
Wuhan Youth Science and Technology plan
Early Researcher Award/Premier's Research Excellence Award
ORF-RE
Information Retrieval and Knowledge Management Research Laboratory
Natural Sciences and Engineering Research Council (NSERC) of Canada
IBM Shared University Research (SUR) Award
NSERC CREATE award in ADERSIM
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Science Applications; General Business, Management and Accounting; Information Systems
Cited by
16 articles.