Research on Building a Chinese Sentiment Lexicon Based on SO-PMI

Author:

Yang Ai Min (1), Lin Jiang Hao (1), Zhou Yong Mei (1), Chen Jin (1)

Affiliation:

1. Guangdong University of Foreign Studies

Abstract

Considering user behavior, this paper builds a Chinese sentiment lexicon based on an improved SO-PMI algorithm. Semantic lexicons were used to classify the sentiment of collected Chinese hotel reviews. The experiment compared feature extraction based on CHI with feature extraction based on sentiment lexicons to assess their classification performance. The results indicate that feature extraction based on a sentiment lexicon achieves a higher F1. The classification method “Basic Semantic Lexicon + BOOL + NB” achieves an F1 of 92.40%. Across the different sentiment lexicons, the experimental results show that (SO-A) and (SO-P) perform slightly better than the NB classifier. It is therefore effective to use (SO-A) and (SO-P) as text sentiment classifiers. The experiment also finds that the method “Hotel Reviews Semantic Lexicon using improved SO-PMI algorithm + (SO-A)” achieves the highest F1, 92.84%. The results suggest that the improved SO-PMI is more effective for weight calculation and sentiment lexicon building.
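
For readers unfamiliar with SO-PMI, the sketch below illustrates the standard (unimproved) formulation: a word's semantic orientation is the sum of its pointwise mutual information with positive seed words minus the sum with negative seed words. It is not the improved variant the paper proposes, which the abstract does not describe. The function name `so_pmi`, the toy English reviews, the seed words, document-level co-occurrence counting, and the additive `smoothing` constant are all assumptions made for illustration.

```python
import math

def so_pmi(word, docs, pos_seeds, neg_seeds, smoothing=0.01):
    """Standard SO-PMI score of `word` over `docs` (a list of token sets).

    Assumption: co-occurrence is counted per document, and a small
    additive constant avoids log(0) when two words never co-occur.
    """
    n = len(docs)

    def count(*terms):
        # Number of documents containing every term in `terms`.
        return sum(1 for d in docs if all(t in d for t in terms))

    def pmi(a, b):
        p_ab = (count(a, b) + smoothing) / n
        p_a = (count(a) + smoothing) / n
        p_b = (count(b) + smoothing) / n
        return math.log2(p_ab / (p_a * p_b))

    positive = sum(pmi(word, s) for s in pos_seeds)
    negative = sum(pmi(word, s) for s in neg_seeds)
    return positive - negative

# Hypothetical toy corpus of tokenized hotel reviews.
DOCS = [
    {"clean", "good", "room"},
    {"clean", "good", "staff"},
    {"dirty", "bad", "room"},
    {"dirty", "bad", "staff"},
]
POS_SEEDS = ["good"]
NEG_SEEDS = ["bad"]

scores = {w: so_pmi(w, DOCS, POS_SEEDS, NEG_SEEDS)
          for w in ("clean", "dirty", "room")}
```

In this toy corpus, "clean" scores positive, "dirty" scores negative, and "room" scores approximately zero because it co-occurs equally with both seeds. A lexicon could then be built by keeping words whose score exceeds some threshold in either direction; the threshold choice is again an assumption, not something the abstract specifies.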

Publisher

Trans Tech Publications, Ltd.

