Affiliation:
1. National Institute of Development Administration
2. National Institute of Development
Abstract
With the advancement in the Internet technology, customers can easily share opinions on services and products in forms of reviews. There can be large amounts of reviews for popular products. Manually summarizing those reviews for important issues is a daunting task. Automatic opinion summarization is a solution to the problem. The task is more complicated for reviews written in Thai language. Thai words are written continuously without space, there is no symbol to signify the end of a sentence, and many reviews are written informally, thus accurate word identification and linguistic annotation cannot be relied upon. This research proposes a novel technique to generate abstractive summaries of customer reviews written in Thai language. The proposed technique, which consists of the local and the global models, is evaluated using actual reviews of fifty randomly selected products from a popular cosmetic website. The results show that the local model outperforms the other model and the two baseline methods both quantitatively and qualitatively.
Publisher
Trans Tech Publications, Ltd.
Reference12 articles.
1. O. Sornil and K. Gree-ut: An Automatic Text Summarization Approach using Content-Based and Graph-Based Characteristics in 2006 IEEE Conference on Cybernetics and Intelligent Systems (2006), p.1–6.
2. K. Ganesan, C. Zhai, and J. Han: Opinogsis A Graph-based Approach to Abstractive Summarization of Highly Redundant Opinions, in Proceedings of the 23rd International Conference on Computational Linguistics, Stroudsburg, PA, USA (2010), p.340–348.
3. K. Filippova: Multi-sentence Compression Finding Shortest Paths in Word Graphs, in Proceedings of the 23rd International Conference on Computational Linguistics, Stroudsburg, PA, USA (2010), p.322–330.
4. K. Ganesan, C. Zhai, and E. Viegas: Micropinion Generation An Unsupervised Approach to Generating Ultra-concise Summaries of Opinions, in Proceedings of the 21st International Conference on World Wide Web, New York, NY, USA (2012), p.869–878.
5. P. Bheganan, R. Nayak, and Y. Xu: Thai Word Segmentation with Hidden Markov Model and Decision Tree, in Advances in Knowledge Discovery and Data Mining, Springer Berlin Heidelberg (2009), p.74–85.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Construction of Text Summarization Corpus in Economics Domain and Baseline Models;Journal of information and communication convergence engineering;2024-03-31
2. ThEconSum: an Economics-domained Dataset for Thai Text Summarization and Baseline Models;2022 17th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP);2022-11-05
3. A Performance Analysis of Deep-Learning-Based Thai News Abstractive Summarization: Word Positions and Document Length;2022 7th International Conference on Business and Industrial Research (ICBIR);2022-05-19