Abstract
The unavailability of an annotated dataset for a low-resource Ewe language makes it difficult to develop an automated system to appropriately evaluate public opinion on events, news, policies, and regulations. In this study, we collected and preprocessed a low-resourced document-level Ewe sentiment dataset based on social media comments. We used three (3) features learned by word embeddings (Global vectors, word-to-vector, and FastText) rather than hand-crafted features. We further proposed a novel method termed MC2D-CNN+BiLSTM-Attn to detect the exact sentiment feature from the Ewe dataset. Extensive experiments indicate that the proposed method efficiently classifies various sentiments and is superior to benchmark deep learning methods. Results show that in detecting the precise sentiments from raw Ewe textual context, the BiLSTM incorporating Glove outperforms word2vec and FastText embedding with an accuracy of 0.727. Furthermore, Attn+BiLSTM and Multi-channel CNN methods incorporating the word2vec embedding layer perform better than Glove and FastText embedding with an accuracy of 0.848 and 0.896. In contrast, our proposed method with the same word2vec embedding recorded 0.949.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献