Affiliation:
1. Department of Information Technology, Netaji Subhas University of Technology, Delhi, India
2. Department of Computer Science & Engineering, Netaji Subhas University of Technology, Delhi, India
3. Department of Information Technology, Delhi Technological University, Delhi, India
Abstract
Automated sarcasm detection is deemed as a complex natural language processing task and extending it to a morphologically-rich and free-order dominant indigenous Indian language Hindi is another challenge in itself. The scarcity of resources and tools such as annotated corpora, lexicons, dependency parser, Part-of-Speech tagger and benchmark datasets engorge the linguistic challenges of sarcasm detection in low-resource languages like Hindi. Furthermore, as context incongruity is imperative to detect sarcasm, various linguistic, aural and visual cues can be used to predict target utterance as sarcastic. While pre-trained word embeddings capture the meanings, semantic relationships and different types of contexts in the form of word representations, emojis can also render useful contextual information, analogous to human facial expressions, for gauging sarcasm. Thus, the goal of this research is to demonstrate the use of a hybrid deep learning model trained using two embeddings, namely word and emoji embeddings to detect sarcasm. The model is validated on a Hindi tweets dataset, Sarc-H, manually annotated with sarcastic and non-sarcastic labels. The preliminary results clearly depict the importance of using emojis for sarcasm detection, with our model attaining an accuracy of 97.35% with an F-score of 0.9708. The research validates that automated feature engineering facilitates efficient and repeatable predictive model for detecting sarcasm in indigenous, low-resource languages.
Publisher
Association for Computing Machinery (ACM)
Reference53 articles.
1. New Avenues in Opinion Mining and Sentiment Analysis
2. Hybrid context enriched deep learning model for fine-grained sentiment analysis in textual and visual semiotic modality social data
3. Kumar , A. ( 2021 ). Contextual semantics using hierarchical attention network for sentiment classification in social internet-of-things. Multimed Tools Appl https://doi.org/10.1007/s11042-021-11262-8 10.1007/s11042-021-11262-8 Kumar, A. (2021). Contextual semantics using hierarchical attention network for sentiment classification in social internet-of-things. Multimed Tools Appl https://doi.org/10.1007/s11042-021-11262-8
4. Sarcasm detection in mash-up language using soft-attention based bi-directional LSTM and feature-rich CNN
5. How Intense Are You? Predicting Intensities of Emotions and Sentiments using Stacked Ensemble [Application Notes]
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献