Detecting sarcasm in multi-domain datasets using convolutional neural networks and long short term memory network model

Author:

Jamil Ramish1,Ashraf Imran2ORCID,Rustam Furqan1,Saad Eysha1ORCID,Mehmood Arif3,Choi Gyu Sang2ORCID

Affiliation:

1. Khwaja Fareed University of Engineering and Information Technology, Rahim Yar Khan, Pakistan

2. Information and Communication Engineering, Yeungnam University, Gyeongsan si, Daegu, South Korea

3. The Islamia University of Bahawalpur, Bahawalpur, Pakistan

Abstract

Sarcasm emerges as a common phenomenon across social networking sites because people express their negative thoughts, hatred and opinions using positive vocabulary which makes it a challenging task to detect sarcasm. Although various studies have investigated the sarcasm detection on baseline datasets, this work is the first to detect sarcasm from a multi-domain dataset that is constructed by combining Twitter and News Headlines datasets. This study proposes a hybrid approach where the convolutional neural networks (CNN) are used for feature extraction while the long short-term memory (LSTM) is trained and tested on those features. For performance analysis, several machine learning algorithms such as random forest, support vector classifier, extra tree classifier and decision tree are used. The performance of both the proposed model and machine learning algorithms is analyzed using the term frequency-inverse document frequency, bag of words approach, and global vectors for word representations. Experimental results indicate that the proposed model surpasses the performance of the traditional machine learning algorithms with an accuracy of 91.60%. Several state-of-the-art approaches for sarcasm detection are compared with the proposed model and results suggest that the proposed model outperforms these approaches concerning the precision, recall and F1 scores. The proposed model is accurate, robust, and performs sarcasm detection on a multi-domain dataset.

Funder

Basic Science Research Program through the National Research Foundation of Korea

Ministry of Education

MSIT

ITRC

IITP

Publisher

PeerJ

Subject

General Computer Science

Reference57 articles.

1. Deep learning-based sentiment classification of evaluative text based on multi-feature fusion;Abdi;Information Processing & Management,2019

2. A survey of figurative language and its computational detection in online social networks;Abulaish;ACM Transactions on the Web (TWEB),2020

3. Affective representations for sarcasm detection;Agrawal,2018

4. Modelling context with user embeddings for sarcasm detection in social media;Amir;arXiv,2016

5. A rule based system for speech language context understanding;Bajwa;Journal of Donghua University,2006

Cited by 41 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3