Emotion Detection System for Malayalam Text using Deep Learning and Transformers

Author:

K Anuja12ORCID,Raj P. C. Reghu12ORCID,K R Remesh Babu32ORCID

Affiliation:

1. Computer Science and Engineering, Govt. Engineering College, Palakkad, India

2. APJ Abdul Kalam Technological University, Trivandrum, India

3. Information Technology, Govt. Engineering College, Idukki, India

Abstract

Recent advances in Natural Language Processing (NLP) have improved the performance of the systems that perform tasks, such as Emotion Detection (ED), Information Retrieval, Translation, etc., in resource-rich languages like English and Chinese. But similar advancements have not been made in Malayalam due to the dearth of annotated datasets. Because of its rich morphology, free word order and agglutinative character, data preparation in Malayalam is highly challenging. In this paper, we employ traditional Machine Learning (ML) techniques such as support vector machines (SVM) and multilayer perceptrons (MLP), and recent deep learning methods such as Recurrent Neural Networks (RNN) and advanced transformer-based methodologies to train an emotion detection system. This work stands out since all the previous attempts to extract emotions from Malayalam text have relied on lexicons, which are inappropriate for handling large amounts of data. By tweaking the hyperparameters, we enhanced the transformer-based model known as MuRIL to obtain an accuracy of 79%, which is then compared with the only state-of-the-art (SOTA) model. We found that the proposed techniques surpass the SOTA system available for detecting emotions in Malayalam reported so far.

Publisher

Association for Computing Machinery (ACM)

Reference53 articles.

1. Athul Jayson 2020. GENERATING MALAYALAM WORD EMBEDDINGS: A CASE STUDY. https://blog.qburst.com//. Accessed: 2022-02-20.

2. Jason Brownlee 2021. Random Oversampling and Undersampling for Imbalanced Classification. https://machinelearningmastery.com//. Accessed: 2021-05-20.

3. July 2023. Multilingual (English and Chinese) GoEmotions Classification Model. https://huggingface.co/SchuylerH/bert-multilingual-go-emtions//. Accessed: 2023-12-15.

4. Lakshmi Panneerselvam 2021. A Gentle Introduction To MuRIL : Multilingual Representations for Indian Languages. https://www.analyticsvidhya.com//. Accessed: 2022-06-17.

5. October 2023. Emotions Classification Model. https://huggingface.co/gokuls/bert_uncased_L-12_H-768_A-12_emotion//. Accessed: 2023-12-17.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3