ORDSAENet: Outlier Resilient Semantic Featured Deep Driven Sentiment Analysis Model for Education Domain

Published:2023-10-05 Page:408-430
ISSN:2788-7669
Container-title:Journal of Machine and Computing
language:en
Short-container-title:JMC

Author:

B A Smitha¹, K N Raja Praveen¹

Affiliation:

1. Jain Deemed to be University, Bangalore, India.

Abstract

Rapidly rising global competition across the education sector has forced institutions to improve their offerings, which requires assessing how students and related stakeholders perceive learning materials, courses, and learning methods or pedagogies. Student reviews can be of paramount significance for this purpose; yet annotating student opinion over huge, heterogeneous, and unstructured data remains a tedious task. Although artificial intelligence (AI) and natural language processing (NLP) techniques can play a decisive role, conventional unsupervised lexicon- and corpus-based solutions, as well as machine learning and deep learning approaches, are limited by issues such as class imbalance, lack of contextual detail, lack of long-term dependency, convergence problems, and local minima. These challenges can be severe over large inputs in Big Data ecosystems. In this context, this paper proposes an outlier resilient semantic featured deep driven sentiment analysis model (ORDSAENet) for sentiment annotation in the education domain. To address data heterogeneity and lack of structure in unpredictable digital media, ORDSAENet applies several pre-processing steps: missing-value removal, Unicode normalization, emoji and website-link removal, removal of words containing numeric values, punctuation removal, lower-case conversion, stop-word removal, lemmatization, and tokenization. It also applies a text-size constraint to remove outlier texts from the input and thereby improve ROI-specific learning for accurate annotation. The tokenized data is embedded with Word2Vec continuous bag-of-words (CBOW) semantic embedding, followed by resampling with the synthetic minority over-sampling technique combined with edited nearest neighbors (SMOTE-ENN). The resampled embedding matrix is then passed to a Bi-LSTM for feature extraction and learning, which retains both local and contextual features for efficient learning and classification. Evaluated on an educational review dataset containing both qualitative reviews and quantitative ratings for online courses, ORDSAENet achieves average sentiment annotation accuracy, precision, recall, and F-measure of 95.87%, 95.26%, 95.06%, and 95.15%, respectively, which is higher than standalone LSTM-driven feature learning solutions and other state-of-the-art methods. The overall simulation results and related inferences confirm the robustness of ORDSAENet as a real-time educational sentiment annotation solution.
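The pre-processing and text-size filtering steps listed in the abstract can be sketched roughly as follows. This is a minimal standard-library illustration, not the authors' implementation: the function names, the stop-word list, the emoji ranges, and the `min_tokens`/`max_tokens` thresholds are all assumptions, and lemmatization is omitted because it needs an external library. The later stages (Word2Vec CBOW embedding, SMOTE-ENN resampling, Bi-LSTM learning) are not shown.

```python
import re
import string
import unicodedata

# Tiny illustrative stop-word list (assumption; the abstract does not specify one).
STOP_WORDS = {"a", "an", "the", "is", "was", "and", "or", "to", "of", "in", "it", "this", "i"}

URL_RE = re.compile(r"(https?://|www\.)\S+")
# Rough emoji/pictograph ranges (assumption; not exhaustive).
EMOJI_RE = re.compile("[\U0001F300-\U0001FAFF\u2600-\u27BF]")
# Map every ASCII punctuation character to a space.
PUNCT_TABLE = str.maketrans({ch: " " for ch in string.punctuation})


def preprocess(text):
    """Clean one review and return its tokens."""
    text = unicodedata.normalize("NFKC", text)   # Unicode normalization
    text = URL_RE.sub(" ", text)                 # website-link removal
    text = EMOJI_RE.sub(" ", text)               # emoji removal
    # Drop whitespace-separated words that contain any digit.
    text = " ".join(w for w in text.split() if not any(c.isdigit() for c in w))
    text = text.translate(PUNCT_TABLE).lower()   # punctuation removal, lower-casing
    # Whitespace tokenization followed by stop-word removal.
    return [tok for tok in text.split() if tok not in STOP_WORDS]


def clean_corpus(reviews, min_tokens=2, max_tokens=50):
    """Skip missing reviews, tokenize the rest, and drop length outliers."""
    token_lists = [preprocess(r) for r in reviews if r and r.strip()]
    return [t for t in token_lists if min_tokens <= len(t) <= max_tokens]


if __name__ == "__main__":
    sample = [None, "", "Good", "Great course!! Visit https://example.com 😀 in 2023, loved it."]
    print(clean_corpus(sample))
```

Here the size constraint is applied to the token count after cleaning; the abstract does not say whether the original criterion is measured in tokens or characters, so the thresholds would need tuning against the actual data.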

Publisher

Anapub Publications

Subject

Electrical and Electronic Engineering, Computational Theory and Mathematics, Human-Computer Interaction, Computational Mechanics

Link

https://anapub.co.ke/journals/jmc/jmc_pdf/2023/jmc_volume_3-issue_4/JMC202303034.pdf

Reference: 73 articles.

1. M. Bansal, S. Verma, K. Vig, and K. Kakran, “Opinion Mining from Student Feedback Data Using Supervised Learning Algorithms,” Lecture Notes in Networks and Systems, pp. 411–418, 2022, doi: 10.1007/978-3-031-12413-6_32.

2. A. Ligthart, C. Catal, and B. Tekinerdogan, “Systematic reviews in sentiment analysis: a tertiary study,” Artificial Intelligence Review, vol. 54, no. 7, pp. 4997–5053, Mar. 2021, doi: 10.1007/s10462-021-09973-3.

3. B. Liu, “Sentiment Analysis,” Jun. 2015, doi: 10.1017/cbo9781139084789.

4. E. Cambria, “Affective Computing and Sentiment Analysis,” IEEE Intelligent Systems, vol. 31, no. 2, pp. 102–107, Mar. 2016, doi: 10.1109/mis.2016.31.

5. B. Liu, “Sentiment Analysis and Opinion Mining,” Synthesis Lectures on Human Language Technologies, vol. 5, no. 1, pp. 1–167, May 2012, doi: 10.2200/s00416ed1v01y201204hlt016.