Abstract
Social media websites and tweeting apps have seen a sharp rise in popularity in the recent years. One can express their opinions and sentiments about things, people, and events through these platforms. Arguments frequently start on social media platforms during discussions and debates and involve the usage of toxic comments, which are unpleasant, disrespectful, and hurtful statements. According to many, social networking sites must be able to identify these harmful comments. This research analyses several deep learning and machine learning methods like Convolutional Neural Network, Long Short -Term Memory, Support Vector Machine, Random Forest, and Naive Bayes for toxic comments classification along with the study that examines the effects of many word embedding methods including Word2Vector, Bag of Words, Global Vectors, Bidirectional Encoder Representations from Transformers, and Embeddings from Language Model on the classification of toxic comments and also the future scope of the research.
Publisher
Inventive Research Organization
Reference26 articles.
1. [1] Digital 2023: India –DataReportal-Global Digital Insights, february 2023 (online). Available: https://datareportal.com/reports/digital-2023-india.
2. [2] Digital Around the World – DataReportal (online). Available: https://datareportal.com/global-digital-overview.
3. [3] B. Gamback and U. K. Sikdar, ‘‘Using convolutional neural networks to classify hate-speech,’’ in Proc. 1st Workshop Abusive Lang. Online, 2017, pp. 85–90.
4. [4] M. Ibrahim, M. Torki, and N. El-Makky, ‘‘Imbalanced toxic comments classification using data augmentation and deep learning,’’ in Proc. 17th IEEE Int. Conf. Mach. Learn. Appl. (ICMLA), Dec. 2018, pp. 875–878.
5. [5] M. A. Saif, A. N. Medvedev, M. A. Medvedev, and T. Atanasova, ‘‘Classification of online toxic comments using the logistic regression and neural networks models,’’ AIP Conf. Proc., vol. 2048, no. 1, 2018, Art. no. 060011.