Abstract
People crave interaction and connection with other people. Therefore, social media became the center of society’s life. Among the brightest social media platforms nowadays with a massive number of daily users there is Instagram, which is due to its distinctive features. The excessive revealing of personal life has put users in the spots of getting bullied and harassed and getting toxic revues from other users. Numerous studies have targeted social media to fight its harmful side effects. Nevertheless, most of the datasets that were already available were in English, the Arabic Moroccan dialect ones were not. In this work, the Arabic Moroccan dialect dataset has been extracted from the Instagram platform. Furthermore, feature extraction techniques have been applied to the collected dataset to increase classification accuracy. Afterward, we developed models using machine learning and deep learning algorithms to detect and classify toxicity. For the models’ evaluation, we have used the most used metrics: accuracy, precision, F1-score, and recall. The experimental results gave modest scores of around 70% to 83%. These results imply that the models need improvement due to the lack of available datasets and the preprocessing libraries to handle the Moroccan dialect of Arabic.
Publisher
Institute of Advanced Engineering and Science
Subject
Electrical and Electronic Engineering,Control and Optimization,Computer Networks and Communications,Hardware and Architecture,Information Systems,Signal Processing
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献