Safeguarding Online Communications using DistilRoBERTa for Detection of Terrorism and Offensive Chats
-
Published:2024-06-29
Issue:1
Volume:7
Page:93-107
-
ISSN:1658-7790
-
Container-title:Journal of Information Security and Cybercrimes Research
-
Short-container-title:JISCR
Author:
Shah Mohamed Safwan Saalik1, Abuaieta Amr Mohamed2, Almazrouei Shaima Saeed2
Affiliation:
1. Middlesex University Dubai, Dubai, UAE. 2. Digital Forensics Expert at International Center for Forensic Science and Criminology, Dubai Police, Dubai, UAE.
Abstract
People use social media for both good and distasteful purposes. Malicious use raises significant concerns because it involves offensive language and hate speech that promote terrorism and other harmful behavior. To keep online environments safe, secure, and pleasant, these communications must be closely monitored so that the associated risks can be addressed before they become severe. With the help of AI, specifically Large Language Models (LLMs), text and speech can be analyzed quickly to determine whether a communication promotes these dangers or contains other toxic elements. The LLM used in this research is the DistilRoBERTa model from the Hugging Face Transformers library. The model was trained on datasets, obtained from publicly available sources, that consist of terrorism-related, offensive, and neutral conversations. The experimental results show that the model achieved 99% in accuracy, precision, recall, F1 score, and area under the ROC curve. Because real conversations are inaccessible due to restrictions, the model must be continuously fine-tuned to remain robust as communication behavior changes. A drag-and-drop interface is used to upload files and obtain the categorical output, ensuring seamless and easy interaction.
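The abstract reports accuracy, precision, recall, and F1 score for a three-class problem. As a rough illustration of how such figures can be computed, the sketch below uses plain Python on toy labels. The class names, the toy data, the function name `evaluate`, and the choice of macro averaging are all assumptions made for illustration; the abstract does not state how the authors averaged their metrics. The ROC curve is omitted because it requires predicted probabilities rather than hard labels.

```python
from collections import Counter

# Class names are assumed from the three dataset types named in the abstract.
LABELS = ["terrorism", "offensive", "neutral"]


def evaluate(y_true, y_pred, labels=LABELS):
    """Return accuracy and macro-averaged precision, recall, and F1.

    Macro averaging (an unweighted mean over classes) is an assumption;
    the abstract does not say which averaging scheme was used.
    """
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must be non-empty and equal in length")

    # Count every (true label, predicted label) pair once.
    pairs = Counter(zip(y_true, y_pred))

    precisions, recalls, f1s = [], [], []
    for label in labels:
        tp = pairs[(label, label)]
        fp = sum(c for (t, p), c in pairs.items() if p == label and t != label)
        fn = sum(c for (t, p), c in pairs.items() if t == label and p != label)

        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)

        precisions.append(precision)
        recalls.append(recall)
        f1s.append(f1)

    n = len(labels)
    return {
        "accuracy": sum(pairs[(l, l)] for l in labels) / len(y_true),
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f1": sum(f1s) / n,
    }


if __name__ == "__main__":
    # Toy labels for illustration only; these are not the paper's data.
    y_true = ["terrorism", "terrorism", "offensive", "offensive", "neutral", "neutral"]
    y_pred = ["terrorism", "offensive", "offensive", "offensive", "neutral", "neutral"]
    for name, value in evaluate(y_true, y_pred).items():
        print(f"{name}: {value:.3f}")
```

In practice the predicted labels would come from the fine-tuned classifier's output on a held-out test set, and a library such as scikit-learn could replace the hand-written counting.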
Publisher
Naif Arab University for Security Sciences