Cyberbullying identification in twitter using support vector machine and information gain based feature selection-Reference-Cited by-同舟云学术

Cyberbullying identification in twitter using support vector machine and information gain based feature selection

Published:2020-06-01 Issue:3 Volume:18 Page:1494
ISSN:2502-4760
Container-title:Indonesian Journal of Electrical Engineering and Computer Science
language:
Short-container-title:IJEECS

Author:

Dwi Purnamasari Ni Made Gita,Fauzi M. Ali,Indriati Indriati,Dewi Liana Shinta

Abstract

<span>Cyberbullying is one of the actions that violate the ITE Law where the crime is committed on social media applications such as Twitter. This action is difficult to detect if no one is reporting the tweet. Cyberbullying tweet identification aims to classify tweets that contain bullying. Classification is done using Support Vector Machine method where this method aims to find the dividing hyperplane between negative and positive class. This study is a text classification where more data is used, the more features are produced, therefore this research also uses Information Gain as feature selection to select features that are not relevant to the classification. The process of the system starts from text preprocessing with tokenizing, filtering, stemming and term weighting. Then perform the information gain feature selection by calculating the entropy value of each term. After that perform the classification process based on the terms that have been selected, and the output of the system is identification whether the tweet is bullying or not. The result of using SVM method is accuracy 75%, precision 70.27%, recall 86.66% and f-measure 77.61% on experiment maximum iteration = 20, λ = 0.5, γ = 0.001, ε = 0.000001, and C = 1. The best threshold of information gain is 90%, with accuracy 76.66%, precision 72.22%, recall 86.66% and f-measure 78.78%.</span>

Publisher

Institute of Advanced Engineering and Science

Subject

Electrical and Electronic Engineering,Control and Optimization,Computer Networks and Communications,Hardware and Architecture,Information Systems,Signal Processing

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Social Media Forensics: An Adaptive Cyberbullying-Related Hate Speech Detection Approach Based on Neural Networks With Uncertainty;IEEE Access;2024

2. Detection of Cyberbullying in Social Media Texts Using Explainable Artificial Intelligence;Communications in Computer and Information Science;2024

3. Conversation Graph Construction Approach of Cyberbully Detection Using Bully Scores;Lecture Notes in Networks and Systems;2024

4. Digital Resilience: A Review of Cutting-Edge Approaches to Cyberbullying Detection in the Social Media Landscape;2023 1st DMIHER International Conference on Artificial Intelligence in Education and Industry 4.0 (IDICAIEI);2023-11-27

5. A Comparative Study of Machine Learning Approaches for Cyber bullying Detection in Digital Forums;2023 International Conference on Advances in Computation, Communication and Information Technology (ICAICCIT);2023-11-23