Efficient detection of hacker community based on twitter data using complex networks and machine learning algorithm

Published:2021-06-21 Issue:6 Volume:40 Page:12321-12337
ISSN:1064-1246
Container-title:Journal of Intelligent & Fuzzy Systems
Short-container-title:IFS

Author:

Al-Tarawneh Ahmed¹, Al-Saraireh Ja’afer¹

Affiliation:

1. Computer Science Department, Princess Sumaya University for Technology, Amman, Jordan

Abstract

Twitter is one of the most popular platforms for sharing and posting ideas. Hackers and anonymous attackers use such platforms maliciously, and their behavior can be used to predict the risk of future attacks by gathering and classifying hackers’ tweets with machine-learning techniques. Previous approaches for detecting infected tweets rely on human effort or text analysis, and are therefore limited in capturing the hidden meaning between the lines of tweets. The main aim of this research paper is to enhance the efficiency of hacker detection on the Twitter platform using the complex networks technique with adapted machine learning algorithms. This work presents a methodology that collects a list of users from a hackers’ community on Twitter, together with the followers who share their posts and have similar interests. The list is built from a set of suggested keywords, which are terms commonly used by hackers in their tweets. After that, a complex network is generated for all users to find relations among them in terms of network centrality, closeness, and betweenness. After extracting these values, a dataset of the most influential users in the hacker community is assembled. Subsequently, tweets belonging to users in the extracted dataset are gathered and classified into positive and negative classes. The output of this process is then fed to a machine learning process that applies different algorithms. This research builds and investigates an accurate dataset containing real users who belong to a hackers’ community. Accuracy was measured as the share of correctly classified instances, averaged over the K-nearest neighbor, Naive Bayes, Random Tree, and support vector machine techniques, yielding about 90% and 88% for cross-validation and percentage split, respectively.
Consequently, the proposed network cyber Twitter model is able to detect hackers and determine whether tweets pose a future risk to institutions and individuals, providing early warning of possible attacks.
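
The network step described in the abstract (scoring users by centrality, closeness, and betweenness, then keeping the most influential ones) can be illustrated with a minimal sketch. This is not the authors' implementation: the five-account graph, the function names, and the choice to rank by a single measure are assumptions made for illustration, and the graph is treated as undirected, unweighted, and connected.

```python
from collections import deque

# Hypothetical undirected follower graph: account -> set of connected accounts.
graph = {
    "a": {"b", "c", "d"},
    "b": {"a"},
    "c": {"a"},
    "d": {"a", "e"},
    "e": {"d"},
}

def degree_centrality(g):
    """Fraction of the other accounts each account is directly linked to."""
    n = len(g)
    return {v: len(g[v]) / (n - 1) for v in g}

def bfs_distances(g, source):
    """Hop counts from source to every reachable account."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        v = queue.popleft()
        for w in g[v]:
            if w not in dist:
                dist[w] = dist[v] + 1
                queue.append(w)
    return dist

def closeness_centrality(g):
    """Reachable accounts divided by the sum of their hop distances."""
    result = {}
    for v in g:
        dist = bfs_distances(g, v)
        total = sum(dist.values())
        result[v] = (len(dist) - 1) / total if total > 0 else 0.0
    return result

def betweenness_centrality(g):
    """Unnormalized betweenness via Brandes' algorithm (undirected graph)."""
    score = {v: 0.0 for v in g}
    for s in g:
        order = []                       # accounts in order of BFS visit
        preds = {v: [] for v in g}       # predecessors on shortest paths
        sigma = {v: 0 for v in g}        # number of shortest paths from s
        sigma[s] = 1
        dist = {s: 0}
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in g[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in g}
        for w in reversed(order):        # accumulate dependencies backwards
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                score[w] += delta[w]
    # Each unordered pair was counted from both endpoints.
    return {v: b / 2 for v, b in score.items()}

def top_users(scores, k):
    """The k highest-scoring accounts, ties broken alphabetically."""
    return sorted(scores, key=lambda v: (-scores[v], v))[:k]

influential = top_users(betweenness_centrality(graph), 2)
```

In this toy graph, account `a` bridges most pairs of other accounts and `d` is the only route to `e`, so those two rank highest on betweenness. How the three measures are combined into one shortlist is not specified in the abstract, so the sketch ranks on one measure at a time.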

Publisher

IOS Press

Subject

Artificial Intelligence, General Engineering, Statistics and Probability

References: 13 articles.

1. Alsaffar D., Alfahhad A., Alqhtani B., Alamri L., Alansari S., Alqahtani N. and Alboaneen D.A., Machine and deep learning algorithms for Twitter spam detection. In International Conference on Advanced Intelligent Systems and Informatics. Springer, Cham, (pp. 483–491). (2019).

2. Benjamin V. and Chen H., Developing understanding of hacker language through the use of lexical semantics. In IEEE International Conference on Intelligence and Security Informatics (ISI), (pp. 79–84). (2015).

3. Deliu I., Leichter C. and Franke K., Collecting cyber threat intelligence from hacker forums via a two-stage hybrid process using support vector machines and latent Dirichlet allocation. 2018 IEEE International Conference on Big Data (pp. 5008–5013). New York, NY: IEEE. (2018).

4. The WEKA data mining software: An update;Hall;ACM SIGKDD Explorations Newsletter,2009

5. Social sentiment sensor in Twitter for predicting cyber-attacks using ℓ1 regularization;Hernandez-Suarez;Sensors Journal,2018

Cited by 9 articles.

1. Dissecting the infodemic: An in-depth analysis of COVID-19 misinformation detection on X (formerly Twitter) utilizing machine learning and deep learning techniques;Heliyon;2024-09

2. Hybridizing Base-Line 2D-CNN Model with Cat Swarm Optimization for Enhanced Advanced Persistent Threat Detection;2024 International Telecommunications Conference (ITC-Egypt);2024-07-22

3. Community Detection On Multi-layer Graph using Intra-layer and Inter-layer Linkage Graphs (CDMIILG);Expert Systems with Applications;2024-03

4. Fuzzy K-Means with M-KMP: a security framework in pyspark environment for intrusion detection;Multimedia Tools and Applications;2024-02-13

5. Urban community governance and machine learning: practice and prospects for intelligent decision making;Applied Mathematics and Nonlinear Sciences;2024-01-01