Towards Safe Cyber Practices: Developing a Proactive Cyber-Threat Intelligence System for Dark Web Forum Content by Identifying Cybercrimes-Reference-Cited by-同舟云学术

Towards Safe Cyber Practices: Developing a Proactive Cyber-Threat Intelligence System for Dark Web Forum Content by Identifying Cybercrimes

Published:2023-06-18 Issue:6 Volume:14 Page:349
ISSN:2078-2489
Container-title:Information
language:en
Short-container-title:Information

Author:

Sangher Kanti Singh¹,Singh Archana²^ORCID,Pandey Hari Mohan³^ORCID,Kumar Vivek⁴^ORCID

Affiliation:

1. School of IT, Centre for Development of Advanced Computing, Noida 201307, India

2. Amity School of Engineering and Technology, Amity University, Noida 201313, India

3. Department of Computing and Informatics, Bournemouth University, Fern Barrow, Poole BH12 5BB, UK

4. Department of Mathematics and Computer Science, University of Cagliari, 09124 Cagliari, Italy

Abstract

The untraceable part of the Deep Web, also known as the Dark Web, is one of the most used “secretive spaces” to execute all sorts of illegal and criminal activities by terrorists, cybercriminals, spies, and offenders. Identifying actions, products, and offenders on the Dark Web is challenging due to its size, intractability, and anonymity. Therefore, it is crucial to intelligently enforce tools and techniques capable of identifying the activities of the Dark Web to assist law enforcement agencies as a support system. Therefore, this study proposes four deep learning architectures (RNN, CNN, LSTM, and Transformer)-based classification models using the pre-trained word embedding representations to identify illicit activities related to cybercrimes on Dark Web forums. We used the Agora dataset derived from the DarkNet market archive, which lists 109 activities by category. The listings in the dataset are vaguely described, and several data points are untagged, which rules out the automatic labeling of category items as target classes. Hence, to overcome this constraint, we applied a meticulously designed human annotation scheme to annotate the data, taking into account all the attributes to infer the context. In this research, we conducted comprehensive evaluations to assess the performance of our proposed approach. Our proposed BERT-based classification model achieved an accuracy score of 96%. Given the unbalancedness of the experimental data, our results indicate the advantage of our tailored data preprocessing strategies and validate our annotation scheme. Thus, in real-world scenarios, our work can be used to analyze Dark Web forums and identify cybercrimes by law enforcement agencies and can pave the path to develop sophisticated systems as per the requirements.

Publisher

MDPI AG

Subject

Information Systems

Link

https://www.mdpi.com/2078-2489/14/6/349/pdf

Reference97 articles.

1. Guide to the Internet: The world wide web;Pallen;BMJ,1995

2. Gehl, R.W. (2018). Research Methods for the Digital Humanities, Springer.

3. The Dark Web: Defined, Discovered, Exploited;Mancini;Int. J. Cyber Res. Educ.,2019

4. The Dark Web dilemma: Tor, anonymity and online policing;Jardine;Glob. Comm. Internet Gov. Pap. Ser.,2015

5. Chertoff, M., and Simon, T. (2023, March 27). The Impact of the Dark Web on Internet Governance and Cyber Security. Available online: https://policycommons.net/artifacts/1203086/the-impact-of-the-dark-web-on-internet-goverannce-and-cyber-security/1756195/.

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Agriculture 4.0 and beyond: Evaluating cyber threat intelligence sources and techniques in smart farming ecosystems;Computers & Security;2024-05

2. Agriculture 4.0 and Beyond: Evaluating Cyber Threat Intelligence Sources and Techniques in Smart Farming Ecosystems;2024