Hybrid Approach for Phishing Website Detection Using Classification Algorithms-Reference-Cited by-同舟云学术

Hybrid Approach for Phishing Website Detection Using Classification Algorithms

Published:2022-12-20 Issue:3 Volume:3 Page:16-29
ISSN:2711-4627
Container-title:ParadigmPlus
language:
Short-container-title:paradigmplus

Author:

Raj Mukta Mithra,Arul Jothi J. Angel

Abstract

The internet has significantly altered how we work and interact with one another.Statistics show 63.1 % of the present world population are internet users. This clearly indicates how heavily man is dependent on digital media. Digital media users are on the rise and so is the incidence of cyber crimes. People who lack experience and knowledge are more vulnerable and susceptible to phishing scams.The victims experience severe consequences as their personal credentials are at stake. Phishers use publicly available sources to acquire details about the victim's professional and personal history.Countermeasures must be implemented with the highest priority. Detection of malicious websites can significantly reduce the risk of phishing attempts.In this research, a highly accurate website phishing detection method based on URL features is proposed. We investigated eight existing machine learning classification techniques for this, including extreme gradient boosting (XGBoost), random forest (RF), adaptive boosting (AdaBoost), decision trees (DT), K-nearest neighbors (KNN), support vector machines (SVM), logistic regression and naïve bayes (NB) to detect malicious websites.The results show that XGboost had the best accuracy with a score of 96.71%, followed by random forest and AdaBoost.We further experimented with various hybrid combinations of the top three classifiers and observed that XGboost-Random Forest hybrid algorithms produced the best results.The hybrid model classified the websites as legitimate or phishing with an accuracy of 97.07%.

Publisher

ITI Research Group

Reference21 articles.

1. J. Fruhlinger, "What is phishing? Examples, types, and techniques." https://www.csoonline.com/article/2117843/what-is-phishing-examples-types-and-techniques.html, 2022.

2. IBM, "Cost of a data breach report 2021." https://www.dataendure.com/wp-content/uploads/2021_Cost_of_a_Data_Breach_-2.pdf, 2021.

3. J. Gu and H. Xu, "An ensemble method for phishing websites detection based on XGBoost," in 2022 14th international conference on computer research and development (ICCRD), 2022, pp. 214-219.

4. A. Maini, N. Kakwani, B. Ranjitha, M. Shreya, and R. Bharathi, "Improving the performance of semantic-based phishing detection system through ensemble learning method," in 2021 IEEE mysore sub section international conference (MysuruCon), 2021, pp. 463-469.

5. A. Pandey, N. Gill, K. Sai Prasad Nadendla, and I. S. Thaseen, "Identification of phishing attack in websites using random forest-svm hybrid model," in International conference on intelligent systems design and applications, 2018, pp. 120-128.

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Enhancing Cybersecurity: A Comprehensive Analysis of Machine Learning Techniques in Detecting and Preventing Phishing Attacks with a Focus on Xgboost Algorithm;2024 International Conference on Intelligent Systems for Cybersecurity (ISCS);2024-05-03

2. Web Extension For Phishing Website Identification: A Browser-Based Security Solution;2023 International Conference on Research Methodologies in Knowledge Management, Artificial Intelligence and Telecommunication Engineering (RMKMATE);2023-11-01

3. Single and Hybrid-Ensemble Learning-Based Phishing Website Detection: Examining Impacts of Varied Nature Datasets and Informative Feature Selection Technique;Digital Threats: Research and Practice;2023-09-30

4. Phishing Prediction on Website Updates with Novel Features Through Machine Learning;2023 5th International Conference on Inventive Research in Computing Applications (ICIRCA);2023-08-03