Affiliation:
1. Rao Bahadur Y Mahabaleswarappa Engineering College, Ballari, India
Abstract
Phishing, a cybercriminal's attempted attack, is a social web-engineering attack in which valuable data or personal information might be stolen from either email addresses or websites. There are many methods available to detect phishing, but new ones are being introduced in an attempt to increase detection accuracy and decrease phishing websites success to steal information. Phishing is generally detected using Machine Learning methods with different kinds of algorithms. In this study, our aim is to use Machine Learning to detect phishing websites. We used the data from Kaggle consisting of 86 features and 11,430 total URLs, half of them are phishing and half of them are legitimate. We trained our data using Decision Tree (DT), Random Forest (RF), XGBoost, Multilayer Perceptrons, K-Nearest Neighbors, Naive Bayes, AdaBoost, and Gradient Boosting and reached the highest accuracy of 96.6using X G Boost
Reference8 articles.
1. Mohammed Hazim Alkawaz, Stephanie Joanne Steven and Asif Iqbal Hajamydeen, "Detecting Phishing Website Using Machine Learning", 16th IEEE International Colloquium on Signal Processing its Applications (CSPA 2020), 28-29 Feb. 2020.
2. Sandeep Kumar Satapathy, Shruti Mishra, Pradeep Kumar Mallick, Lavanya Badiginchala, Ravali Reddy Gudur and Siri Chandana Guttha, "Classification of Features for detecting Phishing Web Sites based on Machine Learning Techniques", International Journal of Innovative Technology and Exploring Engineering (IJITEE), vol. 8, no. 8S2, June 2019, ISSN 2278-3075
3. Vaibhav Patil and S. P. Godse, Detection and Prevention of Phishing Websites using Machine Learning Approach, ISBN 978-1-5386-5257-2018.
4. S Nandhini and V. Vasanthi, Extraction of Features and Classification on Phishing Websites using Web Mining Techniques, vol. 5, no. 4, 2017, ISSN 2321-9939.
5. Sagar Patil, Yogesh Shetye and Nilesh Shendage, "Detecting Phishing Websites", International Research Journal of Engineering and Technology, vol. 07, no. 02, Feb 2020.