Affiliation:
1. Department of Computer Science and Engineering Indian Institution of Technology (IIT) Roorkee India
2. Department of Computer Science and Engineering Indian Institution of Technology (IIT) Jammu India
Abstract
AbstractIn the past, many techniques like blacklisting/whitelisting, third‐party, search engine, visual similarity, heuristic, URL features, and website content were used for anti‐phishing. Search engine‐based, third‐party assisted tools and blacklist/whitelist fail to identify new phishing attacks resulting in high FPR. Heuristic and visual similarity approaches are slow, whereas URL and web content‐based techniques do not mimic the dynamic content of current websites and hence cannot stop zero‐day attacks. A study was conducted to understand the critical features used in the past for anti‐phishing, and we found 16 HTTP header features that were novel. In this paper, we have developed a real‐time, highly scalable, feature‐rich anti‐phishing detection technique based on ML that extracts the HTTP headers (predominantly security headers) from web pages to identify them as legitimate or phished. It is observed that phishing sites are short‐lived and are created to achieve a specific objective, like stealing the credential of a user. Once the goal is met, the sites are pulled down immediately. Hence these sites do not take pain to use the security features of web technology and only focus on making the site as similar as possible to the original website. Test results based on our novel features show high accuracy of 97.8% with an average response time of 1.57 s. We have created multiple datasets for different scenarios, like a dataset for website creation through phishing tools and a new dataset for testing unseen phishing attacks. The results thus obtained show detection accuracy of 99% and 95%, respectively.
Subject
Electrical and Electronic Engineering
Reference36 articles.
1. A comprehensive survey of AI-enabled phishing attacks detection techniques
2. Overview of phishing landscape and homographs in Arabic domain names;Ahmad H;Secur Privacy,2021
3. Leverage website favicon to detect phishing websites;Chiew KL;J Secur Commun Netw,2018
4. A novel approach to protect against phishing attacks at client side using auto‐updated white‐list;Jain AK;EURASIP J Inf Secur,2016
5. Towards detection of phishing websites on client-side using machine learning based approach
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献