HTTP header based phishing attack detection using machine learning

Author:

Shukla Sanjeev 1, Misra Manoj 1, Varshney Gaurav 2

Affiliation:

1. Department of Computer Science and Engineering, Indian Institute of Technology (IIT) Roorkee, India

2. Department of Computer Science and Engineering, Indian Institute of Technology (IIT) Jammu, India

Abstract

Earlier anti‐phishing techniques relied on blacklisting/whitelisting, third‐party services, search engines, visual similarity, heuristics, URL features, and website content. Search engine‐based tools, third‐party assisted tools, and blacklists/whitelists fail to identify new phishing attacks, resulting in a high false positive rate (FPR). Heuristic and visual similarity approaches are slow, whereas URL and web content‐based techniques do not capture the dynamic content of current websites and hence cannot stop zero‐day attacks. We studied the critical features used in past anti‐phishing work and identified 16 HTTP header features that had not been used before. In this paper, we develop a real‐time, highly scalable, feature‐rich phishing detection technique based on machine learning (ML) that extracts HTTP headers (predominantly security headers) from web pages to classify them as legitimate or phishing. Phishing sites are observed to be short‐lived and created for a specific objective, such as stealing a user's credentials; once the goal is met, they are taken down immediately. Such sites therefore do not take the trouble to use the security features of web technology and focus only on making the site resemble the original website as closely as possible. Tests based on our novel features show an accuracy of 97.8% with an average response time of 1.57 s. We also created multiple datasets for different scenarios, including a dataset of websites created with phishing tools and a new dataset for testing unseen phishing attacks, on which detection accuracy is 99% and 95%, respectively.
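The idea of turning response headers into classifier inputs can be illustrated with a minimal sketch. The abstract does not list the 16 features, so the header names below are an assumed, hypothetical feature set made up of widely deployed HTTP security headers, and the function names `header_features` and `to_vector` are illustrative, not the authors' implementation.

```python
from typing import Dict, List, Mapping

# Hypothetical feature set: well-known HTTP security headers.
# The paper's actual 16 features are not given in the abstract.
SECURITY_HEADERS: List[str] = [
    "strict-transport-security",
    "content-security-policy",
    "x-frame-options",
    "x-content-type-options",
    "referrer-policy",
    "permissions-policy",
]


def header_features(headers: Mapping[str, str]) -> Dict[str, int]:
    """Map response headers to binary presence features (1 = header present)."""
    # HTTP header names are case-insensitive, so compare in lowercase.
    present = {name.lower() for name in headers}
    return {name: int(name in present) for name in SECURITY_HEADERS}


def to_vector(headers: Mapping[str, str]) -> List[int]:
    """Return the features as a fixed-order list suitable for a classifier."""
    feats = header_features(headers)
    return [feats[name] for name in SECURITY_HEADERS]


if __name__ == "__main__":
    # Example headers as they might be returned by a site; values are made up.
    sample = {
        "Strict-Transport-Security": "max-age=31536000",
        "X-Frame-Options": "DENY",
        "Server": "nginx",
    }
    print(to_vector(sample))
```

A vector like this could be passed to any standard classifier; the abstract does not say which model the authors use, so that choice is left open here.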

Publisher

Wiley

Subject

Electrical and Electronic Engineering

