An optimization-based deep belief network for the detection of phishing e-mails

Published:2020-07-16 Issue:4 Volume:54 Page:529-549
ISSN:2514-9288
Container-title:Data Technologies and Applications
language:en
Short-container-title:DTA

Author:

M. Arshey,K. S. Angel Viji

Abstract

Purpose: Phishing is a serious cybersecurity problem in which attackers use channels such as e-mail and Short Message Service (SMS) to collect individuals' personal information. The rapid growth of unsolicited and unwanted messages makes effective anti-phishing methods necessary.

Design/methodology/approach: This research designs and develops an approach for preventing phishing by proposing an optimization algorithm. The approach handles phishing e-mails in four steps: preprocessing, feature extraction, feature selection and classification. First, the input data set is preprocessed by removing stop words and applying stemming, and the preprocessed output is passed to feature extraction. Keyword frequencies are extracted from the preprocessed data, and the important words are selected as features. Feature selection then uses the Bhattacharyya distance so that only the significant features that can aid classification are retained. The selected features are classified by a deep belief network (DBN) trained with the proposed fractional-earthworm optimization algorithm (EWA). The proposed fractional-EWA integrates EWA with fractional calculus to determine the DBN weights optimally.

Findings: The accuracy of naive Bayes (NB), DBN, neural network (NN), EWA-DBN and fractional EWA-DBN is 0.5333, 0.5455, 0.5556, 0.5714 and 0.8571, respectively. The sensitivity of NB, DBN, NN, EWA-DBN and fractional EWA-DBN is 0.4558, 0.5631, 0.7035, 0.7045 and 0.8182, respectively. Likewise, the specificity of NB, DBN, NN, EWA-DBN and fractional EWA-DBN is 0.5052, 0.5631, 0.7028, 0.7040 and 0.8800, respectively. The comparative table shows that the proposed method achieved the highest accuracy, sensitivity and specificity among the methods compared.

Originality/value: This paper performs e-mail phishing detection using optimization-based deep learning networks. E-mail traffic includes many unwanted messages, which must be detected to avoid storage issues. The importance of the method is that including historical data in the detection process enhances detection accuracy.
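
The first three steps described in the abstract (preprocessing, keyword-frequency extraction and Bhattacharyya-distance feature selection) might be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation. The abstract does not say how the distance is computed, so the sketch assumes each keyword's frequency is modelled as a one-dimensional Gaussian per class. The stop-word list, the `crude_stem` helper, the function names and the `eps` smoothing term are all hypothetical. DBN training with fractional-EWA is not covered.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (not from the paper).
STOP_WORDS = {"a", "an", "the", "to", "of", "and", "is", "in",
              "your", "you", "for", "at", "on", "we", "are"}


def crude_stem(word):
    """Rough suffix stripping; a stand-in for a real stemmer such as Porter's."""
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def preprocess(text):
    """Lowercase, tokenize, drop stop words, then stem."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [crude_stem(t) for t in tokens if t not in STOP_WORDS]


def keyword_frequencies(tokens, vocabulary):
    """Relative frequency of each vocabulary word in one e-mail."""
    counts = Counter(tokens)
    total = len(tokens) or 1
    return [counts[word] / total for word in vocabulary]


def bhattacharyya_gaussian(xs, ys, eps=1e-6):
    """Bhattacharyya distance between 1-D Gaussians fitted to xs and ys.

    Assumption: a Gaussian model per class; eps keeps variances non-zero.
    """
    mu_x, mu_y = sum(xs) / len(xs), sum(ys) / len(ys)
    var_x = sum((x - mu_x) ** 2 for x in xs) / len(xs) + eps
    var_y = sum((y - mu_y) ** 2 for y in ys) / len(ys) + eps
    mean_term = 0.25 * (mu_x - mu_y) ** 2 / (var_x + var_y)
    var_term = 0.5 * math.log((var_x + var_y) / (2 * math.sqrt(var_x * var_y)))
    return mean_term + var_term


def select_features(emails, labels, top_k):
    """Rank keywords by class separation (label 1 = phishing, 0 = legitimate)."""
    token_lists = [preprocess(e) for e in emails]
    vocabulary = sorted({t for tokens in token_lists for t in tokens})
    rows = [keyword_frequencies(tokens, vocabulary) for tokens in token_lists]
    scores = {}
    for j, word in enumerate(vocabulary):
        phish = [row[j] for row, y in zip(rows, labels) if y == 1]
        legit = [row[j] for row, y in zip(rows, labels) if y == 0]
        scores[word] = bhattacharyya_gaussian(phish, legit)
    return sorted(vocabulary, key=lambda w: scores[w], reverse=True)[:top_k]
```

Calling `select_features(emails, labels, top_k=50)` on a labelled corpus containing both classes would return the 50 keywords whose frequencies differ most between phishing and legitimate e-mails under this Gaussian assumption. Those frequencies would then form the input vector for the classifier.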

Publisher

Emerald

Subject

Library and Information Sciences,Information Systems

Reference: 35 articles.

1. Semi-supervised learning using frequent itemset and ensemble learning for SMS classification;Expert Systems with Applications,2015

2. Secret sharing in visual cryptography using NVSS and data hiding techniques,2015

3. An experimental comparison of Naive Bayesian and keyword-based anti-spam filtering with personal e-mail messages,2000

4. Factorial design analysis applied to the performance of SMS anti-spam filtering systems;Expert Systems with Applications,2016

5. Automated document classification for news article in Bahasa Indonesia based on term frequency inverse document frequency (TF-IDF) approach,2004

Cited by 5 articles.

1. The Power of Persuasion: Exploring Social Engineering in the Digital Age;Studies in Computational Intelligence;2024

2. Applications of deep learning for phishing detection: a systematic literature review;Knowledge and Information Systems;2022-05-23

3. A Systematic Literature Review on Phishing Email Detection Using Natural Language Processing Techniques;IEEE Access;2022

4. Phishing Classification Techniques: A Systematic Literature Review;IEEE Access;2022

5. Deep Learning for Phishing Detection: Taxonomy, Current Challenges and Future Directions;IEEE Access;2022