A Spam Filtering Method Based on Multi-Modal Fusion-Reference-Cited by-同舟云学术

A Spam Filtering Method Based on Multi-Modal Fusion

Published:2019-03-19 Issue:6 Volume:9 Page:1152
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Yang Hong^ORCID,Liu Qihe,Zhou Shijie,Luo Yang

Abstract

In recent years, the single-modal spam filtering systems have had a high detection rate for image spamming or text spamming. To avoid detection based on the single-modal spam filtering systems, spammers inject junk information into the multi-modality part of an email and combine them to reduce the recognition rate of the single-modal spam filtering systems, thereby implementing the purpose of evading detection. In view of this situation, a new model called multi-modal architecture based on model fusion (MMA-MF) is proposed, which use a multi-modal fusion method to ensure it could effectively filter spam whether it is hidden in the text or in the image. The model fuses a Convolutional Neural Network (CNN) model and a Long Short-Term Memory (LSTM) model to filter spam. Using the LSTM model and the CNN model to process the text and image parts of an email separately to obtain two classification probability values, then the two classification probability values are incorporated into a fusion model to identify whether the email is spam or not. For the hyperparameters of the MMA-MF model, we use a grid search optimization method to get the most suitable hyperparameters for it, and employ a k-fold cross-validation method to evaluate the performance of this model. Our experimental results show that this model is superior to the traditional spam filtering systems and can achieve accuracies in the range of 92.64–98.48%.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/9/6/1152/pdf

Reference32 articles.

1. Kaspersky Lab Spam and Phishing Report: FIFA 2018 and Bitcoin among 2017’s Most Luring Topicshttps://usa.kaspersky.com/about/press-releases/2018_fifa-2018-and-bitcoin-among-2017-most-luring-topics

2. Learning to Filter Unsolicited Commercial E-Mail;Androutsopoulos,2014

3. A Bayesian approach to filtering junk e-mail;Sahami,1998

Cited by 35 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Analysis of Machine Learning Models for Spam Email Detection and Real-Time Integration;2024 International Conference on Science, Engineering and Business for Driving Sustainable Development Goals (SEB4SDG);2024-04-02

2. Framework Based on Simulation of Real-World Message Streams to Evaluate Classification Solutions;Algorithms;2024-01-21

3. MMTD: A Multilingual and Multimodal Spam Detection Model Combining Text and Document Images;Applied Sciences;2023-10-27

4. Advanced Machine Learning Model to Detect Spam on Instagram;2023 IEEE International Conference on Blockchain and Distributed Systems Security (ICBDS);2023-10-06

5. Efficient e-mail spam filtering approach combining Logistic Regression model and Orthogonal Atomic Orbital Search algorithm;Applied Soft Computing;2023-09