Image Spam Detection Using Machine Learning and Natural Language Processing

Published:2020 Issue:2 Volume:55
ISSN:0258-2724
Container-title:Journal of Southwest Jiaotong University
language:en

Author:

Yaseen Yaseen Khather, Abbas Alaa Khudhair, Sana Ahmed M.

Abstract

Today, images are a part of communication between people. However, images are also used to share information by hiding and embedding messages within them, and images received through social media or email can contain harmful content that users cannot see and are therefore not aware of. This paper presents a model for detecting spam in images. The model combines optical character recognition, natural language processing, and a machine learning algorithm. Optical character recognition extracts the text from images, and natural language processing uses linguistic capabilities to detect and classify the language, distinguishing normal text from slang. The features for the selected images are then extracted using the bag-of-words model, and the machine learning algorithm is run to detect any kind of spam they may contain. Finally, the model predicts whether or not the image contains any harmful content. The results show that the proposed method, which combines the machine learning algorithm, optical character recognition, and natural language processing, provides higher detection accuracy than using machine learning alone.
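
The pipeline outlined in the abstract (extract text, build bag-of-words features, classify) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not name the classifier, so multinomial Naive Bayes is an assumption here, the OCR step is passed in as a hypothetical callable (`ocr`) and replaced by a stand-in, the language and slang detection step is omitted, and the training texts are toy data invented for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


class BagOfWordsNB:
    """Multinomial Naive Bayes over bag-of-words counts (assumed classifier)."""

    def fit(self, texts, labels):
        self.word_counts = {}            # label -> Counter of token counts
        self.doc_counts = Counter(labels)
        self.vocab = set()
        for text, label in zip(texts, labels):
            tokens = tokenize(text)
            self.word_counts.setdefault(label, Counter()).update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, text):
        # Bag-of-words features: counts of known vocabulary tokens.
        counts = Counter(t for t in tokenize(text) if t in self.vocab)
        total_docs = sum(self.doc_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.doc_counts.items():
            wc = self.word_counts[label]
            denom = sum(wc.values()) + len(self.vocab)  # Laplace smoothing
            score = math.log(n_docs / total_docs)       # log prior
            for token, n in counts.items():
                score += n * math.log((wc[token] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def classify_image(image, ocr, model):
    """Run the supplied OCR callable on the image, then classify its text."""
    return model.predict(ocr(image))


# Toy training data, invented for illustration only.
train_texts = [
    "win a free prize now click here",
    "cheap pills free offer limited time",
    "meeting agenda for monday morning",
    "photos from our family holiday",
]
train_labels = ["spam", "spam", "ham", "ham"]
model = BagOfWordsNB().fit(train_texts, train_labels)


def fake_ocr(image):
    """Stand-in for a real OCR engine: returns text stored with the image."""
    return image["text"]


print(classify_image({"text": "Click here for a FREE prize"}, fake_ocr, model))
```

In a real setting, `fake_ocr` would be replaced by a call into an OCR engine, and the classifier would be trained on labeled text extracted from actual spam and non-spam images.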

Publisher

Southwest Jiaotong University

Subject

Multidisciplinary

Cited by 3 articles.

1. Managing Spam Images on Android: An Approach Utilizing Machine Learning and NLP;Lecture Notes in Networks and Systems;2024

2. A Novel Hybrid Multi-Modal Deep Learning for Detecting Hashtag Incongruity on Social Media;Sensors;2022-12-15

3. YOLO based Efficient Vigorous Scene Detection And Blurring for Harmful Content Management to Avoid Children’s Destruction;2022 3rd International Conference on Electronics and Sustainable Communication Systems (ICESC);2022-08-17