Affiliation:
1. Jerusalem College of Technology, Department of Computer Science 21 Havaad Haleumi St ., Jerusalem , Israel
Abstract
Abstract
The prevalence of sensationalized headlines and deceptive narratives in online content has prompted the need for effective clickbait detection methods. This study delves into the nuances of clickbait in Hebrew, scrutinizing diverse features such as linguistic and structural features, and exploring various types of clickbait in Hebrew, a language that has received relatively limited attention in this context. Utilizing a range of machine learning models, this research aims to identify linguistic features that are instrumental in accurately classifying Hebrew headlines as either clickbait or non-clickbait. The findings underscore the critical role of linguistic attributes in enhancing the performance of the classification model. Notably, the employment of a machine learning model resulted in an impressive accuracy of 0.87 in clickbait detection. Moving forward, our research plan encompasses dataset expansion through the best machine learning model assisted labelling, with the objective of optimizing deep learning models for even more robust outcomes. This study not only advances clickbait detection in the realm of Hebrew but also emphasizes the fundamental importance of linguistic features in the accurate classification of clickbait.
Subject
Linguistics and Language,Communication,Language and Linguistics