CA-CD: context-aware clickbait detection using new Chinese clickbait dataset with transfer learning method-Reference-Cited by-同舟云学术

CA-CD: context-aware clickbait detection using new Chinese clickbait dataset with transfer learning method

Published:2023-08-29 Issue: Volume: Page:
ISSN:2514-9288
Container-title:Data Technologies and Applications
language:en
Short-container-title:DTA

Author:

Wang Hei-Chia^ORCID,Maslim Martinus^ORCID,Liu Hung-Yu

Abstract

PurposeA clickbait is a deceptive headline designed to boost ad revenue without presenting closely relevant content. There are numerous negative repercussions of clickbait, such as causing viewers to feel tricked and unhappy, causing long-term confusion, and even attracting cyber criminals. Automatic detection algorithms for clickbait have been developed to address this issue. The fact that there is only one semantic representation for the same term and a limited dataset in Chinese is a need for the existing technologies for detecting clickbait. This study aims to solve the limitations of automated clickbait detection in the Chinese dataset.Design/methodology/approachThis study combines both to train the model to capture the probable relationship between clickbait news headlines and news content. In addition, part-of-speech elements are used to generate the most appropriate semantic representation for clickbait detection, improving clickbait detection performance.FindingsThis research successfully compiled a dataset containing up to 20,896 Chinese clickbait news articles. This collection contains news headlines, articles, categories and supplementary metadata. The suggested context-aware clickbait detection (CA-CD) model outperforms existing clickbait detection approaches on many criteria, demonstrating the proposed strategy's efficacy.Originality/valueThe originality of this study resides in the newly compiled Chinese clickbait dataset and contextual semantic representation-based clickbait detection approach employing transfer learning. This method can modify the semantic representation of each word based on context and assist the model in more precisely interpreting the original meaning of news articles.

Publisher

Emerald

Subject

Library and Information Sciences,Information Systems

Reference51 articles.

1. Clickbait detection using deep learning,2016

2. Experimental evaluation of clickbait detection using machine learning models;Intelligent Automation & Soft Computing,2020

3. An improved multiple features and machine learning-based approach for detecting clickbait news on social networks;Applied Sciences,2021

4. We used neural networks to detect clickbaits: you won't believe what happened next!,2017

5. “8 amazing secrets for getting more clicks”: detecting clickbaits in news streams using article informality;Proceedings of the AAAI Conference on Artificial Intelligence,2016