Leveraging Zero and Few-Shot Learning for Enhanced Model Generality in Hate Speech Detection in Spanish and English

Published: 2023-12-18 Issue: 24 Volume: 11 Page: 5004
ISSN: 2227-7390
Container-title: Mathematics
Language: en
Short-container-title: Mathematics

Author:

García-Díaz José Antonio¹, Pan Ronghao¹, Valencia-García Rafael¹

Affiliation:

1. Facultad de Informática, Universidad de Murcia, Campus de Espinardo, 30100 Murcia, Spain

Abstract

Supervised training has traditionally been the cornerstone of hate speech detection models, but it often falls short when faced with unseen scenarios. Zero and few-shot learning offers an interesting alternative to traditional supervised approaches. In this paper, we explore the advantages of zero and few-shot learning over supervised training, with a particular focus on hate speech detection datasets covering different domains and levels of complexity. We evaluate the generalization capabilities of generative models such as T5, BLOOM, and Llama-2. These models have shown promise in text generation and have demonstrated the ability to learn from limited labeled data. Moreover, by evaluating their performance on both Spanish and English datasets, we gain insight into their cross-lingual applicability and versatility, thus contributing to a broader understanding of generative models in natural language processing. Our results highlight the potential of generative models to bridge the gap between data scarcity and model performance across languages and domains.

Funder

Agencia Estatal de Investigación

European Union

Publisher

MDPI AG

Subject

General Mathematics, Engineering (miscellaneous), Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/11/24/5004/pdf

Cited by 1 article.

1. User Story Classification with Machine Learning and LLMs. Lecture Notes in Computer Science, 2024.