Understanding Interpretability: Explainable AI Approaches for Hate Speech Classifiers-Reference-Cited by-同舟云学术

Understanding Interpretability: Explainable AI Approaches for Hate Speech Classifiers

Published:2023 Issue: Volume: Page:47-70
ISSN:1865-0929
Container-title:Communications in Computer and Information Science
language:
Short-container-title:

Author:

Yadav Sargam^ORCID,Kaushik Abhishek^ORCID,McDaid Kevin^ORCID

Publisher

Springer Nature Switzerland

Link

https://link.springer.com/content/pdf/10.1007/978-3-031-44070-0_3

Reference34 articles.

1. Atanasova, P., Simonsen, J.G., Lioma, C., Augenstein, I.: A diagnostic study of explainability techniques for text classification. arXiv preprint arXiv:2009.13295 (2020)

2. Attanasio, G., Nozza, D., Pastor, E., Hovy, D.: Benchmarking post-hoc interpretability approaches for transformer-based misogyny detection. In: Proceedings of NLP Power! The First Workshop on Efficient Benchmarking in NLP, pp. 100–112 (2022)

3. Biradar, S., Saumya, S., et al.: Fighting hate speech from bilingual Hinglish speaker’s perspective, a transformer-and translation-based approach. Soc. Netw. Anal. Min. 12(1), 1–10 (2022)

4. Buitinck, L., et al.: API design for machine learning software: experiences from the scikit-learn project. In: ECML PKDD Workshop: Languages for Data Mining and Machine Learning, pp. 108–122 (2013)

5. Camburu, O.M., Rocktäschel, T., Lukasiewicz, T., Blunsom, P.: e-SNLI: natural language inference with natural language explanations. In: Advances in Neural Information Processing Systems, vol. 31 (2018)

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Using Explainable AI (XAI) for Identification of Subjectivity in Hate Speech Annotations for Low-Resource Languages;4th International Workshop on OPEN CHALLENGES IN ONLINE SOCIAL NETWORKS;2024-09-10

2. The Explainability of Transformers: Current Status and Directions;Computers;2024-04-04