A General Framework for Implicit and Explicit Debiasing of Distributional Word Vector Spaces

Published: 2020-04-03 Issue: 05 Volume: 34 Page: 8131-8138
ISSN: 2374-3468
Container-title: Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title: AAAI

Author:

Lauscher Anne, Glavaš Goran, Ponzetto Simone Paolo, Vulić Ivan

Abstract

Distributional word vectors have recently been shown to encode many of the human biases, most notably gender and racial biases, and models for attenuating such biases have consequently been proposed. However, existing models and studies (1) operate on under-specified and mutually differing bias definitions, (2) are tailored for a particular bias (e.g., gender bias) and (3) have been evaluated inconsistently and non-rigorously. In this work, we introduce a general framework for debiasing word embeddings. We operationalize the definition of a bias by discerning two types of bias specification: explicit and implicit. We then propose three debiasing models that operate on explicit or implicit bias specifications and that can be composed towards more robust debiasing. Finally, we devise a full-fledged evaluation framework in which we couple existing bias metrics with newly proposed ones. Experimental findings across three embedding methods suggest that the proposed debiasing models are robust and widely applicable: they often completely remove the bias both implicitly and explicitly without degradation of semantic information encoded in any of the input distributional spaces. Moreover, we successfully transfer debiasing models, by means of cross-lingual embedding spaces, and remove or attenuate biases in distributional word vector spaces of languages that lack readily available bias specifications.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 5 articles.

1. Empirical Study and Mitigation Methods of Bias in LLM-Based Robots; Academic Journal of Science and Technology; 2024-08-20

2. A Scoping Review of Children, Empowerment, and Smartphone Technology Regarding Social Construction Theory with the Aim of Increasing Self-Direction in Democracies; Social Sciences; 2024-03-31

3. Quantifying Gender Bias in Arabic Pre-Trained Language Models; IEEE Access; 2024

4. Stereotype and Categorical Bias Evaluation via Differential Cosine Bias Measure; 2022 IEEE International Conference on Big Data (Big Data); 2022-12-17

5. Did You Just Assume My Vector? Detecting Gender Stereotypes in Word Embeddings; Communications in Computer and Information Science; 2021