On Measuring and Mitigating Biased Inferences of Word Embeddings-Reference-Cited by-同舟云学术

On Measuring and Mitigating Biased Inferences of Word Embeddings

Published:2020-04-03 Issue:05 Volume:34 Page:7659-7666
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Dev Sunipa,Li Tao,Phillips Jeff M.,Srikumar Vivek

Abstract

Word embeddings carry stereotypical connotations from the text they are trained on, which can lead to invalid inferences in downstream models that rely on them. We use this observation to design a mechanism for measuring stereotypes using the task of natural language inference. We demonstrate a reduction in invalid inferences via bias mitigation strategies on static word embeddings (GloVe). Further, we show that for gender bias, these techniques extend to contextualized embeddings when applied selectively only to the static components of contextualized embeddings (ELMo, BERT).

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 23 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Towards trustworthy LLMs: a review on debiasing and dehallucinating in large language models;Artificial Intelligence Review;2024-08-10

2. Believing Anthropomorphism: Examining the Role of Anthropomorphic Cues on Trust in Large Language Models;Extended Abstracts of the CHI Conference on Human Factors in Computing Systems;2024-05-11

3. “They only care to show us the wheelchair”: disability representation in text-to-image AI models;Proceedings of the CHI Conference on Human Factors in Computing Systems;2024-05-11

4. VERB: Visualizing and Interpreting Bias Mitigation Techniques Geometrically for Word Representations;ACM Transactions on Interactive Intelligent Systems;2024-01-09

5. Measurement and Mitigation of Bias in Artificial Intelligence: A Narrative Literature Review for Regulatory Science;Clinical Pharmacology & Therapeutics;2023-12-12