Quantifying gender bias towards politicians in cross-lingual language models

Author:

Stańczak KarolinaORCID,Ray Choudhury Sagnik,Pimentel Tiago,Cotterell Ryan,Augenstein Isabelle

Abstract

Recent research has demonstrated that large pre-trained language models reflect societal biases expressed in natural language. The present paper introduces a simple method for probing language models to conduct a multilingual study of gender bias towards politicians. We quantify the usage of adjectives and verbs generated by language models surrounding the names of politicians as a function of their gender. To this end, we curate a dataset of 250k politicians worldwide, including their names and gender. Our study is conducted in seven languages across six different language modeling architectures. The results demonstrate that pre-trained language models’ stance towards politicians varies strongly across analyzed languages. We find that while some words such as dead, and designated are associated with both male and female politicians, a few specific words such as beautiful and divorced are predominantly associated with female politicians. Finally, and contrary to previous findings, our study suggests that larger language models do not tend to be significantly more gender-biased than smaller ones.

Funder

Danmarks Frie Forskningsfond

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

Reference76 articles.

1. The Importance of Political Knowledge for Effective Citizenship: Differences Between the Broadcast and Internet Generations;MS Kleinberg;Public Opinion Quarterly,2019

2. Social media and political discussion: when online presence silences offline conversation;KN Hampton;Information, Communication & Society,2017

3. Political Effects of the Internet and Social Media;E Zhuravskaya;Annual Review of Economics,2020

4. Sentiment, Emotion, Purpose, and Style in Electoral Tweets;SM Mohammad;Information Processing and Management,2015

5. Social Media and the Elections;PT Metaxas;Science,2012

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Enhancing Neural Machine Translation of Indigenous Languages through Gender Debiasing and Named Entity Recognition;2024 16th International Conference on Human System Interaction (HSI);2024-07-08

2. The Ethics of Automating Legal Actors;Transactions of the Association for Computational Linguistics;2024

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3