Learning Fair Representations via Rate-Distortion Maximization-Reference-Cited by-同舟云学术

Learning Fair Representations via Rate-Distortion Maximization

Published:2022 Issue: Volume:10 Page:1159-1174
ISSN:2307-387X
Container-title:Transactions of the Association for Computational Linguistics
language:en
Short-container-title:

Author:

Chowdhury Somnath Basu Roy¹,Chaturvedi Snigdha²

Affiliation:

1. UNC Chapel Hill, USA somnath@cs.unc.edu

2. UNC Chapel Hill, USA snigdha@cs.unc.edu

Abstract

Abstract Text representations learned by machine learning models often encode undesirable demographic information of the user. Predictive models based on these representations can rely on such information, resulting in biased decisions. We present a novel debiasing technique, Fairness-aware Rate Maximization (FaRM), that removes protected information by making representations of instances belonging to the same protected attribute class uncorrelated, using the rate-distortion function. FaRM is able to debias representations with or without a target task at hand. FaRM can also be adapted to remove information about multiple protected attributes simultaneously. Empirical evaluations show that FaRM achieves state-of-the-art performance on several datasets, and learned representations leak significantly less protected attribute information against an attack by a non-linear probing network.

Publisher

MIT Press

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Human-Computer Interaction,Communication

Link

https://direct.mit.edu/tacl/article-pdf/doi/10.1162/tacl_a_00512/2054697/tacl_a_00512.pdf

Reference49 articles.

1. A study on similarity and relatedness using distributional and WordNet-based approaches;Agirre,2009

2. Layer normalization;Ba;arXiv preprint arXiv:1607.06450,2016

3. Adversarial removal of demographic attributes revisited;Barrett,2019

4. Predictive models of student college commitment decisions using machine learning;Basu;Data,2019

5. Adversarial scrubbing of demographic information for text classification;Roy Chowdhury,2021

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Separability Measure Supervised Network for Radar Target Recognition;Journal of Physics: Conference Series;2023-11-01