iFlipper: Label Flipping for Individual Fairness-Reference-Cited by-同舟云学术

iFlipper: Label Flipping for Individual Fairness

Published:2023-05-26 Issue:1 Volume:1 Page:1-26
ISSN:2836-6573
Container-title:Proceedings of the ACM on Management of Data
language:en
Short-container-title:Proc. ACM Manag. Data

Author:

Zhang Hantian¹^ORCID,Tae Ki Hyun²^ORCID,Park Jaeyoung²^ORCID,Chu Xu¹^ORCID,Whang Steven Euijong²^ORCID

Affiliation:

1. Georgia Institute of Technology, Atlanta, GA, USA

2. KAIST, Daejeon, South Korea

Abstract

As machine learning becomes prevalent, mitigating any unfairness present in the training data becomes critical. Among the various notions of fairness, this paper focuses on the well-known individual fairness, which states that similar individuals should be treated similarly. While individual fairness can be improved when training a model (in-processing), we contend that fixing the data before model training (pre-processing) is a more fundamental solution. In particular, we show that label flipping is an effective pre-processing technique for improving individual fairness. Our system iFlipper solves the optimization problem of minimally flipping labels given a limit to the individual fairness violations, where a violation occurs when two similar examples in the training data have different labels. We first prove that the problem is NP-hard. We then propose an approximate linear programming algorithm and provide theoretical guarantees on how close its result is to the optimal solution in terms of the number of label flips. We also propose techniques for making the linear programming solution more optimal without exceeding the violations limit. Experiments on real datasets show that iFlipper significantly outperforms other pre-processing baselines in terms of individual fairness and accuracy on unseen test sets. In addition, iFlipper can be combined with in-processing techniques for even better results.

Funder

National Research Foundation of Korea

Google Research Award

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3588688

Reference50 articles.

1. Alekh Agarwal , Alina Beygelzimer , Miroslav Dud'ik , John Langford , and Hanna Wallach . 2018. A reductions approach to fair classification. arXiv preprint arXiv:1803.02453 ( 2018 ). Alekh Agarwal, Alina Beygelzimer, Miroslav Dud'ik, John Langford, and Hanna Wallach. 2018. A reductions approach to fair classification. arXiv preprint arXiv:1803.02453 (2018).

2. Alexandr Andoni Piotr Indyk Thijs Laarhoven Ilya Razenshteyn and Ludwig Schmidt. 2015. Practical and Optimal LSH for Angular Distance. In NeurIPS. 1225--1233. Alexandr Andoni Piotr Indyk Thijs Laarhoven Ilya Razenshteyn and Ludwig Schmidt. 2015. Practical and Optimal LSH for Angular Distance. In NeurIPS. 1225--1233.

3. J. Angwin J. Larson S. Mattu and L. Kirchner. 2016. Machine bias: There's software used across the country to predict future criminals. And its biased against blacks. J. Angwin J. Larson S. Mattu and L. Kirchner. 2016. Machine bias: There's software used across the country to predict future criminals. And its biased against blacks.

4. Big data's disparate impact;Barocas Solon;Calif. L. Rev.,2016

5. Rachel K. E. Bellamy , Kuntal Dey , Michael Hind , Samuel C. Hoffman , Stephanie Houde , Kalapriya Kannan , Pranay Lohia , Jacquelyn Martino , Sameep Mehta , Aleksandra Mojsilovic , Seema Nagar , Karthikeyan Natesan Ramamurthy , John T. Richards, Diptikalyan Saha, Prasanna Sattigeri, Moninder Singh, Kush R. Varshney, and Yunfeng Zhang. 2018 . AI Fairness 360: An Extensible Toolkit for Detecting, Understanding, and Mitigating Unwanted Algorithmic Bias. CoRR , Vol. abs/ 1810 .01943 (2018). arxiv: 1810.01943 Rachel K. E. Bellamy, Kuntal Dey, Michael Hind, Samuel C. Hoffman, Stephanie Houde, Kalapriya Kannan, Pranay Lohia, Jacquelyn Martino, Sameep Mehta, Aleksandra Mojsilovic, Seema Nagar, Karthikeyan Natesan Ramamurthy, John T. Richards, Diptikalyan Saha, Prasanna Sattigeri, Moninder Singh, Kush R. Varshney, and Yunfeng Zhang. 2018. AI Fairness 360: An Extensible Toolkit for Detecting, Understanding, and Mitigating Unwanted Algorithmic Bias. CoRR, Vol. abs/1810.01943 (2018). arxiv: 1810.01943