Fair Enough: Searching for Sufficient Measures of Fairness

Published: 2023-09-29 Issue: 6 Volume: 32 Page: 1-22
ISSN: 1049-331X
Container-title: ACM Transactions on Software Engineering and Methodology
Language: en
Short-container-title: ACM Trans. Softw. Eng. Methodol.

Author:

Majumder Suvodeep¹, Chakraborty Joymallya¹, Bai Gina R.¹, Stolee Kathryn T.¹, Menzies Tim¹

Affiliation:

1. North Carolina State University, USA

Abstract

Testing machine learning software for ethical bias has become a pressing current concern. In response, recent research has proposed a plethora of new fairness metrics, for example, the dozens of fairness metrics in the IBM AIF360 toolkit. This raises the question: How can any fairness tool satisfy such a diverse range of goals? While we cannot completely simplify the task of fairness testing, we can certainly reduce the problem. This article shows that many of those fairness metrics effectively measure the same thing. Based on experiments using seven real-world datasets, we find that (a) 26 classification metrics can be clustered into seven groups and (b) four dataset metrics can be clustered into three groups. Further, each reduced set may actually predict different things. Hence, it is no longer necessary (or even possible) to satisfy all fairness metrics. In summary, to simplify the fairness testing problem, we recommend the following steps: (1) determine what type of fairness is desirable (and we offer a handful of such types), then (2) look up those types in our clusters, and then (3) just test for one item per cluster. For the purpose of reproducibility, our scripts and data are available at https://github.com/Repoanonymous/Fairness_Metrics.
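The idea of grouping metrics that "measure the same thing" and then testing one item per group can be illustrated with a minimal sketch. This is not the authors' procedure: it assumes that metrics whose scores are strongly correlated across runs belong together, and it uses a simple correlation threshold with union-find rather than any particular clustering library. The metric names, the score values, and the 0.9 threshold are all hypothetical.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length lists (assumes neither is constant)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def cluster_metrics(scores, threshold=0.9):
    """Group metrics whose scores are strongly correlated across runs.

    scores: dict mapping metric name -> list of values, one per run.
    threshold: hypothetical cut-off on |correlation| for "same thing".
    Returns a sorted list of clusters, each a sorted list of metric names.
    """
    names = sorted(scores)
    parent = {name: name for name in names}

    def find(name):
        # Follow parent links to the cluster root, compressing the path.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    # Merge any two metrics whose absolute correlation meets the threshold.
    for a, b in combinations(names, 2):
        if abs(pearson(scores[a], scores[b])) >= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for name in names:
        clusters.setdefault(find(name), []).append(name)
    return sorted(clusters.values())


def representatives(clusters):
    """Pick one metric per cluster (here simply the first by name)."""
    return [cluster[0] for cluster in clusters]


# Made-up scores for three hypothetical metrics over five runs.
scores = {
    "metric_a": [0.10, 0.20, 0.30, 0.40, 0.50],
    "metric_b": [0.12, 0.21, 0.33, 0.41, 0.52],  # closely tracks metric_a
    "metric_c": [0.50, 0.10, 0.40, 0.20, 0.30],  # largely unrelated
}
clusters = cluster_metrics(scores)
reps = representatives(clusters)
```

With these made-up values, `metric_a` and `metric_b` fall into one group and `metric_c` stands alone, so only two metrics would need to be tested. Which metric represents a cluster is a design choice; the sketch simply takes the first name alphabetically.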

Funder

LAS and NSF

Publisher

Association for Computing Machinery (ACM)

Subject

Software

Link

https://dl.acm.org/doi/pdf/10.1145/3585006

References: 90 articles.

1. 1953. Stanford hlab. Retrieved from https://hlab.stanford.edu/brian/number_of_clusters_.html.

2. 1994. UCI:Adult Data Set. Retrieved from http://mlr.cs.umass.edu/ml/datasets/Adult.

3. 2000. UCI:Statlog (German Credit Data) Data Set. Retrieved from https://archive.ics.uci.edu/ml/datasets/Statlog+(German+Credit+Data).

4. 2001. UCI:Heart Disease Data Set. Retrieved from https://archive.ics.uci.edu/ml/datasets/Heart+Disease.

5. 2011. sklearn.cluster.AgglomerativeClustering. Retrieved from https://scikit-learn.org/stable/modules/generated/sklearn.cluster.AgglomerativeClustering.html.

Cited by 8 articles.

1. The problem of fairness in tools for algorithmic fairness;AI and Ethics;2024-07-30

2. Balancing Fairness: Unveiling the Potential of SMOTE-Driven Oversampling in AI Model Enhancement;2024 9th International Conference on Machine Learning Technologies (ICMLT);2024-05-24

3. Unfair Trojan: Targeted Backdoor Attacks Against Model Fairness;Springer Optimization and Its Applications;2024-05-10

4. Policy advice and best practices on bias and fairness in AI;Ethics and Information Technology;2024-04-29

5. Preparing for the bedside—optimizing a postpartum depression risk prediction model for clinical implementation in a health system;Journal of the American Medical Informatics Association;2024-03-26