Ethical Adversaries-Reference-Cited by-同舟云学术

Ethical Adversaries

Published:2021-05-26 Issue:1 Volume:23 Page:32-41
ISSN:1931-0145
Container-title:ACM SIGKDD Explorations Newsletter
language:en
Short-container-title:SIGKDD Explor. Newsl.

Author:

Delobelle Pieter¹,Temple Paul²,Perrouin Gilles²,Frénay Benoit²,Heymans Patrick²,Berendt Bettina¹

Affiliation:

1. KU Leuven, Leuven.AI, Leuven, Belgium

2. University of Namur, Namur, Belgium

Abstract

Machine learning is being integrated into a growing number of critical systems with far-reaching impacts on society. Unexpected behaviour and unfair decision processes are coming under increasing scrutiny due to this widespread use and its theoretical considerations. Individuals, as well as organisations, notice, test, and criticize unfair results to hold model designers and deployers accountable. We offer a framework that assists these groups in mitigating unfair representations stemming from the training datasets. Our framework relies on two inter-operating adversaries to improve fairness. First, a model is trained with the goal of preventing the guessing of protected attributes' values while limiting utility losses. This first step optimizes the model's parameters for fairness. Second, the framework leverages evasion attacks from adversarial machine learning to generate new examples that will be misclassified. These new examples are then used to retrain and improve the model in the first step. These two steps are iteratively applied until a significant improvement in fairness is obtained. We evaluated our framework on well-studied datasets in the fairness literature - including COMPAS - where it can surpass other approaches concerning demographic parity, equality of opportunity and also the model's utility. We investigated the trade-offs between these targets in terms of model hyperparameters and also illustrated our findings on the subtle difficulties when mitigating unfairness and highlight how our framework can assist model designers.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3468507.3468513

Reference44 articles.

1. Poisoning attacks to compromise face templates

2. Wild patterns: Ten years after the rise of adversarial machine learning

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. AutoRIC: Automated Neural Network Repairing Based on Constrained Optimization;ACM Transactions on Software Engineering and Methodology;2024-09-04

2. Performance and biases of Large Language Models in public opinion simulation;Humanities and Social Sciences Communications;2024-08-28

3. FairCare: Adversarial training of a heterogeneous graph neural network with attention mechanism to learn fair representations of electronic health records;Information Processing & Management;2024-05

4. Information-Minimizing Generative Adversarial Network for Fair Generation and Classification;Neural Processing Letters;2024-02-15

5. RUNNER: Responsible UNfair NEuron Repair for Enhancing Deep Neural Network Fairness;Proceedings of the IEEE/ACM 46th International Conference on Software Engineering;2024-02-06