Author:
Catania Barbara, Guerrini Giovanna, Accinelli Chiara
Abstract
The data science era is characterized by data-driven automated decision systems (ADS) enabling, through data analytics and machine learning, automated decisions in many contexts, deeply impacting our lives. As such, their downsides and potential risks are becoming more and more evident: technical solutions, alone, are not sufficient and an interdisciplinary approach is needed. Consequently, ADS should evolve into data-informed ADS, which take humans in the loop in all the data processing steps. Data-informed ADS should deal with data responsibly, guaranteeing nondiscrimination with respect to protected groups of individuals. Nondiscrimination can be characterized in terms of different types of properties, like fairness and diversity. While fairness, i.e., absence of bias against minorities, has been widely investigated in machine learning, only more recently this issue has been tackled by considering all the steps of data processing pipelines at the basis of ADS, from data acquisition to analysis. Additionally, fairness is just one point of view of nondiscrimination to be considered for guaranteeing equity: other issues, like diversity, are raising interest from the scientific community due to their relevance in society. This paper aims at critically surveying how nondiscrimination has been investigated in the context of complex data science pipelines at the basis of data-informed ADS, by focusing on the specific data processing tasks for which nondiscrimination solutions have been proposed.
Publisher
Springer Science and Business Media LLC
Subject
Artificial Intelligence, Human-Computer Interaction, Philosophy
Cited by
7 articles.