The Disagreement Dilemma in Explainable AI: Can Bias Reduction Bridge the Gap-Reference-Cited by-同舟云学术

The Disagreement Dilemma in Explainable AI: Can Bias Reduction Bridge the Gap

Published:2024-07-19 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Bhardwaj Nitanshi¹,Parashar Gaurav²^ORCID

Affiliation:

1. SRM-RI: SRM Institute of Science and Technology (Deemed to be University) Research Kattankulathur

2. KIET Group of Institutions: Krishna Institute of Engineering & Technology

Abstract

Explainable AI (XAI) is an emerging field of research since the spread of AI in multifarious fields. The opacity and inherent black-box nature of the advanced machine learning models create a lack of transparency in them leading to the insufficiency in societal recognition. The increasing dependence on AI across diverse sectors has created the need for informed decision-making of the numerous predictive models used. XAI strives to close this divide by providing an explanation of the decision-making process, promoting trust, ensuring adherence to regulations, and cultivating societal approval. Various post-hoc techniques including well-known methods like LIME, SHAP, Integrated Gradients, Partial Dependence Plot, and Accumulated Local Effects have been proposed to decipher the intricacies of complex AI models. In the context of post hoc explanatory methods for machine learning models there arises a conflict known as the Disagreement problem where different explanation techniques provide differing interpretations of the same model. In this study, we aim to find whether reducing the bias in the dataset could lead to XAI explanations that do not disagree. The study thoroughly analyzes this problem, examining various widely recognized explanation methods.

Publisher

Springer Science and Business Media LLC

Reference75 articles.

1. Krishna, Satyapriya and Han, Tessa and Gu, Alex and Pombra, Javin and Jabbari, Shahin and Wu, Steven and Lakkaraju, Himabindu (2022) The disagreement problem in explainable machine learning: A practitioner's perspective. arXiv preprint arXiv:2202.01602

2. A. Tabrez (2019) Explanation-Based Reward Coaching to Improve Human Performance via Reinforcement Learning. ACM/IEEE International Conference on Human-Robot Interaction 2019 https://doi.org/10.1109/HRI.2019.8673104, https://api.elsevier.com/content/abstract/scopus_id/85064001723, 2167-2148, Conference Paper

3. K. Baum (2022) From Responsibility to Reason-Giving Explainable Artificial Intelligence. Philosophy and Technology 35(1) https://doi.org/10.1007/s13347-022-00510-w, https://api.elsevier.com/content/abstract/scopus_id/85125292638, 2210-5433, Article

4. A. Zytek (2022) Sibyl: Understanding and Addressing the Usability Challenges of Machine Learning in High-Stakes Decision Making. IEEE Transactions on Visualization and Computer Graphics 28(1) https://doi.org/10.1109/TVCG.2021.3114864, https://api.elsevier.com/content/abstract/scopus_id/85118642177, 1077-2626, Article

5. R. Nyrup (2022) Explanatory pragmatism: a context-sensitive framework for explainable medical AI. Ethics and Information Technology 24(1) https://doi.org/10.1007/s10676-022-09632-3, https://api.elsevier.com/content/abstract/scopus_id/85125618593, 1388-1957, Article