An eXplainable Artificial Intelligence Methodology on Big Data Architecture

Published: 2024-04-11 Issue: 5 Volume: 16 Page: 2642-2659
ISSN: 1866-9956
Container-title: Cognitive Computation
Language: en
Short-container-title: Cogn Comput

Author:

La Gatta Valerio, Moscato Vincenzo, Postiglione Marco, Sperlì Giancarlo

Abstract

Although artificial intelligence has become part of everyday life, a crisis of trust in such systems is emerging, increasing the need to explain black-box predictions, especially in the military, medical, and financial domains. Modern eXplainable Artificial Intelligence (XAI) techniques focus on benchmark datasets, but the cognitive applicability of such solutions in big data settings remains unclear because of memory and computation constraints. In this paper, we extend a model-agnostic XAI methodology, Cluster-Aided Space Transformation for Local Explanation (CASTLE), to handle high-volume datasets. CASTLE aims to explain the black-box behavior of predictive models by combining local information (i.e., based on the input sample) with global information (i.e., based on the whole scope for action of the model). In particular, the local explanation provides a rule-based explanation for the prediction of a target instance, as well as the directions along which the likelihood of the predicted class can be updated. Our extension leverages modern big data technologies (e.g., Apache Spark) to handle the high volume, variety, and velocity of huge datasets. We evaluated the framework on five datasets in terms of temporal efficiency, explanation quality, and model significance. Our results indicate that the proposed approach retains the high-quality explanations associated with CASTLE while efficiently handling large datasets. Importantly, it exhibits a sub-linear, rather than exponential, dependence on dataset size, making it a scalable solution for massive datasets and other big data scenarios.

Funder

Università degli Studi di Napoli Federico II

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s12559-024-10272-6.pdf
