Evaluating the Quality of Machine Learning Explanations: A Survey on Methods and Metrics-Reference-Cited by-同舟云学术

Evaluating the Quality of Machine Learning Explanations: A Survey on Methods and Metrics

Published:2021-03-04 Issue:5 Volume:10 Page:593
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Zhou Jianlong^ORCID,Gandomi Amir H.^ORCID,Chen Fang,Holzinger Andreas

Abstract

The most successful Machine Learning (ML) systems remain complex black boxes to end-users, and even experts are often unable to understand the rationale behind their decisions. The lack of transparency of such systems can have severe consequences or poor uses of limited valuable resources in medical diagnosis, financial decision-making, and in other high-stake domains. Therefore, the issue of ML explanation has experienced a surge in interest from the research community to application domains. While numerous explanation methods have been explored, there is a need for evaluations to quantify the quality of explanation methods to determine whether and to what extent the offered explainability achieves the defined objective, and compare available explanation methods and suggest the best explanation from the comparison for a specific task. This survey paper presents a comprehensive overview of methods proposed in the current literature for the evaluation of ML explanations. We identify properties of explainability from the review of definitions of explainability. The identified properties of explainability are used as objectives that evaluation metrics should achieve. The survey found that the quantitative metrics for both model-based and example-based explanations are primarily used to evaluate the parsimony/simplicity of interpretability, while the quantitative metrics for attribution-based explanations are primarily used to evaluate the soundness of fidelity of explainability. The survey also demonstrated that subjective measures, such as trust and confidence, have been embraced as the focal point for the human-centered evaluation of explainable systems. The paper concludes that the evaluation of ML explanations is a multidisciplinary research topic. It is also not possible to define an implementation of evaluation metrics, which can be applied to all explanation methods.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/10/5/593/pdf

Reference93 articles.

1. How AI can be a force for good

2. Human and Machine Learning: Visible, Explainable, Trustworthy and Transparent,2018

3. An Artificial Intelligence Approach to Predict Gross Primary Productivity in the Forests of South Korea Using Satellite Remote Sensing Data

4. AHNG: Representation learning on attributed heterogeneous network

Cited by 273 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Predictive modeling of patulin accumulation in apple lesions infected by Penicillium expansum using machine learning;Postharvest Biology and Technology;2024-11

2. Sentiment Analysis on E-Commerce Product Reviews Using Machine Learning and Deep Learning Algorithms: A Bibliometric Analysis, Systematic Literature Review, Challenges and Future Works;International Journal of Information Management Data Insights;2024-11

3. Evaluating the necessity of the multiple metrics for assessing explainable AI: A critical examination;Neurocomputing;2024-10

4. Optimizing Lung Condition Categorization through a Deep Learning Approach to Chest X-ray Image Analysis;BioMedInformatics;2024-09-10

5. Combination of Multiple Variables and Machine Learning for Regional Cropland Water and Carbon Fluxes Estimation: A Case Study in the Haihe River Basin;Remote Sensing;2024-09-04