CHIRPS: Explaining random forest classification

Published:2020-06-04 Issue:8 Volume:53 Page:5747-5788
ISSN:0269-2821
Container-title:Artificial Intelligence Review
language:en
Short-container-title:Artif Intell Rev

Author:

Hatwell Julian, Gaber Mohamed Medhat, Azad R. Muhammad Atif

Abstract

Modern machine learning methods typically produce “black box” models that are opaque to interpretation. Yet demand for them has been increasing in Human-in-the-Loop processes, that is, processes that require a human agent to verify, approve or reason about automated decisions before they can be applied. To facilitate this interpretation, we propose Collection of High Importance Random Path Snippets (CHIRPS), a novel algorithm for explaining random forest classification per data instance. CHIRPS extracts a decision path from each tree in the forest that contributes to the majority classification, and then uses frequent pattern mining to identify the most commonly occurring split conditions. A simple, conjunctive-form rule is then constructed whose antecedent terms are derived from the attributes that had the most influence on the classification. This rule is returned alongside estimates of its precision and coverage on the training data, together with counter-factual details. An experimental study involving nine data sets shows that classification rules returned by CHIRPS have a precision at least as high as the state of the art when evaluated on unseen data (0.91–0.99) and offer much greater coverage (0.04–0.54). Furthermore, CHIRPS uniquely controls against under- and over-fitting solutions by maximising novel objective functions that are better suited to the local (per instance) explanation setting.
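The pipeline described in the abstract (collect paths from the trees that voted with the majority, count the most frequent split conditions, form a conjunctive rule, then estimate its precision and coverage) can be illustrated with a minimal sketch. This is not the authors' implementation: the data layout (a condition as a `(feature_index, operator, threshold)` tuple), the function names, the `min_support` cut-off and the toy data are all assumptions made for illustration. Conditions are also matched exactly, and the objective functions that the abstract says CHIRPS maximises are not modelled here.

```python
from collections import Counter

# Hypothetical representation: a split condition is (feature_index, op, threshold),
# with op either "<=" or ">". A path is the list of conditions one tree applied.

def satisfies(x, cond):
    """Return True if instance x meets a single split condition."""
    feat, op, thr = cond
    return x[feat] <= thr if op == "<=" else x[feat] > thr

def frequent_conditions(paths, min_support):
    """Keep conditions that appear in at least min_support (a fraction) of the paths."""
    counts = Counter(cond for path in paths for cond in set(path))
    n = len(paths)
    return [(cond, k / n) for cond, k in counts.most_common() if k / n >= min_support]

def precision_coverage(rule, X, y, target):
    """Estimate precision and coverage of a conjunctive rule on labelled data."""
    covered = [label for x, label in zip(X, y)
               if all(satisfies(x, c) for c in rule)]
    coverage = len(covered) / len(X)
    precision = covered.count(target) / len(covered) if covered else 0.0
    return precision, coverage

# Toy training data (assumed): two numeric features, classes "a" and "b".
X = [(1.0, 5.0), (2.0, 6.0), (3.0, 1.0), (4.0, 2.0), (1.5, 0.5), (3.5, 7.0)]
y = ["a", "a", "b", "b", "a", "b"]

# Paths taken by the instance (1.0, 5.0) through four trees that voted "a" (assumed).
paths = [
    [(0, "<=", 2.5), (1, ">", 3.0)],
    [(0, "<=", 2.5)],
    [(0, "<=", 2.5), (1, ">", 3.0)],
    [(1, ">", 3.0), (0, "<=", 3.2)],
]

# Conjunctive rule: every condition found in at least half of the paths.
rule = [cond for cond, _ in frequent_conditions(paths, min_support=0.5)]
precision, coverage = precision_coverage(rule, X, y, target="a")
print(rule, precision, coverage)
```

In this toy case the rule covers two of the six training instances, both of class "a". A real forest rarely repeats identical thresholds across trees, so a practical version would need some way of grouping nearby thresholds before counting.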

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence, Linguistics and Language, Language and Linguistics

Link

http://link.springer.com/content/pdf/10.1007/s10462-020-09833-6.pdf

Reference 69 articles.

1. Adnan MN, Islam MZ (2017) ForEx++: a new framework for knowledge discovery from decision forests. Australas J Inf Syst 21

2. Agrawal R, Srikant R et al (1994) Fast algorithms for mining association rules. In: Proceedings of 20th international conference very large data bases, VLDB, vol 1215, pp 487–99

3. Andrews R, Diederich J, Tickle AB (1995) Survey and critique of techniques for extracting rules from trained artificial neural networks. Knowl-Based Syst 8(6):373–389

4. Biau G (2012) Analysis of a random forests model. J Mach Learn Res 13:1063–1095

5. Bibal A, Frenay B (2016) Interpretability of machine learning models and representations: an introduction. In: Michel V (ed) ESANN, computational intelligence and machine learning. Bruges, Belgium, pp 77–82

Cited by 59 articles.

1. Research on the prediction of breakdown voltage of transformer oil based on multi-frequency ultrasound and GWO-RF algorithm;Measurement;2025-01

2. Identifying momentary suicidal ideation using machine learning in patients at high-risk for suicide;Journal of Affective Disorders;2024-11

3. A systematic review of the application of remote sensing technologies in mapping forest insect pests and diseases at a tree-level;Remote Sensing Applications: Society and Environment;2024-11

4. Sampling and active learning methods for network reliability estimation using K-terminal spanning tree;Reliability Engineering & System Safety;2024-10

5. Using artificial intelligence to rapidly identify microplastics pollution and predict microplastics environmental behaviors;Journal of Hazardous Materials;2024-08