Author:
Kuo Chia-Tung,Davidson Ian
Abstract
Outlier detection has been studied extensively and employed in diverse applications in the past decades. In this paper we formulate a related yet understudied problem which we call outlier description. This problem often arises in practice when we have a small number of data instances that had been identified to be outliers and we wish to explain why they are outliers. We propose a framework based on constraint programming to find an optimal subset of features that most differentiates the outliers and normal instances. We further demonstrate the framework offers great flexibility in incorporating diverse scenarios arising in practice such as multiple explanations and human in the loop extensions. We empirically evaluate our proposed framework on real datasets, including medical imaging and text corpus, and demonstrate how the results are useful and interpretable in these domains.
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Outlier Interpretation Using Regularized Auto Encoders and Genetic Algorithm;2024 IEEE Congress on Evolutionary Computation (CEC);2024-06-30
2. A Survey on Explainable Anomaly Detection;ACM Transactions on Knowledge Discovery from Data;2023-09-06
3. Anomaly Detection in Medical Imaging - A Mini Review;Data Science – Analytics and Applications;2022