Author:
Zuo Zheng,Li Ziqiang,Cheng Pengsen,Zhao Jian
Abstract
AbstractSubspace outlier detection has emerged as a practical approach for outlier detection. Classical full space outlier detection methods become ineffective in high dimensional data due to the “curse of dimensionality”. Subspace outlier detection methods have great potential to overcome the problem. However, the challenge becomes how to determine which subspaces to be used for outlier detection among a huge number of all subspaces. In this paper, firstly, we propose an intuitive definition of outliers in subspaces. We study the desirable properties of subspaces for outlier detection and investigate the metrics for those properties. Then, a novel subspace outlier detection algorithm with a statistical foundation is proposed. Our method selectively leverages a limited set of the most interesting subspaces for outlier detection. Through experimental validation, we demonstrate that identifying outliers within this reduced set of highly interesting subspaces yields significantly higher accuracy compared to analyzing the entire feature space. We show by experiments that the proposed method outperforms competing subspace outlier detection approaches on real world data sets.
Publisher
Springer Science and Business Media LLC
Reference39 articles.
1. Fawcett, T. & Provost, F. Adaptive fraud detection. Data Min. Knowl. Discov. 1(3), 291–316. https://doi.org/10.1023/A/3A1009700419189 (1997).
2. Mazel, J., Casas, P., Fontugne, R., Fukuda, K. & Owezarski, P. Hunting attacks in the dark: clustering and correlation analysis for unsupervised anomaly detection. Int. J. Netw. Manag. 25(5), 283–305. https://doi.org/10.1002/nem.1903/abstract (2015).
3. Podgorelec, V., Hericko, M. and Rozman, I. Improving mining of medical data by outliers prediction. In 18th IEEE Symposium on Computer-Based Medical Systems, 2005. Proceedings, pp. 91–96 (2005).
4. Hawkins, D. M. “Introduction,” in Identification of Outliers, ser. Monographs on Applied Probability and Statistics (Springer Netherlands, 1980), pp. 1–12. https://doi.org/10.1007/978-94-015-3994-41
5. Barnett, V. & Lewis, T. Outliers in Statistical Data Vol. 3 (Wiley, 1994).
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献