Abstract
Outlier detection has attracted extensive attention in medical, financial, telecommunications and other fields. Although many related technologies have been proposed, most of them are faced with the problems of the neighborhood size of an object is difficult to determine and the distance in high-dimensional space is unreliable. To overcome these weaknesses, we propose a novel density-based outlier detection method that introduces the concept of Minimum the Sum of Edge Set and other related definitions in key attributes space. Based on the stability of Reverse Minimum the Sum of Edge Set, the proposed method can adaptively select the parameter representing the neighborhood size. In addition, some properties of the proposed local outlier factor are derived. Experiments on synthetic and real-world datasets demonstrate that our method is more effective than the existing outlier detection approaches.
Subject
Artificial Intelligence,Computer Vision and Pattern Recognition,Theoretical Computer Science
Reference27 articles.
1. M. Amer and M. Goldstein, Nearest-Neighbor and Clustering based Anomaly Detection Algorithms for RapidMiner, in: Proc. of the 3rd RapidMiner Community Meeting and Conference (RCOMM 2012), 2012, pp. 1–12.
2. F. Angiulli, S. Basta, S. Lodi and C. Sartori, Reducing distance computations for distance-based outliers, Expert Systems with Applications 147 (2020), 113215.
3. M.M. Breunig, H.P. Kriegel, R.T. Ng and J. Sander, LOF: Identifying Density-Based Local Outliers, in: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, 2000, pp. 93–104.
4. On the evaluation of unsupervised outlier detection: Measures, datasets, and an empirical study;Campos;Data Mining and Knowledge Discovery,2016
5. Machine learning in therapeutic research: The hard work of outlier detection in large data;Cleophas;American Journal of Therapeutics,2016
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献