Affiliation:
1. Sun Yat-sen University, Guangzhou, China
Abstract
With the rapidly growing attention to multi-view data in recent years, multi-view outlier detection has become a rising field with intense research. These researches have made some success, but still exist some issues that need to be solved. First, many multi-view outlier detection methods can only handle datasets that conform to the cluster structure but are powerless for complex data distributions such as manifold structures. This overly restrictive data assumption limits the applicability of these methods. In addition, almost the majority of multi-view outlier detection algorithms cannot solve the online detection problem of multi-view outliers. To address these issues, we propose a new detection method based on the local similarity relation and data reconstruction, i.e., the Self-Representation Method with Local Similarity Preserving for fast multi-view outlier detection (SRLSP). By using the local similarity structure, the proposed method fully utilizes the characteristics of outliers and detects outliers with an applicable objective function. Besides, a well-designed optimization algorithm is proposed, which completes each iteration with linear time complexity and can calculate each instance parallelly. Also, the optimization algorithm can be easily extended to the online version, which is more suitable for practical production environments. Extensive experiments on both synthetic and real-world datasets demonstrate the superiority of the proposed method on both performance and time complexity.
Funder
Key-Area Research and Development Program of Guangdong Province
National Natural Science Foundation of China
Guangdong Basic and Applied Basic Research Foundation
Publisher
Association for Computing Machinery (ACM)
Reference58 articles.
1. Charu C. Aggarwal. 2015. Data Mining. Springer.
2. Mohiuddin Ahmed and Abdun Naser Mahmood. 2013. A novel approach for outlier detection and clustering improvement. In Proceedings of the IEEE Conference on Industrial Electronics and Applications (ICIEA). 577–582.
3. Emin Aleskerov, Bernd Freisleben, and Bharat Rao. 1997. CARDWATCH: A neural network based database mining system for credit card fraud detection. In Proceedings of the IEEE/IAFE Computational Intelligence for Financial Engineering (CIFEr). 220–226.
4. Distance-based detection and prediction of outliers;Angiulli Fabrizio;IEEE Transactions on Knowledge and Data Engineering (TKDE),2005
5. Irad Ben-Gal. 2005. Outlier detection. In Proceedings of the Data Mining and Knowledge Discovery Handbook. Springer, 131–146.
Cited by
14 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献