Abstract
In data mining, outlier detection is a major challenge as it has an important role in many applications such as medical data, image processing, fraud detection, intrusion detection, and so forth. An extensive variety of clustering based approaches have been developed to detect outliers. However they are by nature time consuming which restrict their utilization with real-time applications. Furthermore, outlier detection requests are handled one at a time, which means that each request is initiated individually with a particular set of parameters. In this paper, the first clustering based outlier detection framework, (On the Fly Clustering Based Outlier Detection (OFCOD)) is presented. OFCOD enables analysts to effectively find out outliers on time with request even within huge datasets. The proposed framework has been tested and evaluated using two real world datasets with different features and applications; one with 699 records, and another with five millions records. The experimental results show that the performance of the proposed framework outperforms other existing approaches while considering several evaluation metrics.
Subject
Information Systems and Management,Computer Science Applications,Information Systems
Reference40 articles.
1. Outlier Detection Using Replicator Neural Networks;Simon,2002
2. Automatic Growth Detection of Cell Cultures through Outlier Techniques using 2D Images
3. Data Mining: Concepts and Techniques;Han,2012
4. LOF: Identifying Density-based Local Outliers;Markus;SIGMOD Rec.,2000
Cited by
16 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Adaptive threshold based outlier detection on IoT sensor data: A node-level perspective;Alexandria Engineering Journal;2024-11
2. Customs valuation assessment using cluster-based approach;International Journal of Information Technology;2024-04-05
3. SA-O2DCA: Seasonal Adapted Online Outlier Detection and Classification Approach for WSN;Journal of Network and Systems Management;2024-03-04
4. LocFree;Proceedings of the 2nd ACM SIGSPATIAL International Workshop on Spatial Big Data and AI for Industrial Applications;2023-11-13
5. Detecting outliers from pairwise proximities: Proximity isolation forests;Pattern Recognition;2023-06