Performance Enhancement of Outlier Removal Using Extreme Value Analysis-Based Mahalonobis Distance

Published: 2020 Page: 240-252
ISSN:2327-1981
Container-title:Handling Priority Inversion in Time-Constrained Distributed Databases

Author:

A Joy Christy¹, A Umamakeswari¹

Affiliation:

1. School of Computing, SASTRA University (Deemed), India

Abstract

Outlier detection is a part of data analytics that helps users find discrepancies in working machines by applying an outlier detection algorithm to data captured at fixed intervals. An outlier is a data point that exhibits different properties from other points because of external or internal forces. Outliers can be detected by clustering the data points, so an optimal clustering is important. Identifying groups or clusters of data within a population or sample is a problem that arises frequently in statistics. The most widely used procedure for identifying clusters in a set of observations is k-means with Euclidean distance. However, Euclidean distance is not well suited to finding anomalies in multivariate space. This chapter uses the k-means algorithm with the Mahalanobis distance metric to capture the variance structure of the clusters, and then applies an extreme value analysis (EVA) algorithm to detect outliers: rare items, events, or observations that differ suspiciously from the majority of the data.
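The two-stage idea in the abstract can be illustrated with a minimal sketch. It is not the chapter's implementation: the function names, the `init`, `n_iter`, and `reg` parameters, and the median-plus-MAD tail rule used in place of the chapter's EVA algorithm are all assumptions made for illustration. The sketch assigns points to clusters by Mahalanobis distance, re-estimates each cluster's mean and covariance, and then flags points whose distance to their own cluster lies in the upper tail.

```python
import numpy as np


def mahalanobis_sq(X, mean, cov_inv):
    """Squared Mahalanobis distance of each row of X from `mean`."""
    diff = X - mean
    return np.einsum("ij,jk,ik->i", diff, cov_inv, diff)


def mahalanobis_kmeans(X, k, init=None, n_iter=20, reg=1e-6, seed=0):
    """k-means-style clustering with a per-cluster Mahalanobis distance.

    `init` (optional initial centers) and `reg` (ridge term that keeps the
    covariance invertible) are illustrative parameters, not from the chapter.
    Returns cluster labels and each point's distance to its own cluster.
    """
    X = np.asarray(X, dtype=float)
    n, d = X.shape
    if init is None:
        rng = np.random.default_rng(seed)
        centers = X[rng.choice(n, size=k, replace=False)].copy()
    else:
        centers = np.array(init, dtype=float)
    # Identity inverse covariance makes the first pass plain Euclidean.
    cov_invs = np.stack([np.eye(d)] * k)

    def all_dists():
        return np.stack(
            [mahalanobis_sq(X, centers[j], cov_invs[j]) for j in range(k)],
            axis=1,
        )

    for _ in range(n_iter):
        labels = all_dists().argmin(axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members) > d:  # enough points to estimate a covariance
                centers[j] = members.mean(axis=0)
                cov = np.cov(members, rowvar=False) + reg * np.eye(d)
                cov_invs[j] = np.linalg.inv(cov)

    dists = all_dists()
    labels = dists.argmin(axis=1)
    own = np.maximum(dists[np.arange(n), labels], 0.0)
    return labels, np.sqrt(own)


def flag_extremes(dist, n_mads=3.0):
    """Flag upper-tail distances with a median + scaled-MAD rule.

    This simple tail rule is a stand-in (assumption) for the EVA step.
    """
    med = np.median(dist)
    mad = np.median(np.abs(dist - med))
    return dist > med + n_mads * 1.4826 * mad
```

A typical call would be `labels, dist = mahalanobis_kmeans(X, k=2)` followed by `flag_extremes(dist)`. Because the covariance is estimated per cluster, elongated or correlated clusters are measured in their own scale, which is the property the abstract attributes to the Mahalanobis metric.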

Publisher

IGI Global

References: 18 articles.

1. A novel approach for outlier detection and clustering improvement

2. Effect of Outlier Detection on Clustering Accuracy and Computation Time of CHB K-Means Algorithm

3. Data Clustering in C++

4. k-means clustering with outlier removal