Unsupervised Anomaly Detection Based on Deep Autoencoding and Clustering-Reference-Cited by-同舟云学术

Unsupervised Anomaly Detection Based on Deep Autoencoding and Clustering

Published:2021-10-13 Issue: Volume:2021 Page:1-8
ISSN:1939-0122
Container-title:Security and Communication Networks
language:en
Short-container-title:Security and Communication Networks

Author:

Zhang Chuanlei¹^ORCID,Liu Jiangtao¹,Chen Wei²³^ORCID,Shi Jinyuan¹,Yao Minda¹^ORCID,Yan Xiaoning⁴,Xu Nenghua⁴,Chen Dufeng⁵

Affiliation:

1. College of Artificial Intelligence, Tianjin University of Science and Technology, Tianjin 300457, China

2. School of Mechanical Electronic and Information Engineering, China University of Mining and Technology (Beijing), Beijing 100083, China

3. School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, China

4. Softsz Co.,Ltd., Shenzhen 518131, China

5. Beijing Geotechnical and Investigation Engineering Institute, Beijing 100080, China

Abstract

The unsupervised anomaly detection task based on high-dimensional or multidimensional data occupies a very important position in the field of machine learning and industrial applications; especially in the aspect of network security, the anomaly detection of network data is particularly important. The key to anomaly detection is density estimation. Although the methods of dimension reduction and density estimation have made great progress in recent years, most dimension reduction methods are difficult to retain the key information of original data or multidimensional data. Recent studies have shown that the deep autoencoder (DAE) can solve this problem well. In order to improve the performance of unsupervised anomaly detection, we propose an anomaly detection scheme based on a deep autoencoder (DAE) and clustering methods. The deep autoencoder is trained to learn the compressed representation of the input data and then feed it to clustering approach. This scheme makes full use of the advantages of the deep autoencoder (DAE) to generate low-dimensional representation and reconstruction errors for the input high-dimensional or multidimensional data and uses them to reconstruct the input samples. The proposed scheme could eliminate redundant information contained in the data, improve performance of clustering methods in identifying abnormal samples, and reduce the amount of calculation. To verify the effectiveness of the proposed scheme, massive comparison experiments have been conducted with traditional dimension reduction algorithms and clustering methods. The results of experiments demonstrate that, in most cases, the proposed scheme outperforms the traditional dimension reduction algorithms with different clustering methods.

Funder

Tianjin Municipal Science and Technology Bureau

Publisher

Hindawi Limited

Subject

Computer Networks and Communications,Information Systems

Link

http://downloads.hindawi.com/journals/scn/2021/7389943.pdf

Reference31 articles.

1. A data mining approach for fault diagnosis: An application of anomaly detection algorithm

2. A fast MPPT-based anomaly detection and accurate fault diagnosis technique for PV arrays

3. Comparison of unsupervised anomaly detection methods for systems health management using space shuttle;R. A. Martin