Application of Multivariate-Rank-Based Techniques in Clustering of Big Data-Reference-Cited by-同舟云学术

Application of Multivariate-Rank-Based Techniques in Clustering of Big Data

Published:2018-12 Issue:4 Volume:43 Page:179-190
ISSN:0256-0909
Container-title:Vikalpa: The Journal for Decision Makers
language:en
Short-container-title:Vikalpa

Author:

Guha Pritha¹

Affiliation:

1. Pritha Guha is an Assistant Professor in the Institute of Management, Nirma University. She received her PhD in statistics from School of Mathematics, University of Birmingham, UK. She also received her MSc (by research) in statistics from the Department of Statistics and Applied Probability, National University of Singapore, and MSc in mathematics from IIT Kanpur. Her current research interest includes multivariate statistics and clustering of big data.

Abstract

Executive Summary Very large or complex data sets, which are difficult to process or analyse using traditional data handling techniques, are usually referred to as big data. The idea of big data is characterized by the three ‘v’s which are volume, velocity, and variety ( Liu, McGree, Ge, & Xie, 2015 ) referring respectively to the volume of data, the velocity at which the data are processed and the wide varieties in which big data are available. Every single day, different sectors such as credit risk management, healthcare, media, retail, retail banking, climate prediction, DNA analysis and, sports generate petabytes of data (1 petabyte = 250 bytes). Even basic handling of big data, therefore, poses significant challenges, one of them being organizing the data in such a way that it can give better insights into analysing and decision-making. With the explosion of data in our life, it has become very important to use statistical tools to analyse them.

Publisher

SAGE Publications

Subject

General Business, Management and Accounting,General Decision Sciences

Link

http://journals.sagepub.com/doi/pdf/10.1177/0256090918804385

Reference38 articles.

1. Cluster Analysis

2. On a Geometric Notion of Quantiles for Multivariate Data

3. A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. What Drives Petrol Price Dispersion across Australian Cities?;Energies;2022-08-19

2. Identification of Single Spectral Lines in Large Spectroscopic Surveys Using UMLAUT: an Unsupervised Machine-learning Algorithm Based on Unbiased Topology;The Astrophysical Journal Supplement Series;2021-12-01

3. Application Analysis of Artificial Intelligence Technology in Computer Information Security;Journal of Physics: Conference Series;2021-02-01

4. Application Research of Power Big Data Decision Based on Artificial Intelligence;Advances in Intelligent Systems and Computing;2021

5. Big Data and Artificial Intelligence to Support Risk Management: A Systematic Literature Review;SIDREA Series in Accounting and Business Administration;2021