Author:
Banait Satish S.,SANE Dr. S. S.
Abstract
Data mining and big data analytics are approaches for analyzing data and extracting hidden information. Because big data is complicated and large in volume, traditional techniques to analysis and extraction do not function effectively. Data clustering is a common data mining approach that divides data into groups and makes it simple to extract information from them. Big data can include both organized and semi structured information, and it's becoming increasingly beneficial for companies. Examples include old organized database of inventory level, transactions, and consumer information, as well as non - structured comprehension from the internet, social media platforms, and embedded systems. Numerous schemes have been developed to reach the needed in relation to efficiency and effectiveness, and much study has been committed to Big Data analytics. Nevertheless, a few methodologies, such as clustering algorithms, require further research in regards to performance, usefulness, and other factors, leading to the development of a model which gives proper Big Data Analytics assessment and the impactful use of this methodology to retrieve relevant knowledge. We recorded and analyzed several big data sets in our proposed work, as well as discovered relevant current approaches. In this paper we proposed a new clustering technique using dimensionality reduction approach. For implementation of this work, we used real time streaming data in unstructured form and noisy sometimes. The proposed hybrid clustering techniques that improve the clustering accuracy as well as time for generate effectives clusters on large unstructured data. We confirm the findings by testing the suggested methodology on available information sets and comparing and analyzing the effectiveness of the developed system with that of current systems.
Publisher
Perpetual Innovation Media Pvt. Ltd.
Reference38 articles.
1. Ankita Saldhi, A. G. e. a. 2014. Big data analysis using hadoop cluster. IEEE.
2. Anuradha, G. and Roy, B. 2014. Suggested techniques for clustering and mining of data streams. International Conference on Circuits, Systems, Communication and Information Technology Applications. IEEE.
3. Arora, S. and Chana, I. 2014. A survey of clustering techniques for big data analysis. IEEE. pp.391–397.
4. Bin, N. 2018. Research on methods and techniques for iot big data cluster analysis. In Interna- tional Conference on Information Systems and Computer Aided Education. ICISCAE, pp. 51–60. IEEE.
5. Bina Kotiyal, A. K. 2020. Big data: Mining of log file through hadoop. International Con- ference on Circuits, Systems, Communication and Information Technology Applications. IEEE.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献