An Efficient and Unique TF/IDF Algorithmic Model-Based Data Analysis for Handling Applications with Big Data Streaming

Published: 2019-11-11 Issue: 11 Volume: 8 Page: 1331
ISSN: 2079-9292
Container-title: Electronics
Language: en
Short-container-title: Electronics

Author:

Iwendi Celestine, Ponnan Suresh, Munirathinam Revathi, Srinivasan Kathiravan, Chang Chuan-Yu

Abstract

As the field of data science grows, document analytics has become a more challenging task for rough classification, response analysis, and text summarization. These tasks are used to analyze text data from various intelligent sensing systems. Conventional approaches to data analytics and text processing are not well suited to the big data coming from intelligent systems. This work proposes a novel TF/IDF algorithm combined with the temporal Louvain approach to address this problem. The approach is intended to help categorize documents into hierarchical structures that show the relationships between variables, which helps analysts make essential decisions. The paper uses public corpora, such as Reuters-21578 and 20 Newsgroups, for massive-data analytic experimentation. The results show the efficacy of the proposed algorithm in terms of accuracy and execution time across six datasets, where it outperforms the state-of-the-art approach. The proposed approach is thereby validated as bringing value to big text data analysis. Big data handling with map-reduce has led to tremendous growth in, and support for, tasks like categorization and sentiment analysis, and to higher accuracy from the input data.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/8/11/1331/pdf

References: 39 articles.

1. Distributed document clustering analysis based on a hybrid method

2. Optimization for Speculative Execution in Big Data Processing Clusters

3. A Hybrid Approach to Clustering in Big Data

4. Empirical analysis and modeling of the activity dilemmas in big social networks; Xi; IEEE Access, 2016

5. Clustering big spatiotemporal-interval data; Wei; IEEE Trans. Big Data, 2016

Cited by 42 articles.

1. An efficient architecture for processing real-time traffic data streams using apache flink; Multimedia Tools and Applications; 2023-09-30

2. Intelligent analysis system of college students' employment and entrepreneurship situation: Big data and artificial intelligence-driven approach; Computers and Electrical Engineering; 2023-09

3. SAR-BSO meta-heuristic hybridization for feature selection and classification using DBN over stream data; Artificial Intelligence Review; 2023-05-04

4. Combining Computer Vision and Word Processing to Classify Film Genres; 2023 25th International Conference on Digital Signal Processing and its Applications (DSPA); 2023-03-29

5. KIASOntoRec: A Knowledge Infused Approach for Socially Aware Ontology Recommendation; Innovations in Bio-Inspired Computing and Applications; 2023