Affiliation:
1. Department of CSE, Jawaharlal Nehru Technological University Anantapur, Ananthapuramu, Andhra Pradesh, India
2. G. Pulla Reddy Engineering College, Kurnool, India
3. Department of CSE, JNTUA College of Engineering, Ananthapuramu, Andhra Pradesh, India
Abstract
With technical advances, the volume of big data grows daily, to the point that traditional software tools struggle to handle it. In addition, the presence of imbalanced data within big data is a major concern for researchers. To manage big data effectively and to deal with imbalanced data, this paper proposes a new optimization algorithm. Big data classification is performed using the MapReduce framework, in which the map and reduce functions are based on the proposed optimization algorithm. The algorithm, named the Exponential Bat algorithm (E-Bat), integrates the Exponential Weighted Moving Average (EWMA) with the Bat Algorithm (BA). The map function selects the features, which are then passed to the reducer module for classification using a Neural Network (NN). The proposed E-Bat algorithm-based MapReduce framework is evaluated on four standard datasets: Breast cancer, Hepatitis, Pima Indian diabetes, and Heart disease. The experimental results show that the proposed method achieved a maximal accuracy of 0.8829 and a True Positive Rate (TPR) of 0.9090.
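For readers unfamiliar with the quantities named in the abstract, the following is a minimal sketch of two standard textbook definitions: the EWMA recurrence and the accuracy/TPR metrics. It is not the paper's E-Bat update rule, which the abstract does not specify. The function names `ewma` and `accuracy_and_tpr`, the smoothing factor `alpha`, and the choice of initializing the average with the first observation are assumptions made for illustration only.

```python
from typing import List, Sequence, Tuple


def ewma(values: Sequence[float], alpha: float) -> List[float]:
    """Textbook EWMA: s_t = alpha * x_t + (1 - alpha) * s_{t-1}.

    Assumption: the series is initialized with s_0 = x_0.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    smoothed: List[float] = []
    for x in values:
        if not smoothed:
            smoothed.append(float(x))  # s_0 = x_0 (illustrative choice)
        else:
            smoothed.append(alpha * x + (1.0 - alpha) * smoothed[-1])
    return smoothed


def accuracy_and_tpr(
    y_true: Sequence[int], y_pred: Sequence[int], positive: int = 1
) -> Tuple[float, float]:
    """Return (accuracy, TPR) for binary labels.

    accuracy = (TP + TN) / N, TPR = TP / (TP + FN).
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label sequences must be non-empty and equal length")
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p == positive)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == positive and p != positive)
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    accuracy = correct / len(y_true)
    # With no positive samples TPR is undefined; 0.0 is returned here by convention.
    tpr = tp / (tp + fn) if (tp + fn) else 0.0
    return accuracy, tpr
```

On imbalanced data, TPR is reported alongside accuracy because accuracy alone can remain high even when most minority-class (positive) samples are misclassified.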
Subject
Artificial Intelligence, Control and Systems Engineering, Software