Optimal Common Job Block Table (CJBT) to improve the Performance in Hadoop framework-Reference-Cited by-同舟云学术

Optimal Common Job Block Table (CJBT) to improve the Performance in Hadoop framework

Published:2021-12-15 Issue: Volume: Page:346-350
ISSN:2456-3307
Container-title:International Journal of Scientific Research in Computer Science, Engineering and Information Technology
language:en
Short-container-title:IJSRCSEIT

Author:

Pinjari Vali Basha ¹

Affiliation:

1. M. Tech Scholar, Computer Science and Engineering, JNTUA College of Engineering, Ananthapuramu, Andhra Pradesh, India

Abstract

By rapid transformation of technology, huge amount of data (structured data and Un Structured data) is generated every day. With the aid of 5G technology and IoT the data generated and processed every day is very large. If we dig deeper the data generated approximately 2.5 quintillion bytes. This data (Big Data) is stored and processed with the help of Hadoop framework. Hadoop framework has two phases for storing and retrieve the data in the network. <ul> <li>Hadoop Distributed file System (HDFS)</li> <li>Map Reduce algorithm</li> </ul> In the native Hadoop framework, there are some limitations for Map Reduce algorithm. If the same job is repeated again then we have to wait for the results to carry out all the steps in the native Hadoop. This led to wastage of time, resources. If we improve the capabilities of Name node i.e., maintain Common Job Block Table (CJBT) at Name node will improve the performance. By employing Common Job Block Table will improve the performance by compromising the cost to maintain Common Job Block Table. Common Job Block Table contains the meta data of files which are repeated again. This will avoid re computations, a smaller number of computations, resource saving and faster processing. The size of Common Job Block Table will keep on increasing, there should be some limit on the size of the table by employing algorithm to keep track of the jobs. The optimal Common Job Block table is derived by employing optimal algorithm at Name node.

Publisher

Technoscience Academy

Subject

General Medicine

Reference16 articles.

1. Sachin Arun Thanekar, K. Subrahmanyam, A. B. Bagwan, “Big Data and MapReduce Challenges, Opportunities and Trends”, International Journal of Electrical and Computer Engineering (IJECE) Vol. 6, No. 6, pp. 2911~2919, December 2016.

2. Sachin Arun Thanekar, K. Subrahmanyam, A. B. Bagwan, “A Study on Digital Forensics in Hadoop”, I J C T A, 9(18), pp. 8927-8933, 2016.

3. H. Alshammari; J. Lee; H. Bajwa, "H2Hadoop: Improving Hadoop Performance using the Metadata of Related Jobs," in IEEE Transactions on Cloud Computing , vol.PP, no.99, pp.1-1

4. H. Alshammari, J. Lee and H. Bajwa, "Evaluate H2Hadoop and Amazon EMR performances by processing MR jobs in text data sets," 2016 IEEE Long Island Systems, Applications and Technology Conference (LISAT), Farmingdale, NY, 2016, pp. 1-6.

5. Ibrahim Abaker Targio Hashem, Ibrar Yaqoob, Nor Badrul Anuar, Salimah Mokhtar, Abdullah Gani, Samee Ulah Khan, “The rise of “big data” on cloud computing : Review and open research issues ”, Elsevier Information Systems 47 (2015) 98–115.