Abstract
Hadoop Distributed File System, which is popularly known as HDFS, is a Java-based distributed file system running on commodity machines. HDFS is basically meant for storing Big Data over distributed commodity machines and getting the work done at a faster rate due to the processing of data in a distributed manner. Basically, HDFS has one name node (master node) and cluster of data nodes (slave nodes). The HDFS files are divided into blocks. The block is the minimum amount of data (64 MB) that can be read or written. The functions of the name node are to master the slave nodes, to maintain the file system, to control client access, and to have control of the replications. To ensure the availability of the name node, a standby name node is deployed by failover control and fencing is done to avoid the activation of the primary name node during failover. The functions of the data nodes are to store the data, serve the read and write requests, replicate the blocks, maintain the liveness of the node, ensure the storage policy, and maintain the block cache size. Also, it ensures the availability of data.
Reference9 articles.
1. Beal, V. (2017). Hadoop Distributed File system- HDFS. Retrieved April 2017, from http://www.webopedia.com/TERM/H/hadoop_distributed_file_system_hdfs.html
2. Dynamic Deduplication Decision in a Hadoop Ditributed File System. International Journal of Distributed Sensor Networks;Chang,2014
3. Cloudera Inc. (2017). HDFS Key Features. Retrieved April 2017, from https://www.cloudera.com/products/open-source/apache-hadoop/hdfs-mapreduce-yarn.html
4. Content-Aware Partial Compression for Textual Big Data Analysis in Hadoop
5. Handling Big Data Using a Data-Aware HDFS and Evolutionary Clustering Technique
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Enhancing File Recovery from Distributed File Systems (DFSs) Using Erasure Coding and Replication;2024 11th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO);2024-03-14
2. A Study of Hadoop and Mapping Approach Techniques on Big Data Strategies;International Journal of Scientific Research in Science and Technology;2022-11-20