Affiliation:
1. Guangdong University of Technology
Abstract
In order to deal with a large number of small files and hotspot data program in Hadoop distributed file system (HDFS)[1,, according to the exit proposal, this paper proposes a new the hotspot data processing model. The model proposals to change the block size, the introduction of efficient indexing mechanism to improve the dynamic replica management strategy and design of the new HDFS architecture to save space, speed up system processing, and enhance security.
Publisher
Trans Tech Publications, Ltd.
Reference13 articles.
1. Hadoop official website, http: /hadoop. apache. org.
2. Tom White. Hadoop definitive guide[M]. Minqi Zhou, Xiaoling Wang, Cheqing Jin, Weining Qian interpret. Tsinghua University Press, 2011. 7.
3. Hayes B. Cloud Computing [J]. Communications of the ACM, 2008, 51(7): 9-11.
4. Yutaka Kawai, Takashi Sasaki, Yoshimi Iida, Yoshiyuki Watase. Managing Large and Small Files in a Distributed System. 2011 IEEE.
5. Xuhui Liu, Jizhong Han, Yunqin Zhong, Chengde Han. Implementing WebGIS on Hadoop: A Case Study of Improving Small File I/O Performance on HDFS. 2009 IEEE.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献