##plugins.themes.academic_pro.article.main##

Abstract

Distributed file systems are key building blocks for cloud computing applications based on the MapReduce programming paradigm. In such file systems, the nodes are simultaneously serve this computing and storage functions; a file is partitioned into a number of chunks allocated in distinct nodes so that this MapReduce tasks can be performed in parallel over the nodes. However, in a cloud computing environment, failure is the norm, and nodes may be upgraded, replaced, and added in the system. Files can also be dynamically created, deleted, and appended. This results in load imbalance in a distributed file system; that is, the file chunks are not distributed as uniformly as possible among the nodes. Emerging distributed file systems in production systems strongly depend on a central node for chunk reallocation. In this paper, a fully distributed load rebalancing algorithm is presented to solve with the load imbalance problem. This algorithm is compared against a centralized approach in a production system and a competing distributed solution presented in the literature. The simulation results indicate that our proposal is comparable with the existing centralized approach and considerably outperforms the prior distributed algorithm in terms of load imbalance factor, movement cost, and algorithmic overhead.

Keywords: Load balance, distributed file systems, clouds,filechunks,load

##plugins.themes.academic_pro.article.details##

Author Biographies

Lakshmi urs S M, Shridevi Institute of Engineering and Technology, Tumkur

Dept. of Computer Science and Engineering

Dr. C.D Guruprakash, Shridevi Institute of Engineering and Technology, Tumkur

Professor and Head, Dept of Computer Science and Engineering
How to Cite
S M, L. urs, & Guruprakash, D. C. (2015). A Simple Load Rebalancing algorithm to rebalance the loads in clouds. International Journal of Emerging Trends in Science and Technology, 2(06). Retrieved from http://igmpublication.org/ijetst.in/index.php/ijetst/article/view/763

References

1. J. Dean and S. Ghemawat, “MapReduce: Simplified Data Processing on Large Clusters,” Proc. Sixth Symp. Operating System Design and Implementation (OSDI ’04), pp. 137-150, Dec. 2004.
2. S. Ghemawat, H. Gobioff, and S.-T. Leung, “The Google File System,” Proc. 19th ACM Symp. Operating Systems Principles (SOSP’03), pp. 29-43, Oct. 2003.
3. Hadoop Distributed File System, http://hadoop.apache.org/hdfs/, 2012.
4. VMware, http://www.vmware.com/, 2012.
5. Xen, http://www.xen.org/, 2012.
6. Apache Hadoop, http://hadoop.apache.org/, 2012.
7. Hadoop Distributed File System “Rebalancing Blocks,” http://developer.yahoo.com/hadoop/tutorial/module2.html#rebalancing,2012.
8. K. McKusick and S. Quinlan, “GFS: Evolution on Fast-Forward,”Comm. ACM, vol. 53, no.3, pp. 42-49, Jan. 2010.
9. HDFS Federation, http://hadoop.apache.org /common/docs/r0.23.0/hadoop-yarn/hadoop-yarn-site/Federation.html, 2012.