Affiliation:
1. M. S. Ramaiah Institute of Technology, India
2. S. R. M. University, India
Abstract
Processing Big Data is a huge challenge for today's technology. There is a need to find, apply and analyze new ways of computing to make use of the Big Data so as to derive business and scientific value from it. Cloud computing with its promise of seemingly infinite computing resources is seen as the solution to this problem. Data Intensive computing on cloud builds upon the already mature parallel and distributed computing technologies such HPC, grid and cluster computing. However, handling Big Data in the cloud presents its own challenges. In this chapter, we analyze issues specific to data intensive cloud computing and provides a study on available solutions in programming models, data distribution and replication, resource provisioning and scheduling with reference to data intensive applications in cloud. Future directions for further research enabling data intensive cloud applications in cloud environment are identified.
Reference35 articles.
1. Ács, B., Llorà, X., Capitanu, B., Auvil, L., Tcheng, D., Haberman, M., … Welge, M. (2011). Meandre Data-Intensive Application Infrastructure: Extreme Scalability for Cloud and / or Grid Computing. In New Frontiers in Artificial Intelligence (pp. 233–242). Academic Press.
2. Makeflow
3. Baker, J., Bond, C., Corbett, J., & Furman, J. (2011). Megastore: Providing Scalable, Highly Available Storage for Interactive Services. CIDR, 223–234.
4. Time and Cost Sensitive Data-Intensive Computing on Hybrid Clouds
5. TRACON: Interference-Aware Schedulingfor Data-Intensive Applicationsin Virtualized Environments