Affiliation:
1. Mahavir Polytechnic, Nashik, Maharashtra, India
Abstract
Data de-duplication refers to providing cloud providers with a way to manage the uncontrollable data and the challenges of cloud storage. The success of IOT and social media led us to face big data challenges. Big data is interesting but also becomes a critical challenge for cloud service providers. Data storage and management become also the topic of discussion where big data generate business opportunities as well as come with big issues for the cloud providers. In this paper, we discussed the issues of redundant data and techniques to prevent data redundancy on the cloud. A third-party auditor checks the user’s data for correctness and gives the accuracy of the data that is stored in a cloud server. The communication and computation overheads were reduced. The deduplication technique is used to check whether the file that users need to store in cloud storage already exists on the cloud server.