Data Lake Architecture

Published:2020-01 Issue:1 Volume:10 Page:63-75
ISSN:1947-9344
Container-title:International Journal of Organizational and Collective Intelligence
language:en
Author:

Panwar Arvind¹, Bhatnagar Vishal²

Affiliation:

1. GGSIP University, Delhi, India

2. Department of Computer Science and Engineering, Ambedkar Institute of Advanced Communication Technologies and Research, New Delhi, India

Abstract

Data is the biggest asset for businesses after people, and it is a new driver of the world economy. The volume of data that enterprises gather every day is growing rapidly. This rapid growth of data in terms of volume, variety, and velocity is known as Big Data. Big Data is a challenge for enterprises, and the biggest challenge is how to store it. In the past, and in some organizations still today, data warehouses have been used to store Big Data. Enterprise data warehouses work on the concept of schema-on-write, but Big Data analytics requires data storage that works on the schema-on-read concept. To meet this demand, researchers are working on a new data repository system for Big Data storage known as a data lake. The data lake is defined as a landing area for raw data from many sources. There is some confusion about data lakes, and there are questions that must be answered. The objective of this article is to reduce that confusion and address some of these questions with the help of an architecture.

Publisher

IGI Global

References: 40 articles.

1. Critical success factors (CSFs) for information technology governance (ITG)

2. Big Data computing and clouds: Trends and future directions

3. A Fine‐Grained Distribution Approach for ETL Processes in Big Data Environments

4. Belle, A., Thiagarajan, R., Soroushmehr, S. M., Navidi, F., Beard, D. A., & Najarian, K. (2015). Big data analytics in healthcare. BioMed Research International. Retrieved from https://www.hindawi.com/journals/bmri/2015/370194/abs/

5. Big data, Big bang?

Cited by 14 articles.

1. A data lake-based security transmission and storage scheme for streaming big data;Cluster Computing;2023-12-09

2. Intelligent Decision-Making System for Integrated Geological and Engineering of Deep Coalbed Methane Development;Energy & Fuels;2023-09-06

3. Construction and Application of a Big Data System for Regional Lakes in Coalbed Methane Development;ACS Omega;2023-05-11

4. Development of a software and hardware solution to identify trends in demand for goods;Herald of Dagestan State Technical University. Technical Sciences;2023-05-10

5. Designing Hybrid Storage Architectures with RDBMS and NoSQL Systems: A Survey;International Conference on Advanced Intelligent Systems for Sustainable Development;2023