Author:
Khine Pwint Phyu,Wang Zhaoshun
Abstract
The inevitability of the relationship between big data and distributed systems is indicated by the fact that data characteristics cannot be easily handled by a standalone centric approach. Among the different concepts of distributed systems, the CAP theorem (Consistency, Availability, and Partition Tolerant) points out the prominent use of the eventual consistency property in distributed systems. This has prompted the need for other, different types of databases beyond SQL (Structured Query Language) that have properties of scalability and availability. NoSQL (Not-Only SQL) databases, mostly with the BASE (Basically Available, Soft State, and Eventual consistency), are gaining ground in the big data era, while SQL databases are left trying to keep up with this paradigm shift. However, none of these databases are perfect, as there is no model that fits all requirements of data-intensive systems. Polyglot persistence, i.e., using different databases as appropriate for the different components within a single system, is becoming prevalent in data-intensive big data systems, as they are distributed and parallel by nature. This paper reflects the characteristics of these databases from a conceptual point of view and describes a potential solution for a distributed system—the adoption of polyglot persistence in data-intensive systems in the big data era.
Funder
National Key Research and Development plan 2017 for High-Performance Computing
Reference89 articles.
1. McKinsey Global Institute, Big Data: The Next Frontier for Innovation, Competition, and Productivity; 2011
https://www.mckinsey.com/business-functions/digital-mckinsey/our-insights/big-data-the-next-frontier-for-innovation
2. 3D Data Management: Controlling Data Volume, Velocity, and Variety
https://blogs.gartner.com/doug-laney/files/2012/01/ad949-3D-Data-Management-Controlling-Data-Volume-Velocity-and-Variety.pdf
3. Eventually Consistent
Cited by
19 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. NEST: Node with Statistics Tree for IoT Data Persistence and Real-time Queries;Proceedings of the 15th Asia-Pacific Symposium on Internetware;2024-07-24
2. Security and Ownership in User-Defined Data Meshes;Algorithms;2024-04-22
3. Um Estudo sobre Modelagem Poliglota de Dados;Anais da XIX Escola Regional de Banco de Dados (ERBD 2024);2024-04-10
4. An Overview on Testing Big Data Applications;Lecture Notes in Networks and Systems;2024
5. CCIR: An Architecture for Collecting and Storing Connnected Corridor Infrastructure and Mobility Data;2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC);2023-09-24