Abstract
Time series databases aim to handle big amounts of data in a fast way, both when introducing new data to the system, and when retrieving it later on. However, depending on the scenario in which these databases participate, reducing the number of requested resources becomes a further requirement. Following this goal, NagareDB and its Cascading Polyglot Persistence approach were born. They were not just intended to provide a fast time series solution, but also to find a great cost-efficiency balance. However, although they provided outstanding results, they lacked a natural way of scaling out in a cluster fashion. Consequently, monolithic approaches could extract the maximum value from the solution but distributed ones had to rely on general scalability approaches. In this research, we proposed a holistic approach specially tailored for databases following Cascading Polyglot Persistence to further maximize its inherent resource-saving goals. The proposed approach reduced the cluster size by 33%, in a setup with just three ingestion nodes and up to 50% in a setup with 10 ingestion nodes. Moreover, the evaluation shows that our scaling method is able to provide efficient cluster growth, offering scalability speedups greater than 85% in comparison to a theoretically 100% perfect scaling, while also ensuring data safety via data replication.
Funder
Spanish Ministry of Science and Innovation
Government of Catalonia
Subject
Artificial Intelligence,Computer Science Applications,Information Systems,Management Information Systems
Reference29 articles.
1. Time Series Management Systems: A Survey
2. The DB-Engines Ranking, according to Their Popularity
https://db-engines.com/en/ranking
3. NagareDB: A Resource-Efficient Document-Oriented Time-Series Database
4. Brewer’s Conjecture and the Feasibility of Consistent, Available, Partition-Tolerant Web Services;Gilbert,2002
5. Characterization of data compression across CPU platforms and accelerators;Promberger,2022
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Techniques Used in Time Series Databases and Their Internals;2024 9th International Conference on Big Data Analytics (ICBDA);2024-03-16