Affiliation:
1. School of Artificial Intelligence, Hebei University of Technology, Tianjin 300400, China
2. State Grid Energy Saving Service Co., Ltd., Beijing 100052, China
3. School of Computer Science, Beijing Institute of Technology, Beijing 100081, China
Abstract
Key–value (KV) stores based on the LSM-tree have become the mainstream among contemporary storage engines, but they suffer from high write and read amplification. Moreover, real-world workloads are highly skewed, and existing KV stores lack hotness awareness, which leads to unreliable and poor performance on such workloads. In this paper, we propose HoaKV, which combines hotness awareness, KV separation, and hybrid indexing in a single system. Specifically, HoaKV uses the difference in access heat among KV pairs to manage hot data and cold data separately, and it adjusts this classification dynamically in real time. Within the cold data, it applies partial KV separation so that large and small KV pairs are managed differently. In addition, HoaKV uses hybrid indexing, with separate indexes for the hot data and the cold data, to improve read, write, and scan performance at the same time. Experiments on mixed read and write workloads show that HoaKV performs significantly better than several state-of-the-art KV stores, such as LevelDB, RocksDB, PebblesDB, and WiscKey.
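The three ideas named in the abstract can be illustrated with a toy in-memory sketch. It is not HoaKV's implementation: the class name `ToyHotColdStore`, the access-count rule for hotness, both thresholds, and the choice of a hash map for hot data and a sorted key list for cold data are all assumptions made for illustration. The sketch only promotes keys from cold to hot; the demotion that real-time adjustment would also require is omitted.

```python
import bisect


class ToyHotColdStore:
    """Toy model of hot/cold tiering with partial KV separation (illustrative only)."""

    def __init__(self, hot_threshold=3, large_value_bytes=16):
        self.hot_threshold = hot_threshold          # assumed: accesses before a key counts as hot
        self.large_value_bytes = large_value_bytes  # assumed: size cutoff for KV separation
        self.access_count = {}
        self.hot = {}            # hash index for hot pairs
        self.cold_keys = []      # sorted keys for cold pairs (stands in for an LSM-tree)
        self.cold_entries = {}   # key -> ("inline", value) or ("log", offset)
        self.value_log = []      # append-only log holding large cold values

    def _touch(self, key):
        """Count one access and report whether the key is now hot."""
        self.access_count[key] = self.access_count.get(key, 0) + 1
        return self.access_count[key] >= self.hot_threshold

    def _cold_remove(self, key):
        if key in self.cold_entries:
            del self.cold_entries[key]
            del self.cold_keys[bisect.bisect_left(self.cold_keys, key)]

    def _read_cold(self, key):
        kind, payload = self.cold_entries[key]
        return self.value_log[payload] if kind == "log" else payload

    def put(self, key, value):
        if self._touch(key):
            # Hot pairs live in the hash index regardless of value size.
            self._cold_remove(key)
            self.hot[key] = value
            return
        if key not in self.cold_entries:
            bisect.insort(self.cold_keys, key)
        if len(value) >= self.large_value_bytes:
            # Large cold value: store it in the log and keep only its offset.
            # Overwritten log entries become garbage; no collection is modelled.
            self.value_log.append(value)
            self.cold_entries[key] = ("log", len(self.value_log) - 1)
        else:
            # Small cold value: keep it inline next to the key.
            self.cold_entries[key] = ("inline", value)

    def get(self, key):
        if key in self.hot:
            self._touch(key)
            return self.hot[key]
        if key not in self.cold_entries:
            return None
        value = self._read_cold(key)
        if self._touch(key):
            # Promote a cold pair once it has been accessed often enough.
            self._cold_remove(key)
            self.hot[key] = value
        return value

    def tier(self, key):
        if key in self.hot:
            return "hot"
        if key in self.cold_entries:
            return "cold-" + self.cold_entries[key][0]
        return None

    def scan(self, start, end):
        """Return sorted (key, value) pairs with start <= key < end from both tiers."""
        lo = bisect.bisect_left(self.cold_keys, start)
        hi = bisect.bisect_left(self.cold_keys, end)
        result = {k: self._read_cold(k) for k in self.cold_keys[lo:hi]}
        result.update({k: v for k, v in self.hot.items() if start <= k < end})
        return sorted(result.items())
```

In this sketch a point lookup on a hot key costs one hash probe, while a range scan has to merge the unordered hot index with the ordered cold index, which is one of the trade-offs a hybrid index has to balance.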
Funder
National Natural Science Foundation of China
Subject
Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering