PolarDB-IMCI: A Cloud-Native HTAP Database System at Alibaba

Author:

Wang Jianying1ORCID,Li Tongliang1ORCID,Song Haoze1ORCID,Yang Xinjun1ORCID,Zhou Wenchao1ORCID,Li Feifei1ORCID,Yan Baoyue1ORCID,Wu Qianqian1ORCID,Liang Yukun1ORCID,Ying ChengJun2ORCID,Wang Yujie1ORCID,Chen Baokai1ORCID,Cai Chang1ORCID,Ruan Yubin1ORCID,Weng Xiaoyi1ORCID,Chen Shibin1ORCID,Yin Liang1ORCID,Yang Chengzhong1ORCID,Cai Xin1ORCID,Xing Hongyan1ORCID,Yu Nanlong1ORCID,Chen Xiaofei1ORCID,Huang Dapeng1ORCID,Sun Jianling2ORCID

Affiliation:

1. Alibaba Group, Hangzhou, China

2. Alibaba Group & Zhejiang University, Hangzhou, China

Abstract

Cloud-native databases have become the de-facto choice for mission-critical applications on the cloud due to the need for high availability, resource elasticity, and cost efficiency. Meanwhile, driven by the increasing connectivity between data generation and analysis, users prefer a single database to efficiently process both OLTP and OLAP workloads, which enhances data freshness and reduces the complexity of data synchronization and the overall business cost. In this paper, we summarize five crucial design goals for a cloud-native HTAP database based on our experience and customers' feedback, i.e., transparency, competitive OLAP performance, minimal perturbation on OLTP workloads, high data freshness, and excellent resource elasticity. As our solution to realize these goals, we present PolarDB-IMCI, a cloud-native HTAP database system designed and deployed at Alibaba Cloud. Our evaluation results show that PolarDB-IMCI is able to handle HTAP efficiently on both experimental and production workloads; notably, it speeds up analytical queries up to ×149 on TPC-H (100GB). PolarDB-IMCI introduces low visibility delay and little performance perturbation on OLTP workloads (<5%), and resource elasticity can be achieved by scaling out in tens of seconds.

Publisher

Association for Computing Machinery (ACM)

Reference57 articles.

1. Main-memory hash joins on multi-core CPUs: Tuning to the underlying hardware

2. Wildfire

3. Memory-efficient hash joins

4. Peter A. Boncz , Marcin Zukowski , and Niels Nes . 2005 . MonetDB/X100: Hyper-Pipelining Query Execution . In Second Biennial Conference on Innovative Data Systems Research, CIDR 2005, Asilomar, CA, USA, January 4--7, 2005, Online Proceedings. www.cidrdb.org, 225--237 . Peter A. Boncz, Marcin Zukowski, and Niels Nes. 2005. MonetDB/X100: Hyper-Pipelining Query Execution. In Second Biennial Conference on Innovative Data Systems Research, CIDR 2005, Asilomar, CA, USA, January 4--7, 2005, Online Proceedings. www.cidrdb.org, 225--237.

5. Apache hadoop goes realtime at Facebook

Cited by 6 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Two Birds With One Stone: Designing a Hybrid Cloud Storage Engine for HTAP;Proceedings of the VLDB Endowment;2024-07

2. A survey on hybrid transactional and analytical processing;The VLDB Journal;2024-06-04

3. Riveter: Adaptive Query Suspension and Resumption Framework for Cloud Native Databases;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13

4. Log Replaying for Real-Time HTAP: An Adaptive Epoch-Based Two-Stage Framework;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13

5. Rethink Query Optimization in HTAP Databases;Proceedings of the ACM on Management of Data;2023-12-08

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3