Optimizing in-memory database engine for AI-powered on-line decision augmentation using persistent memory

Author:

Chen Cheng1,Yang Jun2,Lu Mian2,Wang Taize2,Zheng Zhao2,Chen Yuqiang2,Dai Wenyuan2,He Bingsheng3,Wong Weng-Fai3,Wu Guoan4,Zhao Yuping4,Rudoff Andy4

Affiliation:

1. 4Paradigm Inc. and National University of Singapore

2. 4Paradigm Inc.

3. National University of Singapore

4. Intel Corporation

Abstract

On-line decision augmentation (OLDA) has been considered as a promising paradigm for real-time decision making powered by Artificial Intelligence (AI). OLDA has been widely used in many applications such as real-time fraud detection, personalized recommendation, etc. On-line inference puts real-time features extracted from multiple time windows through a pre-trained model to evaluate new data to support decision making. Feature extraction is usually the most time-consuming operation in many OLDA data pipelines. In this work, we started by studying how existing in-memory databases can be leveraged to efficiently support such real-time feature extractions. However, we found that existing in-memory databases cost hundreds or even thousands of milliseconds. This is unacceptable for OLDA applications with strict real-time constraints. We therefore propose FEDB ( <u>F</u> eature <u>E</u> ngineering <u>D</u> ata <u>b</u> ase), a distributed in-memory database system designed to efficiently support on-line feature extraction. Our experimental results show that FEDB can be one to two orders of magnitude faster than the state-of-the-art in-memory databases on real-time feature extraction. Furthermore, we explore the use of the Intel Optane DC Persistent Memory Module (PMEM) to make FEDB more cost-effective. When comparing the proposed PMEM-optimized persistent skiplist to the FEDB using DRAM+SSD, PMEM-based FEDB can shorten the tail latency up to 19.7%, reduce the recovery time up to 99.7%, and save up to 58.4% total cost of a real OLDA pipeline.

Publisher

VLDB Endowment

Subject

General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development

Cited by 13 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Biathlon: Harnessing Model Resilience for Accelerating ML Inference Pipelines;Proceedings of the VLDB Endowment;2024-06

2. Exploiting Persistent CPU Cache for Scalable Persistent Hash Index;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13

3. Optimizing Data Pipelines for Machine Learning in Feature Stores;Proceedings of the VLDB Endowment;2023-09

4. Krypton: Real-Time Serving and Analytical SQL Engine at ByteDance;Proceedings of the VLDB Endowment;2023-08

5. ADOps: An Anomaly Detection Pipeline in Structured Logs;Proceedings of the VLDB Endowment;2023-08

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3