Stateful Serverless Computing with Crucial

Author:

Barcelona-Pons Daniel (1), Sutra Pierre (2), Sánchez-Artigas Marc (1), París Gerard (1), García-López Pedro (3)

Affiliation:

1. Universitat Rovira i Virgili, Tarragona, Spain

2. Télécom SudParis, Palaiseau, France

3. Universitat Rovira i Virgili, Tarragona, Spain and IBM T.J. Watson Research Center, Yorktown Heights, NY

Abstract

Serverless computing greatly simplifies the use of cloud resources. In particular, Function-as-a-Service (FaaS) platforms enable programmers to develop applications as individual functions that can run and scale independently. Unfortunately, applications that require fine-grained support for mutable state and synchronization, such as machine learning (ML) and scientific computing, are notoriously hard to build with this new paradigm. In this work, we aim to bridge this gap. We present Crucial, a system to program highly parallel stateful serverless applications. Crucial retains the simplicity of serverless computing. It is built upon the key insight that FaaS resembles concurrent programming at the scale of a datacenter. Accordingly, a distributed shared memory layer is the natural answer to the need for fine-grained state management and synchronization. Crucial makes it possible to port a multi-threaded code base to serverless effortlessly, where it can benefit from the scalability and pay-per-use model of FaaS platforms. We validate Crucial with the help of micro-benchmarks and by considering various stateful applications. Beyond classical parallel tasks (e.g., a Monte Carlo simulation), these applications include representative ML algorithms such as k-means and logistic regression. Our evaluation shows that Crucial obtains superior or comparable performance to Apache Spark at similar cost (18%–40% faster). We also use Crucial to port (part of) a state-of-the-art multi-threaded ML library to serverless. The ported application is up to 30% faster than with a dedicated high-end server. Finally, we show that Crucial can rival the performance of a single-machine, multi-threaded implementation of a complex coordination problem. Overall, Crucial delivers all these benefits with less than 6% of changes in the code bases of the evaluated applications.
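As an illustration of the kind of multi-threaded, shared-state program the abstract has in mind (a Monte Carlo simulation is one of the tasks it names), the following is a minimal sketch in plain Python. It does not use Crucial or any of its API: `SharedCounter`, `count_hits`, and `estimate_pi` are hypothetical names chosen for this example, and ordinary local threads stand in for cloud functions. The point is only to show the pattern of parallel workers updating one mutable object under synchronization, which is the pattern the abstract says a distributed shared memory layer is meant to support.

```python
import random
import threading


class SharedCounter:
    """Lock-protected counter; a local stand-in for a shared mutable object."""

    def __init__(self):
        self._value = 0
        self._lock = threading.Lock()

    def add(self, n):
        # Serialize updates so concurrent workers do not lose increments.
        with self._lock:
            self._value += n

    def get(self):
        with self._lock:
            return self._value


def count_hits(samples, seed, counter):
    """Sample points in the unit square; count those inside the quarter circle."""
    rng = random.Random(seed)  # per-worker generator, seeded for reproducibility
    hits = 0
    for _ in range(samples):
        x, y = rng.random(), rng.random()
        if x * x + y * y <= 1.0:
            hits += 1
    # One synchronized update per worker keeps contention low.
    counter.add(hits)


def estimate_pi(workers=4, samples_per_worker=50_000):
    """Run the workers in parallel threads and combine their results."""
    counter = SharedCounter()
    threads = [
        threading.Thread(target=count_hits, args=(samples_per_worker, seed, counter))
        for seed in range(workers)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return 4.0 * counter.get() / (workers * samples_per_worker)


if __name__ == "__main__":
    print(estimate_pi())
```

In this sketch the only shared state is the counter, and each worker touches it once. How Crucial itself expresses shared objects and function invocation is described in the paper and is not reproduced here.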

Funder

EU Horizon 2020 programme

Spanish Government

Publisher

Association for Computing Machinery (ACM)

Subject

Software

Cited by 14 articles.

1. Orchestration and Management of Adaptive IoT-Centric Distributed Applications;IEEE Internet of Things Journal;2024-02-01

2. MLLess: Achieving cost efficiency in serverless machine learning training;Journal of Parallel and Distributed Computing;2024-01

3. A Survey of Actor-Like Programming Models for Serverless Computing;Lecture Notes in Computer Science;2024

4. Glider;Proceedings of the 24th International Middleware Conference;2023-11-27

5. SPIRT: A Fault-Tolerant and Reliable Peer-to-Peer Serverless ML Training Architecture;2023 IEEE 23rd International Conference on Software Quality, Reliability, and Security (QRS);2023-10-22
