AWARE: Workload-aware, Redundancy-exploiting Linear Algebra

Published:2023-05-26 Issue:1 Volume:1 Page:1-28
ISSN:2836-6573
Container-title:Proceedings of the ACM on Management of Data
language:en
Short-container-title:Proc. ACM Manag. Data

Author:

Baunsgaard Sebastian¹, Boehm Matthias¹

Affiliation:

1. Technische Universität Berlin, Berlin, Germany

Abstract

Compression is an effective technique for fitting data in available memory, reducing I/O, and increasing instruction parallelism. While data systems primarily rely on lossless compression, modern machine learning (ML) systems exploit the approximate nature of ML and mostly use lossy compression via low-precision floating- or fixed-point representations. The resulting unknown impact on learning progress and model accuracy, however, creates trust concerns, requires trial and error, and is problematic for declarative ML pipelines. Given the trend towards increasingly complex, composite ML pipelines (with outer loops for hyper-parameter tuning, feature selection, and data cleaning/augmentation), it is hard for a user to infer the impact of lossy compression. Sparsity exploitation is a common lossless scheme used to improve performance without this uncertainty. Evolving this concept to general redundancy-exploiting compression is a natural next step. Existing work on lossless compression and compressed linear algebra (CLA) enables such exploitation to a degree, but faces challenges for general applicability. In this paper, we address these limitations with a workload-aware compression framework, comprising a broad spectrum of new compression schemes and kernels. Instead of a data-centric approach that optimizes compression ratios, our workload-aware compression summarizes the workload of an ML pipeline, and optimizes the compression and execution plan to minimize execution time. On various micro benchmarks and end-to-end ML pipelines, we observe improvements for individual operations up to 10,000x and ML algorithms up to 6.6x compared to uncompressed operations.

Funder

Austrian Federal Ministry for Climate Action, Environment, Energy, Mobility, Innovation and Technology

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3588682

References: 107 articles.

1. Column-oriented database systems

2. Daniel J. Abadi, Samuel Madden, and Miguel Ferreira. 2006. Integrating compression and execution in column-oriented database systems. In SIGMOD. 671-682. https://doi.org/10.1145/1142473.1142548

3. Daniel J. Abadi, Samuel Madden, and Miguel Ferreira. 2006. Integrating compression and execution in column-oriented database systems. In SIGMOD. 671-682. https://doi.org/10.1145/1142473.1142548

4. Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, Manjunath Kudlur, Josh Levenberg, Rajat Monga, Sherry Moore, Derek Gordon Murray, Benoit Steiner, Paul A. Tucker, Vijay Vasudevan, Pete Warden, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2016. TensorFlow: A System for Large-Scale Machine Learning. In OSDI. 265-283. https://www.usenix.org/conference/osdi16/technical-sessions/presentation/abadi

5. Amir Abboud, Arturs Backurs, Karl Bringmann, and Marvin Künnemann. 2020. Impossibility Results for Grammar-Compressed Linear Algebra. In NeurIPS. https://arxiv.org/abs/2010.14181