Affiliation:
1. Stanford University, CA
2. Oracle Labs and EPFL
3. Oracle Labs and Stanford University, CA
4. EPFL
Abstract
Developing high-performance software is a difficult task that requires the use of low-level, architecture-specific programming models (e.g., OpenMP for CMPs, CUDA for GPUs, MPI for clusters). It is typically not possible to write a single application that can run efficiently in different environments, leading to multiple versions and increased complexity. Domain-Specific Languages (DSLs) are a promising avenue to enable programmers to use high-level abstractions and still achieve good performance on a variety of hardware. This is possible because DSLs have higher-level semantics and restrictions than general-purpose languages, so DSL compilers can perform higher-level optimization and translation. However, the cost of developing performance-oriented DSLs is a substantial roadblock to their development and adoption. In this article, we present an overview of the Delite compiler framework and the DSLs that have been developed with it. Delite simplifies the process of DSL development by providing common components, like parallel patterns, optimizations, and code generators, that can be reused in DSL implementations. Delite DSLs are embedded in Scala, a general-purpose programming language, but use metaprogramming to construct an Intermediate Representation (IR) of user programs and compile to multiple languages (including C++, CUDA, and OpenCL). DSL programs are automatically parallelized and different parts of the application can run simultaneously on CPUs and GPUs. We present Delite DSLs for machine learning, data querying, graph analysis, and scientific computing and show that they all achieve performance competitive to or exceeding C++ code.
Funder
Language and Algorithms for Heterogeneous Graph Streams
Oracle
Advanced Micro Devices
Intel Corporation
Division of Computing and Communication Foundations
Huawei Technologies
Defense Advanced Research Projects Agency
Stanford University
National Science Foundation
European Research Council
Nvidia
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture,Software
Cited by
126 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. eCC++ : A Compiler Construction Framework for Embedded Domain-Specific Languages;2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW);2024-05-27
2. Auto-Generating Diverse Heterogeneous Designs;2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW);2024-05-27
3. Vectorized Intrinsics Can Be Replaced with Pure Java Code without Impairing Steady-State Performance;Proceedings of the 15th ACM/SPEC International Conference on Performance Engineering;2024-05-07
4. Rhyme: A Data-Centric Multi-paradigm Query Language Based on Functional Logic Metaprogramming;Lecture Notes in Computer Science;2024
5. Towards Systematic and Precise Compilation of Domain-Specific Modelling Languages;Advances in Intelligent Systems and Computing;2024