Machine Learning–enabled Scalable Performance Prediction of Scientific Codes-Reference-Cited by-同舟云学术

Machine Learning–enabled Scalable Performance Prediction of Scientific Codes

Published:2021-04-30 Issue:2 Volume:31 Page:1-28
ISSN:1049-3301
Container-title:ACM Transactions on Modeling and Computer Simulation
language:en
Short-container-title:ACM Trans. Model. Comput. Simul.

Author:

Chennupati Gopinath¹^ORCID,Santhi Nandakishore¹,Romero Phill¹,Eidenbenz Stephan¹

Affiliation:

1. Los Alamos National Laboratory, NM

Abstract

Hardware architectures become increasingly complex as the compute capabilities grow to exascale. We present the Analytical Memory Model with Pipelines (AMMP) of the Performance Prediction Toolkit (PPT). PPT-AMMP takes high-level source code and hardware architecture parameters as input and predicts runtime of that code on the target hardware platform, which is defined in the input parameters. PPT-AMMP transforms the code to an (architecture-independent) intermediate representation, then (i) analyzes the basic block structure of the code, (ii) processes architecture-independent virtual memory access patterns that it uses to build memory reuse distance distribution models for each basic block, and (iii) runs detailed basic-block level simulations to determine hardware pipeline usage. PPT-AMMP uses machine learning and regression techniques to build the prediction models based on small instances of the input code, then integrates into a higher-order discrete-event simulation model of PPT running on Simian PDES engine. We validate PPT-AMMP on four standard computational physics benchmarks and present a use case of hardware parameter sensitivity analysis to identify bottleneck hardware resources on different code inputs. We further extend PPT-AMMP to predict the performance of a scientific application code, namely, the radiation transport mini-app SNAP. To this end, we analyze multi-variate regression models that accurately predict the reuse profiles and the basic block counts. We validate predicted SNAP runtimes against actual measured times.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications,Modeling and Simulation

Link

https://dl.acm.org/doi/pdf/10.1145/3450264

Reference53 articles.

1. An Integrated Interconnection Network Model for Large-Scale Performance Prediction

2. LogGP

3. Fast, accurate, and scalable memory modeling of GPGPUs using reuse profiles

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. BB-ML: Basic Block Performance Prediction using Machine Learning Techniques;2023 IEEE 29th International Conference on Parallel and Distributed Systems (ICPADS);2023-12-17