1. Runtime Performance Prediction for Deep Learning Models with Graph Neural Network
2. Analyzing CUDA workloads using a detailed GPU simulator
3. Tiresias: A GPU cluster manager for distributed deep learning. Gu. 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI 2019), 2019.
4. DIPPM: A deep learning inference performance predictive model using graph neural networks. Selvam. CoRR, 2023.
5. AntMan: Dynamic scaling on GPU clusters for deep learning. Xiao. 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 2020), Virtual Event, 2020.