1. Qos-aware dynamic resource allocation for spatial-multitasking gpus;Aguilera,2014
2. Reasoning based workload performance prediction in cloud data centers;Aslam,2019
3. Integrated deep learning method for workload and resource prediction in cloud systems;Bi;Neurocomputing,2021
4. Balancing efficiency and fairness in heterogeneous GPU clusters for deep learning;Chaudhary,2020
5. Exploiting inter-job and intra-job parallelism of distributed machine learning on heterogeneous GPUs;Chen,2022