1. Mask: Redesigning the gpu memory hierarchy to support multi-application concurrency;Ausavarungnirun,2018
2. Improving GPU multitasking efficiency using dynamic resource sharing;Kim;IEEE Comput. Archit. Lett.,2018
3. Classification-driven search for effective SM partitioning in multitasking GPUs;Zhao,2018
4. Slate: Enabling workload-aware efficient multiprocessing for modern GPGPUs;Allen,2019
5. Salus: Fine-grained GPU sharing primitives for deep learning applications;Yu,2019