1. Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Haichen Shen, Meghan Cowan, Leyuan Wang, Yuwei Hu, Luis Ceze, et al. 2018. TVM: An automated end-to-end optimizing compiler for deep learning. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 18). USENIX Association, Boston, MA, 578--594.
2. Daniel Crankshaw, Xin Wang, Giulio Zhou, Michael J Franklin, Joseph E Gonzalez, and Ion Stoica. 2017. Clipper: A low-latency online prediction serving system. In Proceedings of the Conference on Networked Systems Design and Implementation (NSDI). USENIX Association, Boston, MA, USA, 613--627.
3. Google. 2018. GRPC Framework. https://grpc.io/. [Online; accessed 17-Apr-2022].
4. Anuj Kalia, Michael Kaminsky, and David Andersen. 2019. Datacenter RPCs can be General and Fast. In Proceedings of the Conference on Networked Systems Design and Implementation (NSDI). USENIX Association, Boston, MA, USA, 1--16.
5. NVIDIA. 2022. CUDA GPUDirect RDMA. https://docs.nvidia.com/cuda/gpudirect-rdma/index.html. [Online; accessed 17-Apr-2022].