1. MLIR: A Compiler Infrastructure for the End of Moore's Law;Lattner;CoRR,2020
2. TVM: An automated end-to-end optimizing compiler for deep learning;Chen,2018
3. LIBXSMM: Accelerating Small Matrix Multiplications by Runtime Code Generation
4. Tensor comprehensions: Framework-agnostic high-performance machine learning abstractions;Vasilache;CoRR,2018
5. ROLLER: Fast and Efficient Tensor Compilation for Deep Learning;Zhu