1. NVIDIA GPU Inference Engine (2016). https://devblogs.nvidia.com/parallelforall/production-deep-learning-nvidia-gpu-inference-engine/. Accessed 6 July 2020
2. Abadi, M., et al.: TensorFlow: Large-scale machine learning on heterogeneous systems (2015). http://tensorflow.org/
3. Agarwal, A., et al.: An introduction to computational networks and the computational network toolkit. Technical Report MSR-TR-2014-112 (2014). http://research.microsoft.com/apps/pubs/default.aspx?id=226641
4. Ansel, J., et al.: Opentuner: an extensible framework for program autotuning. In: International Conference on Parallel Architectures and Compilation Techniques. Edmonton, Canada (2014). http://groups.csail.mit.edu/commit/papers/2014/ansel-pact14-opentuner.pdf
5. Baghdadi, R., et al.: Tiramisu: a code optimization framework for high performance systems. arXiv preprint arXiv:1804.10694 (2018)