1. An updated set of basic linear algebra subprograms (BLAS)
2. Using Machine Learning to Focus Iterative Optimization
3. OpenTuner
4. An adaptive performance modeling tool for GPU architectures
5. cuDNN. NVIDIA cuDNN GPU accelerated deep learning. https://developer.nvidia.com/cudnn. F. De Mesmay A. Rimmel Y. Voronenko and M. Püschel. Bandit-based optimization on graphs with application to library performance tuning. In Annual International Conference on Machine Learning pages 729– 736. ACM 2009. 10.1145/1553374.1553468 cuDNN. NVIDIA cuDNN GPU accelerated deep learning. https://developer.nvidia.com/cudnn. F. De Mesmay A. Rimmel Y. Voronenko and M. Püschel. Bandit-based optimization on graphs with application to library performance tuning. In Annual International Conference on Machine Learning pages 729– 736. ACM 2009. 10.1145/1553374.1553468