1. Demystifying parallel and distributed deep learning: An in-depth concurrency analysis;Ben-Nun;ACM Comput. Surv.,2019
2. Efficient processing of deep neural networks: A tutorial and survey;Sze;Proc. IEEE,2017
3. K. Chellapilla, S. Puri, P. Simard, High performance convolutional neural networks for document processing, in: International Workshop on Frontiers in Handwriting Recognition, 2006.
4. Anatomy of high-performance deep learning convolutions on SIMD architectures;Georganas,2018
5. High performance and portable convolution operators for multicore processors;San Juan,2020