1. An Analysis of Deep Neural Network Models for Practical Applications;Canziani;arXiv: 1605.07678 [cs.CV],2017
2. Learning Efficient Object Detection Models with Knowledge Distillation;Chen;Neural Information Processing Systems.
3. Learning Efficient Object Detection Models with Knowledge Distillation;Chen
4. Generating Long Sequences with Sparse Transformers;Child;arXiv: 1904.10509 [cs.LG],2019
5. An Image is Worth 16 × 16 Words: Transformers for Image Recognition at Scale;Dosovitskiy;arXiv: 2010.11929 [cs. CV],2021