Author:
Li Weian,Wu Huiwen,Yang Dongping
Publisher
Springer Nature Singapore
Reference21 articles.
1. Redmon J., Divvala S., Girshick R., et al.: You only look once: unified, real-time object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779–788. IEEE, Las Vegas, NV, USA (2016)
2. Deng, L., Li, J., Huang, J.T., et al.: Recent advances in deep learning for speech research at Microsoft. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 8604–8608. IEEE, Vancouver, BC, Canada (2013)
3. Chen, C., Seff, A., Kornhauser, A., et al.: Deepdriving: learning affordance for direct perception in autonomous driving. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp. 2722–2730. IEEE, Santiago, Chile (2015)
4. Dosovitskiy, A., Beyer, L., Kolesnikov, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. eprint arXiv: 2010.11929 (2020)
5. Chen Z., Xie L., Niu J., et al.: Visformer: the vision-friendly transformer. In: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp. 589–598. IEEE, virtually (2021)