1. Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3–6). ImageNet Classification with Deep Convolutional Neural Networks. Proceedings of the Advances in Neural Information Processing Systems, Lake Tahoe, NV, USA.
2. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups;Hinton;IEEE Signal Process. Mag.,2012
3. Devlin, J., Chang, M.W., Lee, K., and Toutanova, K. (2019, January 2–7). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Minneapolis, MN, USA.
4. Yin, L., Wang, L., Li, T., Lu, S., Yin, Z., Liu, X., Li, X., and Zheng, W. (2023). U-Net-STN: A novel end-to-end lake boundary prediction model. Land, 12.
5. Multiscale feature extraction and fusion of image and text in VQA;Lu;Int. J. Comput. Intell. Syst.,2023