1. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
2. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L.u., Polosukhin, I.: Attention is all you need. In: Advances in Neural Information Processing Systems, vol. 30. Curran Associates, Inc (2017)
3. Yang, W., Cao, Z., Chen, Q., Yang, Y., Yang, G.: Confidence calibration on multiclass classification in medical imaging. In: 2020 IEEE International Conference on Data Mining (ICDM), pp. 1364–1369 (2020)
4. Yang, W., Yang, Y.: A stabilized dense network approach for high-dimensional prediction. In: 2021 International Joint Conference on Neural Networks (IJCNN), pp. 1–8 (2021)
5. Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., Houlsby, N.: An image is worth $$16\times 16$$ words: Transformers for image recognition at scale. In: International Conference on Learning Representations (2021)