1. Lecture Notes in Computer Science;K Ahmed,2016
2. Bao, H., Dong, L., Wei, F.: BEiT: BERT pre-training of image transformers (2021)
3. Bernstein, J., Vahdat, A., Yue, Y., Liu, M.-Y.: On the distance between two neural networks and the stability of learning. In: Advances in Neural Information Processing Systems, vol. 33, pp. 21370–21381 (2020)
4. Beyer, L., Hénaff, O.J., Kolesnikov, A., Zhai, X., van den Oord, A.: Are we done with ImageNet? (2020)
5. Brock, A., De, S., Smith, S.L., Simonyan, K.: High-performance large-scale image recognition without normalization (2021)