1. Brown, T.B., et al.: Language models are few-shot learners (2020). arXiv: 2005.14165 [cs.CL]
2. Schwartz, R., Dodge, J., Smith, N.A., Etzioni, O.: Green AI. (2019). arXiv: 1907.10597 [cs.CY]
3. Oh, K.-S., Jung, K.: GPU implementation of neural networks. Pattern Recogn. 37(6), 1311–1314 (2004)
4. Micikevicius, P., et al.: Mixed precision training. arXiv preprint arXiv:1710.03740 (2017)
5. Jouppi, N.P., et al.: In-datacenter performance analysis of a tensor processing unit. In: Proceedings of the 44th Annual International Symposium on Computer Architecture, pp. 1–12 (2017)