1. [Agg18] C. Aggarwal. Neural Networks and Deep Learning: A Textbook. Springer, 2018.
2. [Bis06] C. Bishop. Pattern Recognition and Machine Learning. Springer, 2006.
3. [Bro20] T. B. Brown, B. Mann, N. Ryder, M. Subbiah et al. Language Models are Few-Shot Learners, 2020. arXiv:2005.14165.
4. [Bru05] Bruce. Bella the Saint-Hubert Bloodhound relaxes. https://commons.wikimedia.org/wiki/File:Bella_the_Saint-Hubert_Bloodhound_relaxes.jpg, 2005. CC BY 2.0: https://creativecommons.org/licenses/by/2.0/legalcode.
5. [Cyb89] G. Cybenko. Approximation by superpositions of a sigmoidal function. Mathematics of Control, Signals and Systems, 2(4):303–314, 1989.