1. T. Mitchell, The need for biases in learning generalizations. Technical Report CBM-TR 5-110 (Rutgers University, New Brunswick, 1980)
2. W. Rudin, Principles of Mathematical Analysis, 3rd edn. (McGraw-Hill, New York, 1976)
3. G. Strang, Introduction to Linear Algebra, 5th edn. (Wellesley-Cambridge Press, Wellesley, MA, 2016)
4. S. Sra, S. Nowozin, S.J. Wright (eds.), Optimization for Machine Learning (MIT Press, Cambridge, 2012)
5. L. Pitt, L.G. Valiant, Computational limitations on learning from examples. J. ACM 35(4), 965–984 (1988)