1. Abu-Mostafa, Y. S., Magdon-Ismail, M., & Lin, H. T. (2012). Learning from data. AMLBook.
2. Balcan, M. F., Blum, A., & Srebro, N. (2008). A theory of learning with similarity functions. Machine Learning, 72(1–2), 89–112.
3. Bartlett, P. L. (1997). For valid generalization, the size of the weights is more important than the size. Advances in Neural Information Processing Systems (NIPS), 9, 134.
4. Lichman, M. (2013). UCI machine learning repository. Irvine, CA: University of California, School of Information and Computer Sciences. http://archive.ics.uci.edu/ml
5. Boser, B. E., Guyon, I., & Vapnik, V. (1992) A training algorithm for optimal margin classifiers. In Fifth annual workshop on computational learning theory (pp. 144–152).