1. Reducing multiclass to binary: a unifying approach for margin classifiers;Allwein;Journal of Machine Learning Research,2000
2. Methods of information geometry;Amari,2000
3. Bartlett, P. L., Jordan, M. I., & McAuliffe, J. D. (2003). Convexity, classification, and risk bounds. Technical Report 638, Department of Statistics, University of California, Berkeley.
4. Sparseness vs estimating conditional probabilities: some asymptotic results;Bartlett;Journal of Machine Learning Research,2007
5. AdaBoost is consistent;Bartlett;Journal of Machine Learning Research,2007