1. Allwein, E. L., Schapire, R. E., & Singer, Y. (2001). Reducing multiclass to binary: A unifying approach for margin classifiers. Journal of Machine Learning Research, 1, 113–141.
2. Bengio, S., Weston, J., & Grangier, D. (2010). Label embedding trees for large multi-class tasks. In NIPS (pp. 163–171).
3. Beygelzimer, A., Langford, J., Lifshits, Y., Sorkin, G., & Strehl, A. (2009). Conditional probability tree estimation analysis and algorithms. In UAI (pp. 51–58).
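4. Breiman, L., Friedman, J. H., Olshen, R. A., & Stone, C. J. (1984). Classification and regression trees. Wadsworth Statistics/Probability Series. Belmont, CA: Wadsworth.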
5. Chang, C.-C., & Lin, C.-J. (2011). LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2(3), 27:1–27:27.