1. Baldridge, J., Morton, T., & Bierner, G. (2001). OpenNLP package. URL http://opennlp.sourceforge.net/
2. Bertsekas, D. P. (1999). Nonlinear programming (2nd edn.). Belmont: Athena Scientific.
3. Chang, K.-W., Hsieh, C.-J., & Lin, C.-J. (2008). Coordinate descent method for large-scale L2-loss linear SVM. Journal of Machine Learning Research, 9, 1369–1398.
4. Collins, M., Globerson, A., Koo, T., Carreras, X., & Bartlett, P. (2008). Exponentiated gradient algorithms for conditional random fields and max-margin Markov networks. Journal of Machine Learning Research, 9, 1775–1822.
5. Crammer, K., & Singer, Y. (2000). On the learnability and design of output codes for multiclass problems. In Computational learning theory (pp. 35–46).