1. Agarwal, A., & Lavie, A. (2008). METEOR, M-BLEU and M-TER: Flexible Matching and Parameter Tuning for High-Correlation with Human Judgments of Machine Translation Quality. In Proceedings of the ACL2008 Workshop on Statistical Machine Translation. Columbus, Ohio, USA.
2. Albrecht, J. S., & Hwa, R. (2007a). A re-examination of machine learning approaches for sentence-level MT evaluation. In Proceedings of the 45th annual meeting of the association for computational linguistics (ACL), Prague, Czech Republic (pp. 880–887).
3. Albrecht, J. S., & Hwa, R. (2007b) Regression for sentence-level MT evaluation with pseudo references. In Proceedings of the 45th annual meeting of the association for computational linguistics, Prague, Czech Republic (pp. 296–303).
4. Atserias, J., Blanco, R., Chenlo, J. M., & Rodriguez, C. (2012). FBM-Yahoo at RepLab 2012. CLEF (Online Working Notes/Labs/Workshop).
5. Banerjee, S., & Lavie, A. (2005). METEOR: An automatic metric for MT evaluation with improved correlation with human judgments. In Proceedings of ACL workshop on intrinsic and extrinsic evaluation measures for MT and/or summarization, Michigan, USA.