1. Bogdan Babych and Anthony Hartley. 2008. Sensitivity of Automated MT Evaluation Metrics on Higher Quality MT Output: BLEU vs Task-Based Evaluation Methods. In LREC, Vol. 2008. ELRA, Marrakech, Morocco, 6.
2. Anja Belz and Ehud Reiter. 2006. Comparing automatic and human evaluation of NLG systems. In Proc. of EACL. ACL, Trento, Italy, 313--320.
3. Christian Bird, Nachiappan Nagappan, Brendan Murphy, Harald Gall, and Premkumar Devanbu. 2011. Don't touch my code! Examining the effects of ownership on software quality. In Proc. of ESEC/FSE. ACM, New York, NY, USA, 4--14.
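4. Amiangshu Bosu, Michaela Greiler, and Christian Bird. 2015. Characteristics of Useful Code Reviews: An Empirical Study at Microsoft. In Proc. of MSR. IEEE, Florence, Italy.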
5. Chris Callison-Burch, Miles Osborne, and Philipp Koehn. 2006. Re-evaluating the role of BLEU in machine translation research. In Proc. of EACL. ACL, Trento, Italy, 249--256.