1. Juice: A large scale distantly supervised dataset for open domain context-based code generation;Agashe Rajas;Retrieved from https://arXiv:1910.02216,2019
2. Wasi Ahmad, Saikat Chakraborty, Baishakhi Ray, and Kai-Wei Chang. 2021. Unified pre-training for program understanding and generation. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2655–2668.
3. AVATAR: A parallel corpus for Java-Python program translation;Ahmad Wasi Uddin;Retrieved from https://arXiv:2108.11590,2021
4. Miltiadis Allamanis. 2019. The adverse effects of code duplication in machine learning models of code. In Proceedings of the ACM SIGPLAN International Symposium on New Ideas, New Paradigms, and Reflections on Programming and Software. 143–153.