1. Toufique Ahmed, Kunal Suresh Pai, Premkumar Devanbu, and Earl T. Barr. 2024. Automatic Semantic Augmentation of Language Model Prompts (for Code Summarization). In 2024 IEEE/ACM 45th International Conference on Software Engineering (ICSE).
2. Anthropic (2023). 2023. Claude 2. https://www.anthropic.com/index/claude-2
3. Jiuhai Chen Lichang Chen Heng Huang and Tianyi Zhou. 2023. When do you need Chain-of-Thought Prompting for ChatGPT? arxiv:2304.03262 arXiv:2304.03262 [cs]
4. Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde de Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, and Greg Brockman. 2021. Evaluating large language models trained on code. arXiv preprint arXiv:2107.03374.
5. Xinyun Chen Maxwell Lin Nathanael Schärli and Denny Zhou. 2023. Teaching Large Language Models to Self-Debug. arxiv:2304.05128