1. Kamel Alrashedy. 2023. Language Models are Better Bug Detector Through Code-Pair Classification. arXiv preprint arXiv:2311.07957 (2023).
2. Saleema Amershi, Dan Weld, Mihaela Vorvoreanu, Adam Fourney, Besmira Nushi, Penny Collisson, Jina Suh, Shamsi Iqbal, Paul N. Bennett, Kori Inkpen, Jaime Teevan, Ruth Kikin-Gil, and Eric Horvitz. 2019. Guidelines for Human-AI Interaction. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (CHI ’19).
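3. Shraddha Barke, Michael B. James, and Nadia Polikarpova. 2023. Grounded Copilot: How Programmers Interact with Code-Generating Models. Proceedings of the ACM on Programming Languages 7, OOPSLA1 (2023).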
4. Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde de Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, et al. 2021. Evaluating large language models trained on code. arXiv preprint arXiv:2107.03374 (2021).
5. Xinyun Chen, Maxwell Lin, Nathanael Schärli, and Denny Zhou. 2023. Teaching large language models to self-debug. arXiv preprint arXiv:2304.05128 (2023).