1. Milton Allamanis. [n.d.]. CodeSearchNet Deduplication Algorithm. https://github.com/github/CodeSearchNet/blob/master/src/dataextraction/dedup_split.py. Milton Allamanis. [n.d.]. CodeSearchNet Deduplication Algorithm. https://github.com/github/CodeSearchNet/blob/master/src/dataextraction/dedup_split.py.
2. The adverse effects of code duplication in machine learning models of code
3. Uri Alon , Roy Sadaka , Omer Levy , and Eran Yahav . 2020 . Structural language models of code . In International Conference on Machine Learning. PMLR, 245--256 . Uri Alon, Roy Sadaka, Omer Levy, and Eran Yahav. 2020. Structural language models of code. In International Conference on Machine Learning. PMLR, 245--256.
4. Anon. 2022. Replication Package https://github.com/DLCloning/DLClone.git. Anon. 2022. Replication Package https://github.com/DLCloning/DLClone.git.
5. Gareth Ari Aye and Gail E Kaiser . 2020. Sequence Model Design for Code Completion in the Modern IDE. arXiv preprint arXiv:2004.05249 ( 2020 ). Gareth Ari Aye and Gail E Kaiser. 2020. Sequence Model Design for Code Completion in the Modern IDE. arXiv preprint arXiv:2004.05249 (2020).