1. 2023. Repository for SimClone. https://zenodo.org/record/7613379#.Y-FksuzMJQ0
2. Qurat Ul Ain, Wasi Haider Butt, Muhammad Waseem Anwar, Farooque Azam, and Bilal Maqbool. 2019. A systematic review on code clone detection. IEEE access 7 (2019), 86121–86144.
3. Ibrahim Alabdulmohsin, Jessica Schrouff, and Oluwasanmi Koyejo. 2022. A reduction to binary approach for debiasing multiclass datasets. arXiv preprint arXiv:2205.15860 (2022).
4. The adverse effects of code duplication in machine learning models of code
5. A survey on data leakage prevention systems