1. Taraghi, M., Dorcelus, G., Foundjem, A., Tambon, F., Khomh, F.: Deep learning model reuse in the huggingface community: challenges, benefit and trends. arXiv preprint arXiv:2401.13177 (2024)
2. Kossen, J., Farquhar, S., Gal, Y., Rainforth, T.: Active testing: sample-efficient model evaluation. In: International Conference on Machine Learning. PMLR, pp. 5753–5763 (2021)
3. Kossen, J., Farquhar, S., Gal, Y., Rainforth, T.: Active surrogate estimators: an active learning approach to label-efficient model evaluation. In: Advances in Neural Information Processing Systems, vol. 35, pp. 24 557–24 570 (2022)
4. Raschka, S.: Model evaluation, model selection, and algorithm selection in machine learning (2020)
5. Zheng, A., Shelby, N., Volckhausen, E.: Evaluating machine learning models. In: Machine Learning in the AWS Cloud (2019)