1. Medmnist v2-a large-scale lightweight benchmark for 2d and 3d biomedical image classification;Yang;Sci. Data,2023
2. A survey on evaluation of large language models;Chang;ACM Trans. Intell. Syst. Technol.,2024
3. Radford, A., Kim, J.W., Xu, T., Brockman, G., McLeavey, C., and Sutskever, I. (2023, January 23–29). Robust speech recognition via large-scale weak supervision. Proceedings of the 40th International Conference on Machine Learning, Honolulu, HI, USA.
4. Szegedy, C., Zaremba, W., Sutskever, I., Bruna, J., Erhan, D., Goodfellow, I., and Fergus, R. (2013). Intriguing properties of neural networks. arXiv.
5. Goodfellow, I.J., Shlens, J., and Szegedy, C. (2014). Explaining and harnessing adversarial examples. arXiv.