1. S. Ma , R. Bassily , M. Belkin , “The power of interpolation: Understanding the effectiveness of SGD in modern over-parametrized learning” in International Conference on Machine Learning (Proceedings of Machine Learning Research [PMLR], Cambridge, MA, 2018), pp. 3325–3334.
2. M. Belkin , A. Rakhlin , A. B. Tsybakov , “Does data interpolation contradict statistical optimality?” in The 22nd International Conference on Artificial Intelligence and Statistics (Proceedings of Machine Learning Research [PMLR], Cambridge, MA, 2019), pp. 1611–1619.
3. M. Belkin , D. J. Hsu , P. Mitra , “Overfitting or perfect fitting? Risk bounds for classification and regression rules that interpolate” in Advances in Neural Information Processing Systems (Curran, Vancouver, Canada, 2018), pp. 2300–2311.
4. Reconciling modern machine-learning practice and the classical bias–variance trade-off
5. M. Belkin , Beyond Empirical Risk Minimization: The Lessons of Deep Learning (colloquium recording, MIT CBMM, 2019).