1. Large scale distributed neural network training through online distillation;Anil,2018
2. Learning from rules generalizing labeled exemplars;Awasthi,2020
3. Y. Bengio, J. Louradour, R. Collobert, J. Weston, Curriculum learning, International Conference on Machine Learning (ICML).
4. Data programming using continuous and quality-guided labeling functions;Chatterjee,2020
5. D. Chen, J. Mei, H. Zhang, C. Wang, Y. Feng, C. Chen, Knowledge Distillation with the Reused Teacher Classifier, IEEE/CVF Conference on Computer Vision and Pattern, Recognition (CVPR).