1. Rohan Anil, Gabriel Pereyra, Alexandre Passos, Robert Ormandi, George E Dahl, and Geoffrey E Hinton. 2018. Large scale distributed neural network training through online distillation. ICLR.
2. Mehdi Bahri, Gaétan Bahl, and Stefanos Zafeiriou. 2021. Binary Graph Neural Networks. In CVPR. 9492--9501.
3. Ron Banner, Itay Hubara, Elad Hoffer, and Daniel Soudry. 2018. Scalable methods for 8-bit training of neural networks. NeurIPS, Vol. 31.
4. Yoshua Bengio, Nicholas Léonard, and Aaron Courville. 2013. Estimating or propagating gradients through stochastic neurons for conditional computation. arXiv.
5. Rianne van den Berg, Thomas N Kipf, and Max Welling. 2017. Graph convolutional matrix completion. arXiv preprint arXiv:1706.02263.