1. Bengio, Y., Léonard, N., & Courville, A. (2013). Estimating or propagating gradients through stochastic neurons for conditional computation. arXiv:1308.3432.
2. Brock, A., Lim, T., Ritchie, J. M., & Weston, N. (2017). Smash: One-shot model architecture search through hypernetworks. arXiv preprint arXiv:1708.05344.
3. Bulat, A., Martinez, B., & Tzimiropoulos, G. (2020a). Bats: Binary architecture search. In Proc. of ECCV pp. 309–325.
4. Bulat, A., Martinez, B., & Tzimiropoulos, G. (2020b). High-capacity expert binary networks. arXiv:2010.03558.
5. Bulat, A., & Tzimiropoulos, G. (2019). Xnor-net++: Improved binary neural networks. In Proc. of BMVC pp. 1–12.