1. Bakunas-Milanowski, D.: Efficient algorithms for stream compaction on GPUs. Int. J. Netw. Comput. (IJNC) 7, 208–226 (2017)
2. Chen, T., et al.: Training deep nets with sublinear memory cost. arXiv:1604.06174 (2016)
3. De Sa, C., et al.: High-accuracy low-precision training. arXiv:1803.03383 (2018)
4. Ginsburg, B., et al.: NVIDIA mixed precision training on volta GPUs. In: GPU Technology Conference (2017)
5. Gomez, A.N., et al.: The reversible residual network: backpropagation without storing activations. In: Advances in Neural Information Processing Systems (NIPS) (2017)