1. Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks;hoefler;J Mach Learn Res,2021
2. Efficient MPI-Allreduce for large-scale deep learning on GPU-clusters;truong;Concurrency and Computation: Practice and Experience,2019
3. Long Short-Term Memory
4. Improving the Performance of Collective Operations in MPICH
5. GPipe: Efficient training of giant neural networks using pipeline parallelism;huang;Advances in Neural Information Processing Systems,2019