1. Optimizing nonzero-based sparse matrix partitioning models via reducing latency. Journal of Parallel and Distributed Computing, 2018.
2. A three-dimensional approach to parallel matrix multiplication. IBM Journal of Research and Development, 1995.
3. Communication complexity of PRAMs. Theoretical Computer Science, 1990.
4. Numerical linear algebra on emerging architectures: the PLASMA and MAGMA projects. Journal of Physics: Conference Series, 2009.
5. Multi-ML: Programming multi-BSP algorithms in ML. International Journal of Parallel Programming, 2017.