1. Optimizing Irregular Communication with Neighborhood Collectives and Locality-Aware Parallelism;Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis;2023-11-12
2. Using Mixed-Radix Decomposition to Enumerate Computational Resources of Deeply Hierarchical Architectures;Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis;2023-11-12
3. Optimizing MPI Collectives on Shared Memory Multi-Cores;Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis;2023-11-11
4. Invited Paper: Benchmarking and Optimizing Data Movement on Emerging Heterogeneous Architectures;2023 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW);2023-05
5. Threshold Pivoting for Dense LU Factorization;2022 IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH);2022-11