1. Partitioned Reduction for Heterogeneous Environments;2024 32nd Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP);2024-03-20
2. Efficient Algorithm for All-Gather Operation in Optical Interconnect Systems;IEEE Open Journal of the Communications Society;2024
3. Verifying Performance Guidelines for MPI Collectives at Scale;Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis;2023-11-12
4. Optimizing Irregular Communication with Neighborhood Collectives and Locality-Aware Parallelism;Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis;2023-11-12
5. Optimizing MPI Collectives on Shared Memory Multi-Cores;Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis;2023-11-11