1. PML-MPI: A Pre-Trained ML Framework for Efficient Collective Algorithm Selection in MPI;2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW);2024-05-27
2. MSCCLang: Microsoft Collective Communication Language;Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2;2023-01-27
3. Optimizing All-to-All Collective Communication on Tianhe Supercomputer;2022 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom);2022-12
4. Synthesizing optimal collective algorithms;Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming;2021-02-17
5. Improving the efficiency of HPC data movement on container-based virtual cluster;CCF Transactions on High Performance Computing;2020-03