1. Design and Implementation of an IPC-based Collective MPI Library for Intel GPUs;Practice and Experience in Advanced Research Computing 2024: Human Powered Computing;2024-07-17
2. On the Performance Portability of OpenACC, OpenMP, Kokkos and RAJA;International Conference on High Performance Computing in Asia-Pacific Region;2022-01-07
3. Revisiting a Metric for Performance Portability;2021 International Workshop on Performance, Portability and Productivity in HPC (P3HPC);2021-11
4. Performance Study of GPU applications using SYCL and CUDA on Tesla V100 GPU;2021 IEEE High Performance Extreme Computing Conference (HPEC);2021-09-20
5. Evaluating the Performance of Integer Sum Reduction in SYCL on GPUs;50th International Conference on Parallel Processing Workshop;2021-08-09