1. BrickDL: Graph-Level Optimizations for DNNs with Fine-Grained Data Blocking on GPUs;Proceedings of the 53rd International Conference on Parallel Processing;2024-08-12
2. Leveraging the High Bandwidth of Last-Level Cache for HPC Seismic Imaging Applications;Proceedings of the Platform for Advanced Scientific Computing Conference;2024-06-03
3. Evaluation of Programming Models and Performance for Stencil Computation on GPGPUs;2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW);2024-05-27
4. Performance Portability Evaluation of Blocked Stencil Computations on GPUs;Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis;2023-11-12
5. Scalable Distributed High-Order Stencil Computations;SC22: International Conference for High Performance Computing, Networking, Storage and Analysis;2022-11