1. A Survey of Design and Optimization for Systolic Array-based DNN Accelerators;ACM Computing Surveys;2023-08-25
2. High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs;Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming;2023-02-21
3. Sigma: Compiling Einstein Summations to Locality-Aware Dataflow;Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2;2023-01-27
4. VeyMont: Parallelising Verified Programs Instead of Verifying Parallel Programs;Formal Methods;2023
5. Optimizing GPU Deep Learning Operators with Polyhedral Scheduling Constraint Injection;2022 IEEE/ACM International Symposium on Code Generation and Optimization (CGO);2022-04-02