1. Efficient Algorithm Design of Optimizing SpMV on GPU;Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing;2023-08-07
2. Compiling KB-sized machine learning models to tiny IoT devices;Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation;2019-06-08
3. Merge-Based Parallel Sparse Matrix-Sparse Vector Multiplication with a Vector Architecture;2018 IEEE 20th International Conference on High Performance Computing and Communications; IEEE 16th International Conference on Smart City; IEEE 4th International Conference on Data Science and Systems (HPCC/SmartCity/DSS);2018-06