Author:
Tan Guangming,Li Linchuan,Triechle Sean,Phillips Everett,Bao Yungang,Sun Ninghui
Cited by
42 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. FlexGEMM: A Flexible Micro-kernel Generation Framework;Proceedings of the 5th International Conference on Computer Information and Big Data Applications;2024-04-26
2. General Matrix Multiplication (GEMM) Evaluation on Cyclone-V SoC FPGA Using OpenCL;2023 International Conference on Radar, Antenna, Microwave, Electronics, and Telecommunications (ICRAMET);2023-11-15
3. TurboGNN: Improving the End-to-End Performance for Sampling-Based GNN Training on GPUs;IEEE Transactions on Computers;2023-09-01
4. Fast All-Pairs Shortest Paths Algorithm in Large Sparse Graph;Proceedings of the 37th International Conference on Supercomputing;2023-06-21
5. Accelerating Deep Neural Networks on Mobile Multicore NPUs;Proceedings of the 21st ACM/IEEE International Symposium on Code Generation and Optimization;2023-02-17