Affiliation:
1. Chinese Academy of Sciences, Beijing, China
2. Georgia Institute of Technology, Atlanta, GA, USA
Funder
the National Key Research and Development Program of China
National Natural Science Foundation of China
the joint deep learning lab of Institute of Computing Technology and Sugon and CAS Holdings
National 863 Foundation of China
Reference34 articles.
1. Envytools. https://github.com/envytools/envytools. Envytools. https://github.com/envytools/envytools.
2. AMD. clMath. https://github.com/clMathLibraries. AMD. clMath. https://github.com/clMathLibraries.
3. Extending the ArchC Language for Automatic Generation of Assemblers
Cited by
25 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Kernel fusion in atomistic spin dynamics simulations on Nvidia GPUs using tensor core;Journal of Computational Science;2024-09
2. Ensino de Software Pipelining e Escalonamento em GPUs com Python no Google Colab;International Journal of Computer Architecture Education;2023-12-01
3. ReFloat: Low-Cost Floating-Point Processing in ReRAM for Accelerating Iterative Linear Solvers;Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis;2023-11-11
4. rNdN: Fast Query Compilation for NVIDIA GPUs;ACM Transactions on Architecture and Code Optimization;2023-07-19
5. Fast All-Pairs Shortest Paths Algorithm in Large Sparse Graph;Proceedings of the 37th International Conference on Supercomputing;2023-06-21