Author:
Yotov K.,Xiaoming Li ,Gang Ren ,Garzaran M.J.S.,Padua D.,Pingali K.,Stodghill P.
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Subject
Electrical and Electronic Engineering
Cited by
69 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Cache Optimization and Performance Modeling of Batched, Small, and Rectangular Matrix Multiplication on Intel, AMD, and Fujitsu Processors;ACM Transactions on Mathematical Software;2023-09-19
2. High-Performance Matrix Multiplication on the New Generation Shenwei Processor;2022 IEEE 24th Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys);2022-12
3. TileSpGEMM;Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming;2022-03-28
4. Shrinking Sample Search Algorithm for Automatic Tuning of GPU Kernels;2021 IEEE 28th International Conference on High Performance Computing, Data, and Analytics (HiPC);2021-12
5. Predictive data locality optimization for higher-order tensor computations;Proceedings of the 5th ACM SIGPLAN International Symposium on Machine Programming;2021-06-20