Author:
Abdelfattah Ahmad,Tomov Stanimire,Dongarra Jack
Cited by
26 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Optimizing Full-Spectrum Matrix Multiplications on ARMv8 Multi-Core CPUs;IEEE Transactions on Parallel and Distributed Systems;2024-03
2. Stability Analysis and Performance Evaluation of Additive Mixed-Precision Runge-Kutta Methods;Communications on Applied Mathematics and Computation;2023-12-21
3. A Comprehensive Evaluation of Novel AI Accelerators for Deep Learning Workloads;2022 IEEE/ACM International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS);2022-11
4. Optimizing small channel 3D convolution on GPU with tensor core;Parallel Computing;2022-10
5. Seamless optimization of the GEMM kernel for task-based programming models;Proceedings of the 36th ACM International Conference on Supercomputing;2022-06-28