Author:
Talla D.,John L.K.,Burger D.
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Subject
Computational Theory and Mathematics,Hardware and Architecture,Theoretical Computer Science,Software
Cited by
49 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. A Journey of a 1,000 Kernels Begins with a Single Step: A Retrospective of Deep Learning on GPUs;Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2;2024-04-27
2. Tandem Processor: Grappling with Emerging Operators in Neural Networks;Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2;2024-04-27
3. SIMD programming using Intel vector extensions;Journal of Parallel and Distributed Computing;2020-01
4. Optimizing data permutations in structured loads/stores translation and SIMD register mapping for a cross-ISA dynamic binary translator;Journal of Systems Architecture;2019-09
5. Automated Compiler Optimization of Multiple Vector Loads/Stores;International Journal of Parallel Programming;2017-01-09