Author:
Choquette Jack,Lee Edward,Krashinsky Ronny,Balan Vishnu,Khailany Brucek
Cited by
30 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. ABS: Accumulation Bit-Width Scaling Method for Designing Low-Precision Tensor Core;IEEE Transactions on Very Large Scale Integration (VLSI) Systems;2024-09
2. A Low-Cost Floating-Point FMA Unit Supporting Package Operations for HPC-AI Applications;IEEE Transactions on Circuits and Systems II: Express Briefs;2024-07
3. sys-sage: A Unified Representation of Dynamic Topologies & Attributes on HPC Systems;Proceedings of the 38th ACM International Conference on Supercomputing;2024-05-30
4. PrimePar: Efficient Spatial-temporal Tensor Partitioning for Large Transformer Model Training;Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3;2024-04-27
5. CUTE: A scalable CPU-centric and Ultra-utilized Tensor Engine for convolutions;Journal of Systems Architecture;2024-04