GPU-Initiated On-Demand High-Throughput Storage Access in the BaM System Architecture

Published: 2023-01-27
Container-title: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2

Author:

Qureshi Zaid¹, Mailthody Vikram Sharma¹, Gelado Isaac², Min Seungwon¹, Masood Amna³, Park Jeongmin⁴, Xiong Jinjun⁵, Newburn C. J.², Vainbrand Dmitri², Chung I-Hsin⁶, Garland Michael², Dally William⁷, Hwu Wen-mei¹

Affiliation:

1. University of Illinois at Urbana-Champaign, USA / NVIDIA, USA

2. NVIDIA, USA

3. University of Illinois at Urbana-Champaign, USA / AMD, USA

4. University of Illinois at Urbana-Champaign, USA

5. University at Buffalo, USA

6. IBM Research, USA

7. NVIDIA, USA / Stanford University, USA

Funder

IBM-ILLINOIS C3SR

IBM-ILLINOIS Discovery Accelerator Institute

Nvidia

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3575693.3575748

References: 74 articles.

1. B. Acun, M. Murphy, X. Wang, J. Nie, C. Wu, and K. Hazelwood. 2021. Understanding Training Efficiency of Deep Learning Recommendation Models at Scale. In 2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA’21). IEEE Computer Society, Los Alamitos, CA, USA. 802–814.

2. DCS

3. AMD. 2021. RADEON-SSG API Manual. https://www.amd.com/system/files/documents/ssg-api-user-manual.pdf

4. Jens Axboe. 2020. Efficient IO with io_uring.

5. 2022. BaM GitHub Repository. https://github.com/ZaidQureshi/bam

Cited by 8 articles.

1. OUTRE: An OUT-of-Core De-REdundancy GNN Training Framework for Massive Graphs within A Single Machine;Proceedings of the VLDB Endowment;2024-07

2. Neos: A NVMe-GPUs Direct Vector Service Buffer in User Space;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13

3. GMT: GPU Orchestrated Memory Tiering for the Big Data Era;Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3;2024-04-27

4. Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System;2024 IEEE International Symposium on High-Performance Computer Architecture (HPCA);2024-03-02

5. H3DM: A High-bandwidth High-capacity Hybrid 3D Memory Design for GPUs;Proceedings of the ACM on Measurement and Analysis of Computing Systems;2024-02-16