1. B. Acun , M. Murphy , X. Wang , J. Nie , C. Wu , and K. Hazelwood . 2021. Understanding Training Efficiency of Deep Learning Recommendation Models at Scale . In 2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA’21) . IEEE Computer Society, Los Alamitos, CA, USA. 802–814. B. Acun, M. Murphy, X. Wang, J. Nie, C. Wu, and K. Hazelwood. 2021. Understanding Training Efficiency of Deep Learning Recommendation Models at Scale. In 2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA’21). IEEE Computer Society, Los Alamitos, CA, USA. 802–814.
2. DCS
3. AMD. 2021. RADEON-SSG API Manual. https://www.amd.com/system/files/documents/ssg-api-user-manual.pdf AMD. 2021. RADEON-SSG API Manual. https://www.amd.com/system/files/documents/ssg-api-user-manual.pdf
4. Jens Axboe. 2020. Efficient IO with io_uring. Jens Axboe. 2020. Efficient IO with io_uring.
5. 2022. BaM GitHub Repository. https://github.com/ZaidQureshi/bam 2022. BaM GitHub Repository. https://github.com/ZaidQureshi/bam