Optimizing CPU Performance for Recommendation Systems At-Scale

Published: 2023-06-17
Container-title: Proceedings of the 50th Annual International Symposium on Computer Architecture

Author:

Jain Rishabh¹, Cheng Scott¹, Kalagi Vishwas¹, Sanghavi Vrushabh², Kaul Samvit³, Arunachalam Meena², Maeng Kiwan¹, Jog Adwait⁴⁵, Sivasubramaniam Anand¹, Kandemir Mahmut Taylan¹, Das Chita R.¹

Affiliation:

1. Pennsylvania State University, University Park, PA, USA

2. Intel, Portland, USA

3. Intel, Folsom, USA

4. William & Mary, Williamsburg, VA, USA

5. University of Virginia, Charlottesville, VA, USA

Funder

National Science Foundation

NSF (National Science Foundation) Chameleon Cloud

Defense Advanced Research Projects Agency

Publisher

ACM

References (76 articles)

1. Bilge Acun, Matthew Murphy, Xiaodong Wang, Jade Nie, Carole-Jean Wu, and Kim Hazelwood. 2021. Understanding training efficiency of deep learning recommendation models at scale. In 2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA). IEEE, 802-814.

2. Software prefetching for indirect memory accesses

3. AMD. 2021. AMD EPYC 7763. https://www.amd.com/en/products/cpu/amdepyc-7763

4. AMD. 2022. AMD Zen3 3D V-Cache. https://www.amd.com/en/pressreleases/2022-03-21-3rd-gen-amd-epyc-processors-amd-3d-v-cachetechnology-deliver-outstanding

5. Analysis and Optimization of the Memory Hierarchy for Graph Processing Workloads

Cited by 6 articles.

1. ElasticRec: A Microservice-based Model Serving Architecture Enabling Elastic Resource Scaling for Recommendation Models;2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA);2024-06-29

2. Accelerating Large-Scale DLRM Inference through Dynamic Hot Data Rearrangement;2024 IEEE International Symposium on Circuits and Systems (ISCAS);2024-05-19

3. MPC-Wrapper: Fully Harnessing the Potential of Samsung Aquabolt-XL HBM2-PIM on FPGAs;2024 IEEE 32nd Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM);2024-05-05

4. Providing scalable single‐operating‐system NUMA abstraction of physically discrete resources;ETRI Journal;2024-01-16

5. Application-Aware Resource Allocation Based on Benefit–Cost Ratio in Computing Power Network with Heterogeneous Computing Resources;Photonics;2023-11-17