1. Dennis Abts et al. 2020. Think Fast: A Tensor Streaming Processor (TSP) for Accelerating Deep Learning Workloads. In Proceedings of the 47th ACM/IEEE Annual International Symposium on Computer Architecture (ISCA). IEEE, 145--158.
2. Ravichandra Addanki, Shaileshh Bojja Venkatakrishnan, Shreyan Gupta, Hongzi Mao, and Mohammad Alizadeh. 2019. Learning Generalizable Device Placement Algorithms for Distributed Machine Learning. In Advances in Neural Information Processing Systems (NeurIPS), Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d'Alché-Buc, Emily B. Fox, and Roman Garnett (Eds.). OpenReview.net, Vancouver, BC, Canada, 3983--3993.
3. Byung Hoon Ahn, Jinwon Lee, Jamie Menjay Lin, Hsin-Pai Cheng, Jilei Hou, and Hadi Esmaeilzadeh. 2020. Ordering Chaos: Memory-Aware Scheduling of Irregularly Wired Neural Networks for Edge Devices. In Proceedings of Machine Learning and Systems (MLSys), Inderjit S. Dhillon, Dimitris S. Papailiopoulos, and Vivienne Sze (Eds.). mlsys.org, Austin, TX, USA, 1--14.
4. Manoj Alwani, Han Chen, Michael Ferdman, and Peter Milder. 2016. Fused-Layer CNN Accelerators. In Proceedings of the 49th IEEE/ACM International Symposium on Microarchitecture (MICRO). IEEE Computer Society, Taipei, Taiwan, 1--12.
5. Arteris. 2022. Arteris IP Homepage. https://www.arteris.com.