Sextans: A Streaming Accelerator for General-Purpose Sparse-Matrix Dense-Matrix Multiplication

Author:

Song Linghao1,Chi Yuze1,Sohrabizadeh Atefeh1,Choi Young-kyu2,Lau Jason1,Cong Jason1

Affiliation:

1. University of California, Los Angeles, Los Angeles, CA, USA

2. Inha University, Incheon, South Korea

Funder

Xilinx XACC Program

National Science Foundation

CDSC industrial partners (https://cdsc.ucla.edu/partners)

Publisher

ACM

Reference92 articles.

1. A scalable processing-in-memory accelerator for parallel graph processing

2. Aman Arora , Samidh Mehta , Vaughn Betz , and Lizy K. John . 2021. Tensor Slices to the Rescue: Supercharging ML Acceleration on FPGAs . In The 2021 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays . 23--33 . Aman Arora, Samidh Mehta, Vaughn Betz, and Lizy K. John. 2021. Tensor Slices to the Rescue: Supercharging ML Acceleration on FPGAs. In The 2021 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays . 23--33.

3. Implementing sparse matrix-vector multiplication on throughput-oriented processors

4. Maciej Besta , Raghavendra Kanakagiri , Grzegorz Kwasniewski , Rachata Ausavarungnirun , Jakub Beránek , Konstantinos Kanellopoulos , Kacper Janda , Zur Vonarburg-Shmaria , Lukas Gianinazzi , Ioana Stefan , Juan Gómez Luna , Jakub Golinowski , Marcin Copik , Lukas Kapp-Schwoerer , Salvatore Di Girolamo , Nils Blach , Marek Konieczny , Onur Mutlu , and Torsten Hoefler . 2021 . SISA: Set-Centric Instruction Set Architecture for Graph Mining on Processing-in-Memory Systems. In MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture. 282--297 . Maciej Besta, Raghavendra Kanakagiri, Grzegorz Kwasniewski, Rachata Ausavarungnirun, Jakub Beránek, Konstantinos Kanellopoulos, Kacper Janda, Zur Vonarburg-Shmaria, Lukas Gianinazzi, Ioana Stefan, Juan Gómez Luna, Jakub Golinowski, Marcin Copik, Lukas Kapp-Schwoerer, Salvatore Di Girolamo, Nils Blach, Marek Konieczny, Onur Mutlu, and Torsten Hoefler. 2021. SISA: Set-Centric Instruction Set Architecture for Graph Mining on Processing-in-Memory Systems. In MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture. 282--297.

5. Damla Senol Cali , Gurpreet S. Kalsi , Zülal Bingöl , Can Firtina , Lavanya Subramanian , Jeremie S. Kim , Rachata Ausavarungnirun , Mohammed Alser , Juan Gomez-Luna , Amirali Boroumand , 2020 . GenASM: A High-Performance , Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis. In 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO). IEEE, 951--966 . Damla Senol Cali, Gurpreet S. Kalsi, Zülal Bingöl, Can Firtina, Lavanya Subramanian, Jeremie S. Kim, Rachata Ausavarungnirun, Mohammed Alser, Juan Gomez-Luna, Amirali Boroumand, et almbox. 2020. GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis. In 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO). IEEE, 951--966.

Cited by 26 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine;2023 International Conference on Field Programmable Technology (ICFPT);2023-12-12

2. Algorithm/Hardware Co-Optimization for Sparsity-Aware SpMM Acceleration of GNNs;IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems;2023-12

3. An Efficient Gustavson-Based Sparse Matrix–Matrix Multiplication Accelerator on Embedded FPGAs;IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems;2023-12

4. ReFloat: Low-Cost Floating-Point Processing in ReRAM for Accelerating Iterative Linear Solvers;Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis;2023-11-11

5. SAGA: Sparsity-Agnostic Graph Convolutional Network Acceleration with Near-Optimal Workload Balance;2023 IEEE/ACM International Conference on Computer Aided Design (ICCAD);2023-10-28

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3