Efficient Algorithm Design of Optimizing SpMV on GPU

Author:

Chu Genshen1ORCID,He Yuanjie1ORCID,Dong Lingyu1ORCID,Ding Zhezhao1ORCID,Chen Dandan1ORCID,Bai He1ORCID,Wang Xuesong2ORCID,Hu Changjun1ORCID

Affiliation:

1. University of Science and Technology Beijing, Beijing, China

2. China Institute of Atomic Energy, Beijing, China

Funder

National Key R&D Program of China

Publisher

ACM

Reference41 articles.

1. Sarah AlAhmadi , Thaha Muhammed , Rashid Mehmood , and Aiiad Albeshri . 2020. Performance Characteristics for Sparse Matrix-Vector Multiplication on GPUs . In Smart Infrastructure and Applications, Rashid Mehmood, Simon See, Iyad Katib, and Imrich Chlamtac (Eds.). Springer International Publishing , Cham, 409--426. https://doi.org/10.1007/978--3-030--13705--2_17 Series Title: EAI/Springer Innovations in Communication and Computing. 10.1007/978--3-030--13705--2_17 Sarah AlAhmadi, Thaha Muhammed, Rashid Mehmood, and Aiiad Albeshri. 2020. Performance Characteristics for Sparse Matrix-Vector Multiplication on GPUs. In Smart Infrastructure and Applications, Rashid Mehmood, Simon See, Iyad Katib, and Imrich Chlamtac (Eds.). Springer International Publishing, Cham, 409--426. https://doi.org/10.1007/978--3-030--13705--2_17 Series Title: EAI/Springer Innovations in Communication and Computing.

2. An efficient sparse matrix-vector multiplication on CUDA-enabled graphic processing units for finite element method simulations

3. Accelerating Machine Learning on Sparse Datasets with a Distributed Memory Vector Architecture

4. Arash Ashari , Naser Sedaghati , John Eisenlohr , Srinivasan Parthasarath , and P. Sadayappan . 2014 . Fast Sparse Matrix-Vector Multiplication on GPUs for Graph Applications. In SC14: International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE , New Orleans, LA, USA, 781--792. https://doi.org/10.1109/SC. 2014 .69 10.1109/SC.2014.69 Arash Ashari, Naser Sedaghati, John Eisenlohr, Srinivasan Parthasarath, and P. Sadayappan. 2014. Fast Sparse Matrix-Vector Multiplication on GPUs for Graph Applications. In SC14: International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE, New Orleans, LA, USA, 781--792. https://doi.org/10.1109/SC.2014.69

5. Implementing sparse matrix-vector multiplication on throughput-oriented processors

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3