Performance Engineering for a Tall & Skinny Matrix Multiplication Kernel on GPUs
Author:
Publisher
Springer International Publishing
Link
http://link.springer.com/content/pdf/10.1007/978-3-030-43229-4_43
References: 12 articles.
1. Cullum, J., Donath, W.E.: A block Lanczos algorithm for computing the $q$ algebraically largest eigenvalues and a corresponding eigenspace of large, sparse, real symmetric matrices. In: 1974 IEEE Conference on Decision and Control Including the 13th Symposium on Adaptive Processes, pp. 505–509, November 1974
2. Ernst, D.: CUDA Microbenchmarks. http://tiny.cc/cudabench
3. Gropp, W.D., Kaushik, D.K., Keyes, D.E., Smith, B.F.: Towards realistic performance bounds for implicit CFD codes. In: Proceedings of Parallel CFD 1999, pp. 233–240. Elsevier (1999)
4. Lecture Notes in Computer Science;JR Herrero,2006
5. Kreutzer, M., et al.: GHOST: building blocks for high performance sparse linear algebra on heterogeneous systems. Int. J. Parallel Program., 1–27 (2016). https://doi.org/10.1007/s10766-016-0464-z
Cited by 4 articles.
1. SeisSol on Distributed Multi-GPU Systems: CUDA Code Generation for the Modal Discontinuous Galerkin Method;The International Conference on High Performance Computing in Asia-Pacific Region;2021-01-20
2. Efficient Mixed-Precision Tall-and-Skinny Matrix-Matrix Multiplication for GPUs;International Journal of Networking and Computing;2021
3. Performance engineering for real and complex tall & skinny matrix multiplication kernels on GPUs;The International Journal of High Performance Computing Applications;2020-10-09
4. ESSEX: Equipping Sparse Solvers For Exascale;Software for Exascale Computing - SPPEXA 2016-2019;2020