Author:
Abdelfattah Ahmad, Haidar Azzam, Tomov Stanimire, Dongarra Jack
Reference
14 articles.
1. E. Agullo, C. Augonnet, J. Dongarra, H. Ltaief, R. Namyst, S. Thibault, and S. Tomov. Faster, Cheaper, Better – a Hybridization Methodology to Develop Linear Algebra Software for GPUs. In W.-m. W. Hwu, editor, GPU Computing Gems, volume 2. Morgan Kaufmann, Sept. 2010.
2. E. Agullo et al. Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects. J. Phys.: Conf. Ser., 2009.
3. T. Dong, A. Haidar, S. Tomov, and J. Dongarra. A fast batched Cholesky factorization on a GPU. In Proc. of 2014 International Conference on Parallel Processing (ICPP-2014), September 2014.
4. J. Dongarra, A. Haidar, J. Kurzak, P. Luszczek, S. Tomov, and A. YarKhan. Model-driven one-sided factorizations on multicore accelerated systems. International Journal on Supercomputing Frontiers and Innovations, 1(1), June 2014.
5. A. Haidar, T. Dong, P. Luszczek, S. Tomov, and J. Dongarra. Batched matrix computations on hardware accelerators based on GPUs. International Journal of High Performance Computing Applications, 2015.
Cited by
12 articles.
1. Optimization Techniques for GPU Programming;ACM Computing Surveys;2023-03-16
2. Research on Strawberry Positioning Technology including Maturity Classification;2022 5th International Conference on Computer Science and Software Engineering (CSSE 2022);2022-10-21
3. Implementing LU and Cholesky factorizations on artificial intelligence accelerators;CCF Transactions on High Performance Computing;2021-08-24
4. Batched Triangular Dense Linear Algebra Kernels for Very Small Matrix Sizes on GPUs;ACM Transactions on Mathematical Software;2019-06-30
5. Accelerating Spectral Graph Analysis Through Wavefronts of Linear Algebra Operations;2019 27th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP);2019-02