Publisher
Springer Nature Switzerland
Reference17 articles.
1. Agullo, E., Augonnet, C., Dongarra, J., Ltaief, H., Namyst, R., Thibault, S.: Faster, cheaper, better - a hybridization methodology to develop linear algebra software for GPUs. In: GPU Computing Gems, vol. 2 (2010)
2. Agullo, E., et al.: Achieving high performance on supercomputers with a sequential task-based programming model. IEEE Trans. Parallel Distrib. Syst. (2017)
3. Agullo, E., Cámara, J., Cuenca, J., Giménez, D.: On the autotuning of task-based numerical libraries for heterogeneous architectures. In: Advances in Parallel Computing, vol. 36, pp. 157–166 (2020)
4. Anzt, H., Haugen, B., Kurzak, J., Luszczek, P., Dongarra, J.: Experiences in autotuning matrix multiplication for energy minimization on GPUs. Concurr. Comput. Pract. Exp. 27, 5096–5113 (2015)
5. Augonnet, C., Thibault, S., Namyst, R., Wacrenier, P.A.: STARPU: a unified platform for task scheduling on heterogeneous multicore architectures. Concurr. Comput. Pract. Exp. 23(2), 187–198 (2011)