Funder
National Research Foundation (NRF) grant funded by the Korea government
Publisher
Springer Science and Business Media LLC
Subject
Computer Networks and Communications,Software
Reference14 articles.
1. Jeffers, J., Reinders, J., Sodani, A.: Intel Xeon Phi Processor High Performance Programming: Knights Landing Edition. Morgan Kaufmann (2016)
2. Bilmes, J., Asanovic, K., Chin, C.W., Demmel, J.: Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology. In: ACM International Conference on Supercomputing 25th Anniversary Volume, pp. 253–260. ACM (2014)
3. Goto, K., van de Geijn, R.A.: Anatomy of high-performance matrix multiplication. ACM Trans. Math. Softw. (TOMS) 34(3), 12 (2008)
4. Heinecke, A., Vaidyanathan, K., Smelyanskiy, M., Kobotov, A., Dubtsov, R., Henry, G., Shet, A.G., Chrysos, G., Dubey, P.: Design and implementation of the linpack benchmark for single and multi-node systems based on Intel® Xeon Phi Coprocessor. In: 2013 IEEE 27th International Symposium on Parallel & Distributed Processing (IPDPS), pp. 126–137. IEEE (2013)
5. Peyton, J.L.: Programming dense linear algebra kernels on vectorized architectures. Master’s thesis, The University of Tennessee, Knoxville (2013)
Cited by
23 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Revisiting the performance optimization of QR factorization on Intel KNL and SKL multiprocessors;The Journal of Supercomputing;2024-03-13
2. Efficient Execution of SpGEMM on Long Vector Architectures;Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing;2023-08-07
3. StreamSpeech: Low-Latency Neural Architecture for High-Quality on-Device Speech Synthesis;ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2023-06-04
4. NIOT: A Novel Inference Optimization of Transformers on Modern CPUs;IEEE Transactions on Parallel and Distributed Systems;2023-06
5. DGEMM Optimization Oriented to ARM SVE Instruction Set Architecture;2022 IEEE 28th International Conference on Parallel and Distributed Systems (ICPADS);2023-01