1. Neha Agarwal, David W. Nellans, Mike O'Connor, Stephen W. Keckler, and Thomas F. Wenisch. 2015. Unlocking bandwidth for GPUs in CC-NUMA systems. In HPCA. IEEE Computer Society, Los Alamitos, CA, USA, 354–365. https://doi.org/10.1109/HPCA.2015.7056046
2. AMD. 2015. Asynchronous shaders: Unlocking the full potential of the GPU. AMD. https://developer.amd.com/wordpress/media/2012/10/Asynchronous-Shaders-White-Paper-FINAL.pdf
3. AMD. 2017. Radeon's next-generation Vega architecture. AMD. https://en.wikichip.org/w/images/a/a1/vega-whitepaper.pdf
4. AMD. 2020. OpenCL optimization. AMD. https://rocmdocs.amd.com/en/latest/Programming_Guides/Opencl-optimization.html Git Revision 1f057816.
5. AMD. 2020. OpenCL programming guide. AMD. https://rocmdocs.amd.com/en/latest/Programming_Guides/Opencl-programming-guide.html Git Revision 611e249.