1. BluesMPI: Efficient MPI Non-blocking Alltoall Offloading Designs on Modern BlueField Smart NICs
2. Nvidia Data Center Processing Unit (DPU) Architecture
3. Optimizing Distributed DNN Training using CPUs and BlueField-2 DPUs
4. G. Juckeland , W. Brantley , S. Chandrasekaran , B. Chapman , 2014 . SPEC ACCEL: A standard application suite for measuring hardware accelerator performance . In International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems. 46–67 . G. Juckeland, W. Brantley, S. Chandrasekaran, B. Chapman, 2014. SPEC ACCEL: A standard application suite for measuring hardware accelerator performance. In International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems. 46–67.
5. “Smarter” NICs for faster molecular dynamics: a case study