1. Bayatpour , M. , Sarkauskas , N. , Subramoni , H. , Maqbool Hashmi , J. , Panda , D.K. : Bluesmpi: Efficient mpi non-blocking alltoall offloading designs on modern blue-field smart nics. In : Chamberlain, B.L., Varbanescu, A.L., Ltaief, H., Luszczek, P. (eds.) High Performance Computing. pp. 18 -- 37 . Springer International Publishing , Cham ( 2021 ) Bayatpour, M., Sarkauskas, N., Subramoni, H., Maqbool Hashmi, J., Panda, D.K.: Bluesmpi: Efficient mpi non-blocking alltoall offloading designs on modern blue-field smart nics. In: Chamberlain, B.L., Varbanescu, A.L., Ltaief, H., Luszczek, P. (eds.) High Performance Computing. pp. 18--37. Springer International Publishing, Cham (2021)
2. FPGAs in Supercomputers
3. Christgau , S. , Knaust , M. , Steinke , T. : A first step towards support for mpi partitioned communication on sycl-programmed fpgas . In: IEEE/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing, H2RC@ SC 2022 , Dallas, TX, USA , November , 2022 (2022) Christgau, S., Knaust, M., Steinke, T.: A first step towards support for mpi partitioned communication on sycl-programmed fpgas. In: IEEE/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing, H2RC@ SC 2022, Dallas, TX, USA, November, 2022 (2022)
4. Favaro , F. , Dufrechou , E. , Oliver , J.P. , Ezzatti , P. : Time-power-energy balance of blas kernels in modern fpgas. In : Navaux, P., Barrios H., C.J., Osthoff, C., Guerrero, G. (eds.) High Performance Computing. pp. 78 -- 89 . Springer International Publishing , Cham ( 2022 ) Favaro, F., Dufrechou, E., Oliver, J.P., Ezzatti, P.: Time-power-energy balance of blas kernels in modern fpgas. In: Navaux, P., Barrios H., C.J., Osthoff, C., Guerrero, G. (eds.) High Performance Computing. pp. 78--89. Springer International Publishing, Cham (2022)
5. Freitag , T. : Acceleration of an autoencoder using a fpga-soc in a high-performance node of a distributed onboard computer ( 2022 ), https://publica.fraunhofer.de/handle/publica/430107 Freitag, T.: Acceleration of an autoencoder using a fpga-soc in a high-performance node of a distributed onboard computer (2022), https://publica.fraunhofer.de/handle/publica/430107