All-to-All Broadcast Algorithm in Galaxyfly Networks
-
Published:2023-05-26
Issue:11
Volume:11
Page:2459
-
ISSN:2227-7390
-
Container-title:Mathematics
-
language:en
-
Short-container-title:Mathematics
Author:
Zhuang Hongbin1ORCID, Chang Jou-Ming2ORCID, Li Xiao-Yan1ORCID, Song Fangying3ORCID, Lin Qinying1
Affiliation:
1. College of Computer and Data Science, Fuzhou University, Fuzhou 350108, China 2. Institute of Information and Decision Sciences, National Taipei University of Business, Taipei 10051, Taiwan 3. School of Mathematics and Statistics, Fuzhou University, Fuzhou 350108, China
Abstract
The design of interconnection networks is a fundamental aspect of high-performance computing (HPC) systems. Among the available topologies, the Galaxyfly network stands out as a low-diameter and flexible-radix network for HPC applications. Given the paramount importance of collective communication in HPC performance, in this paper, we present two different all-to-all broadcast algorithms for the Galaxyfly network, which adhere to the supernode-first rule and the router-first rule, respectively. Our performance evaluation validates their effectiveness and shows that the first algorithm has a higher degree of utilization of network channels, and that the second algorithm can significantly reduce the average time for routers to collect packets from the supernode.
Funder
National Natural Science Foundation of China Ministry of Science and Technology of Taiwan Natural Science Foundation of Fujian Province
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference33 articles.
1. Fault-tolerant routing with load balancing in LeTQ networks;Fan;IEEE Trans. Dependable Secur. Comput.,2023 2. Optimizing data query performance of Bi-cluster for large-scale scientific data in supercomputers;Liao;J. Supercomput.,2022 3. Mavroidis, I., Papaefstathiou, I., Lavagno, L., Nikolopoulos, D.S., Koch, D., Goodacre, J., Sourdis, I., Papaefstathiou, P., Coppola, M., and Palomino, M. (2016, January 14–18). Ecoscale: Reconfigurable computing and runtime system for future exascale systems. Proceedings of the 2016 Design, Automation & Test in Europe Conference & Exhibition (DATE), Dresden, Germany. 4. Sedova, A., Davidson, R., Taillefumier, M., and Elwasif, W. (2022). HPC Molecular Simulation Tries Out a New GPU: Experiences on Early AMD Test Systems for the Frontier Supercomputer, Oak Ridge National Lab. 5. Bharadwaj, S., Yin, J., Beckmann, B., and Krishna, T. (2020, January 20–24). Kite: A family of heterogeneous interposer topologies enabled via accurate interconnect modeling. Proceedings of the 2020 57th ACM/IEEE Design Automation Conference (DAC), San Francisco, CA, USA.
|
|