Affiliation:
1. Cray Inc., Chippewa Falls, Wisconsin
2. Stanford University
Abstract
This paper describes the radix-64 folded-Clos network of the Cray BlackWidow scalable vector multiprocessor. We describe the BlackWidow network, which scales to 32K processors with a worst-case diameter of seven hops, and the underlying high-radix router microarchitecture and its implementation. By using a high-radix router with many narrow channels, we are able to take advantage of the higher pin density and faster signaling rates available in modern ASIC technology. The BlackWidow router is an 800 MHz ASIC with 64 18.75 Gb/s bidirectional ports for an aggregate off-chip bandwidth of 2.4 Tb/s. Each port consists of three 6.25 Gb/s differential signals in each direction. The router supports deterministic and adaptive packet routing with separate buffering for request and reply virtual channels. The router is organized hierarchically [13] as an 8×8 array of tiles, which simplifies arbitration by avoiding long wires in the arbiters. Each tile of the array contains a router port, its associated buffering, and an 8×8 router subswitch. The router ASIC is implemented in a 90 nm CMOS standard-cell ASIC technology and went from concept to tapeout in 17 months.
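The bandwidth figures quoted in the abstract can be cross-checked with a few lines of arithmetic. The sketch below uses only the numbers stated above. The constant and function names are illustrative, and it assumes that the 2.4 Tb/s aggregate counts both directions of every port.

```python
# Figures taken from the abstract; names are illustrative, not from the paper.
PORTS = 64                  # router radix
SIGNALS_PER_DIRECTION = 3   # differential signals per port, per direction
SIGNAL_RATE_GBPS = 6.25     # rate of each differential signal, in Gb/s
TILE_ROWS, TILE_COLS = 8, 8 # tile array dimensions


def port_bandwidth_gbps() -> float:
    """Bandwidth of one port in one direction, in Gb/s."""
    return SIGNALS_PER_DIRECTION * SIGNAL_RATE_GBPS


def aggregate_bandwidth_tbps() -> float:
    """Off-chip bandwidth summed over all ports and both directions, in Tb/s.

    Assumption: the aggregate figure counts input and output together.
    """
    per_direction_gbps = PORTS * port_bandwidth_gbps()
    return 2 * per_direction_gbps / 1000


def tile_count() -> int:
    """Number of tiles in the array; the abstract places one port in each tile."""
    return TILE_ROWS * TILE_COLS


if __name__ == "__main__":
    print(f"per-port bandwidth: {port_bandwidth_gbps()} Gb/s per direction")
    print(f"aggregate off-chip bandwidth: {aggregate_bandwidth_tbps()} Tb/s")
    print(f"tiles: {tile_count()} for {PORTS} ports")
```

Under these assumptions the per-port figure matches the stated 18.75 Gb/s, the aggregate matches 2.4 Tb/s, and the 8×8 tile array provides exactly one tile per port.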
Publisher
Association for Computing Machinery (ACM)
References: 23 articles.
1. [2] C. Clos. A Study of Non-Blocking Switching Networks. The Bell System Technical Journal 32(2):406-424, March 1953.
2. [3] Cray X1. http://www.cray.com/products/x1/.
3. [4] Cray XD1. http://www.cray.com/products/xd1/.
4. [5] Cray XT3. http://www.cray.com/products/xt3/.
Cited by
85 articles.
1. A Lightweight Routing Layer Using a Reliable Link-Layer Protocol;2023 IEEE International Conference on Cloud Computing Technology and Science (CloudCom);2023-12-04
2. VVQ: Virtualizing Virtual Channel for Cost-Efficient Protocol Deadlock Avoidance;2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA);2023-02
3. Dynamic global adaptive routing in high-radix networks;Proceedings of the 49th Annual International Symposium on Computer Architecture;2022-06-11
4. A software-defined tensor streaming multiprocessor for large-scale machine learning;Proceedings of the 49th Annual International Symposium on Computer Architecture;2022-06-11
5. Providing quality of service in omni-path networks;The Journal of Supercomputing;2022-03-02