Mapping of Deep Neural Network Accelerators on Wireless Multistage Interconnection NoCs
-
Published:2023-12-20
Issue:1
Volume:14
Page:56
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Aydi Yassine1ORCID, Mnejja Sirine1ORCID, Mohammed Faraqid Q.1ORCID, Abid Mohamed1
Affiliation:
1. CES Laboratory, National Engineering School of Sfax, Sfax 3038, Tunisia
Abstract
In the last few decades, the concept of Wireless Network-on-chip (WiNoC) has emerged as a promising alternative for Multiprocessor Systems on Chip (MPSOC) to achieve reliable and scalable communication. Worth recalling in this regard is that our research team has already designed, verified and evaluated Multistage Interconnection Networks (MIN) in this field. With respect to the present work, we consider proceeding with further exploring our thoughts on this research area. Firstly, we propose the design and performance evaluation of a hybrid (wireless/wired) MIN, analysing how this augmented network can potentially improve not only the average delay, but also energy consumption. Secondly, we continue with examining the implementation of our advanced DELTA-based MIN architecture on Deep Neural Network (DNN) accelerators, while accounting for its potential regularity and scalability in simultaneously maintaining an effective power efficiency and lower latency throughout the DNN operating process. In this context, several metrics have been evaluated in regard to three DNN application cases through implementation of their main respective modules.
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference63 articles.
1. Abeyratn, N., Das, R., Li, Q., Sewell, K., Giridhar, B., Dreslinski, R.G., Blaauw, D., and Mudge, T. (2013, January 23–27). Scaling towards kilo-core processors with asymmetric high-radix topologies. Proceedings of the IEEE 19th International Symposium on High Performance Computer Architecture (HPCA), Shenzhen, China. 2. Bohnenstieh, B., Stillmaker, A., Pimntel, J., Andreas, T., Liu, B., Tran, A., Adeagbo, E., and Baas, B. (2016, January 15–17). A 5.8 pj/op 115 billion ops/sec, to 1.78 trillion ops/sec 32nm 1000-processor array. Proceedings of the IEEE Symposium on VLSI Circuits (VLSI-Circuits), Honolulu, HI, USA. 3. Kel, J.H., Johnsonn, M.R., Lumtta, S.S., and Patel, S.J. (2010, January 11–15). WayPoint: Scaling coherence to 1000-core architectures. Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques (PACT), Vienna, Austria. 4. Abada, S., Mestres, A., Martinez, R., Alarcon, E., Cabellos-Aparicio, A., and Martinez, R. (2015, January 4–6). Multicast on-chip traffic analysis targeting manycore NoC design. Proceedings of the 23rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, Turku, Finland. 5. WiSync: An architecture for fast synchronization through on-chip wireless communication;Abada;ACM Sigplan Not.,2016
|
|