Auto-Selection of an Optimal Sparse Matrix Format in the Neuro-Simulator ANNarchy-Reference-Cited by-同舟云学术

Auto-Selection of an Optimal Sparse Matrix Format in the Neuro-Simulator ANNarchy

Published:2022-05-23 Issue: Volume:16 Page:
ISSN:1662-5196
Container-title:Frontiers in Neuroinformatics
language:
Short-container-title:Front. Neuroinform.

Author:

Dinkelbach Helge Ülo,Bouhlal Badr-Eddine,Vitay Julien,Hamker Fred H.

Abstract

Modern neuro-simulators provide efficient implementations of simulation kernels on various parallel hardware (multi-core CPUs, distributed CPUs, GPUs), thereby supporting the simulation of increasingly large and complex biologically realistic networks. However, the optimal configuration of the parallel hardware and computational kernels depends on the exact structure of the network to be simulated. For example, the computation time of rate-coded neural networks is generally limited by the available memory bandwidth, and consequently, the organization of the data in memory will strongly influence the performance for different connectivity matrices. We pinpoint the role of sparse matrix formats implemented in the neuro-simulator ANNarchy with respect to computation time. Rather than asking the user to identify the best data structures required for a given network and platform, such a decision could also be carried out by the neuro-simulator. However, it requires heuristics that need to be adapted over time for the available hardware. The present study investigates how machine learning methods can be used to identify appropriate implementations for a specific network. We employ an artificial neural network to develop a predictive model to help the developer select the optimal sparse matrix format. The model is first trained offline using a set of training examples on a particular hardware platform. The learned model can then predict the execution time of different matrix formats and decide on the best option for a specific network. Our experimental results show that using up to 3,000 examples of random network configurations (i.e., different population sizes as well as variable connectivity), our approach effectively selects the appropriate configuration, providing over 93% accuracy in predicting the suitable format on three different NVIDIA devices.

Funder

Deutsche Forschungsgemeinschaft

Publisher

Frontiers Media SA

Subject

Computer Science Applications,Biomedical Engineering,Neuroscience (miscellaneous)

Reference51 articles.

1. Tensorflow: a system for large-scale machine learning;Abadi,2016

2. Optuna: a next-generation hyperparameter optimization framework;Akiba,2019

3. Autotuning in high-performance computing applications;Balaprakash;Proc. IEEE,2018

4. Implementing sparse matrix-vector multiplication on throughput-oriented processors;Bell,2009

5. BestSF: a sparse meta-format for optimizing SpMV on GPU;Benatia;ACM Trans. Architect. Code Optim,2018

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Cerberus: Triple Mode Acceleration of Sparse Matrix and Vector Multiplication;ACM Transactions on Architecture and Code Optimization;2024-05-21

2. Automated tuning for the parameters of linear solvers;Journal of Computational Physics;2023-12