Auto-Selection of an Optimal Sparse Matrix Format in the Neuro-Simulator ANNarchy

Author:

Dinkelbach Helge Ülo,Bouhlal Badr-Eddine,Vitay Julien,Hamker Fred H.

Abstract

Modern neuro-simulators provide efficient implementations of simulation kernels on various parallel hardware (multi-core CPUs, distributed CPUs, GPUs), thereby supporting the simulation of increasingly large and complex biologically realistic networks. However, the optimal configuration of the parallel hardware and computational kernels depends on the exact structure of the network to be simulated. For example, the computation time of rate-coded neural networks is generally limited by the available memory bandwidth, and consequently, the organization of the data in memory will strongly influence the performance for different connectivity matrices. We pinpoint the role of sparse matrix formats implemented in the neuro-simulator ANNarchy with respect to computation time. Rather than asking the user to identify the best data structures required for a given network and platform, such a decision could also be carried out by the neuro-simulator. However, it requires heuristics that need to be adapted over time for the available hardware. The present study investigates how machine learning methods can be used to identify appropriate implementations for a specific network. We employ an artificial neural network to develop a predictive model to help the developer select the optimal sparse matrix format. The model is first trained offline using a set of training examples on a particular hardware platform. The learned model can then predict the execution time of different matrix formats and decide on the best option for a specific network. Our experimental results show that using up to 3,000 examples of random network configurations (i.e., different population sizes as well as variable connectivity), our approach effectively selects the appropriate configuration, providing over 93% accuracy in predicting the suitable format on three different NVIDIA devices.

Funder

Deutsche Forschungsgemeinschaft

Publisher

Frontiers Media SA

Subject

Computer Science Applications,Biomedical Engineering,Neuroscience (miscellaneous)

Reference51 articles.

1. Tensorflow: a system for large-scale machine learning;Abadi,2016

2. Optuna: a next-generation hyperparameter optimization framework;Akiba,2019

3. Autotuning in high-performance computing applications;Balaprakash;Proc. IEEE,2018

4. Implementing sparse matrix-vector multiplication on throughput-oriented processors;Bell,2009

5. BestSF: a sparse meta-format for optimizing SpMV on GPU;Benatia;ACM Trans. Architect. Code Optim,2018

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Cerberus: Triple Mode Acceleration of Sparse Matrix and Vector Multiplication;ACM Transactions on Architecture and Code Optimization;2024-05-21

2. Automated tuning for the parameters of linear solvers;Journal of Computational Physics;2023-12

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3