DLBench: a comprehensive experimental evaluation of deep learning frameworks

Author:

Radwa Elshawi, Abdul Wahab, Ahmed Barnawi, Sherif Sakr

Abstract

Deep Learning (DL) has achieved remarkable progress over the last decade on various tasks such as image recognition, speech recognition, and natural language processing. Three crucial factors have fueled this progress: the increasing availability of large amounts of digitized data, the increasing availability of affordable and powerful parallel computing resources (e.g., GPUs), and the growing number of open-source deep learning frameworks that facilitate and ease the development of deep learning architectures. The increasing popularity of these frameworks calls for benchmarking studies that can effectively evaluate and explain their performance characteristics. In this paper, we conduct an extensive experimental evaluation and analysis of six popular deep learning frameworks, namely TensorFlow, MXNet, PyTorch, Theano, Chainer, and Keras, using three types of DL architectures: Convolutional Neural Networks (CNN), Faster Region-based Convolutional Neural Networks (Faster R-CNN), and Long Short-Term Memory (LSTM) networks. Our evaluation compares the frameworks along several dimensions, including accuracy, training time, convergence, and resource-consumption patterns. The experiments were conducted in both CPU and GPU environments using different datasets. We report and analyze the performance characteristics of the studied frameworks, and we distill a set of insights and important lessons learned from conducting our experiments.
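The comparison dimensions named in the abstract (training time and resource consumption) can be measured in a framework-agnostic way. The snippet below is a minimal sketch of such a measurement loop, not the paper's actual benchmarking harness; the `train_step` callable and the stand-in workload are hypothetical placeholders for one epoch of training in whichever framework is under test.

```python
import time
import tracemalloc

def benchmark_training(train_step, n_epochs=3):
    """Run `train_step` for n_epochs, recording per-epoch wall-clock
    time and the peak Python-heap allocation observed."""
    tracemalloc.start()
    epoch_times = []
    for _ in range(n_epochs):
        t0 = time.perf_counter()
        train_step()  # one "epoch" of work
        epoch_times.append(time.perf_counter() - t0)
    _, peak_bytes = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    return epoch_times, peak_bytes

# Stand-in workload; a real harness would pass one training epoch of a
# CNN, Faster R-CNN, or LSTM in the framework under test instead.
times, peak_bytes = benchmark_training(lambda: [i * i for i in range(10_000)])
```

Note that `tracemalloc` only tracks Python-heap allocations; GPU memory and native (C/C++) allocations made by a framework's backend would need framework-specific profilers instead.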

Publisher

Springer Science and Business Media LLC

Subject

Computer Networks and Communications, Software


Cited by 38 articles (first 5 shown).

1. Recent advances in Machine Learning based Advanced Driver Assistance System applications;Microprocessors and Microsystems;2024-09

2. A methodology for evaluating the performance and cost of training neural networks in cloud environments; Anais do XXIII Workshop em Desempenho de Sistemas Computacionais e de Comunicação (WPerformance 2024); 2024-07-21

3. Studying the Impact of TensorFlow and PyTorch Bindings on Machine Learning Software Quality;ACM Transactions on Software Engineering and Methodology;2024-07-13

4. State-of-the-Art Machine Learning Frameworks for Training or Inference on Business Process Dataset;2024 47th MIPRO ICT and Electronics Convention (MIPRO);2024-05-20

5. Enhanced Small Drone Detection Using Optimized YOLOv8 With Attention Mechanisms;IEEE Access;2024
