Deep Learning at Scale and at Ease

Author:

Wang Wei1ORCID,Chen Gang2,Chen Haibo3,Dinh Tien Tuan Anh1,Gao Jinyang1,Ooi Beng Chin1,Tan Kian-Lee1,Wang Sheng1,Zhang Meihui4

Affiliation:

1. National University of Singapore

2. Zhejiang University, Hangzhou, PR China

3. NetEase Inc., Zhejiang, China

4. Singapore University of Technology and Design

Abstract

Recently, deep learning techniques have enjoyed success in various multimedia applications, such as image classification and multimodal data analysis. Large deep learning models are developed for learning rich representations of complex data. There are two challenges to overcome before deep learning can be widely adopted in multimedia and other applications. One is usability, namely the implementation of different models and training algorithms must be done by nonexperts without much effort, especially when the model is large and complex. The other is scalability, namely the deep learning system must be able to provision for a huge demand of computing resources for training large models with massive datasets. To address these two challenges, in this article we design a distributed deep learning platform called SINGA , which has an intuitive programming model based on the common layer abstraction of deep learning models. Good scalability is achieved through flexible distributed training architecture and specific optimization techniques. SINGA runs on both GPUs and CPUs, and we show that it outperforms many other state-of-the-art deep learning systems. Our experience with developing and training deep learning models for real-life multimedia applications in SINGA shows that the platform is both usable and scalable.

Funder

A*STAR

National Natural Science Foundation of China

National Research Foundation, Prime Minister's Office, Singapore under its Competitive Research Programme

National Research Foundation, Energy Innovation Programme Office, Singapore under Energy Innovation Research Programme

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture

Reference45 articles.

1. Tianqi Chen Bing Xu Chiyuan Zhang and Carlos Guestrin. 2016b. Training deep nets with sublinear memory cost. arXiv:1604.06174. http://arxiv.org/abs/1604.06174 Tianqi Chen Bing Xu Chiyuan Zhang and Carlos Guestrin. 2016b. Training deep nets with sublinear memory cost. arXiv:1604.06174. http://arxiv.org/abs/1604.06174

2. Tianqi Chen Mu Li Yutian Li Min Lin Naiyan Wang Minjie Wang Tianjun Xiao Bing Xu Chiyuan Zhang and Zheng Zhang. 2015. MXNet: A flexible and efficient machine learning library for heterogeneous distributed systems. arXiv:1512.01274. Tianqi Chen Mu Li Yutian Li Min Lin Naiyan Wang Minjie Wang Tianjun Xiao Bing Xu Chiyuan Zhang and Zheng Zhang. 2015. MXNet: A flexible and efficient machine learning library for heterogeneous distributed systems. arXiv:1512.01274.

3. Jianmin Chen Rajat Monga Samy Bengio and Rafal Józefowicz. 2016a. Revisiting distributed synchronous SGD. arXiv:1604.00981. http://arxiv.org/abs/1604.00981 Jianmin Chen Rajat Monga Samy Bengio and Rafal Józefowicz. 2016a. Revisiting distributed synchronous SGD. arXiv:1604.00981. http://arxiv.org/abs/1604.00981

Cited by 19 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Resource Efficient Bayesian Optimization;2024 IEEE 17th International Conference on Cloud Computing (CLOUD);2024-07-07

2. Low-complexity CNN-based CU partitioning for intra frames;Journal of Real-Time Image Processing;2023-06-14

3. Scalability of knowledge distillation in incremental deep learning for fast object detection;Applied Soft Computing;2022-11

4. DeepJoint Segmentation-based Lung Segmentation and Hybrid Optimization-Enabled Deep Learning for Lung Nodule Classification;International Journal of Pattern Recognition and Artificial Intelligence;2022-09-28

5. Deep graph convolutional network-based high-performance detection method for spectral domain gesture image stream;Journal of Electronic Imaging;2022-09-19

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3