Distributed Training and Inference of Deep Learning Models for Multi-Modal Land Cover Classification

Published: 2020-08-19
Volume: 12, Issue: 17, Page: 2670
ISSN: 2072-4292
Journal: Remote Sensing
Language: en

Author:

Aspri Maria, Tsagkatakis Grigorios, Tsakalides Panagiotis

Abstract

Deep Neural Networks (DNNs) have established themselves as a fundamental tool in numerous computational modeling applications, overcoming the challenge of defining use-case-specific feature extraction by incorporating this stage into unified end-to-end trainable models. Despite their modeling capabilities, training large-scale DNN models is a very computation-intensive task that a single machine is often incapable of accomplishing. To address this issue, different parallelization schemes have been proposed. Nevertheless, network overheads and optimal resource allocation pose major challenges, since network communication is generally slower than intra-machine communication and some layers are more computationally expensive than others. In this work, we consider a novel multimodal DNN based on the Convolutional Neural Network architecture and explore several ways to optimize its performance when training is executed on an Apache Spark cluster. We evaluate the performance of different architectures in terms of network traffic and processing power, considering the case of land cover classification from remote sensing observations. Furthermore, we compare our architectures with an identical DNN architecture modeled after a data parallelization approach in terms of classification accuracy and inference execution time. The experiments show that the way a model is parallelized has a tremendous effect on resource allocation, and that hyperparameter tuning can reduce network overheads. Experimental results also demonstrate that the proposed model parallelization schemes achieve more efficient resource use and more accurate predictions compared to data parallelization approaches.
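
The two parallelization styles contrasted in the abstract can be illustrated with a minimal, single-process sketch. This is not the authors' Spark implementation: the linear model, the function names, and the branch/fusion structure are all assumptions chosen only to show the idea. In data parallelism every worker holds the full model and processes a shard of the batch; in model parallelism different parts of the model (here, one branch per input modality) can be placed on different workers.

```python
# Illustrative only: plain Python standing in for cluster workers.
# None of these names come from the paper.

def mse_gradient(weights, features, targets):
    """Gradient of mean squared error for a linear model y = w . x."""
    n = len(features)
    grad = [0.0] * len(weights)
    for x, y in zip(features, targets):
        error = sum(w * xi for w, xi in zip(weights, x)) - y
        for j, xi in enumerate(x):
            grad[j] += 2.0 * error * xi / n
    return grad


def data_parallel_gradient(weights, features, targets, num_workers):
    """Data parallelism: each worker computes a gradient on its own shard,
    then the shard gradients are averaged (assumes equal-size shards)."""
    shard = len(features) // num_workers
    assert shard * num_workers == len(features), "shards must be equal-size"
    grads = [
        mse_gradient(
            weights,
            features[k * shard:(k + 1) * shard],
            targets[k * shard:(k + 1) * shard],
        )
        for k in range(num_workers)
    ]
    return [sum(g[j] for g in grads) / num_workers for j in range(len(weights))]


def model_parallel_forward(branches, modality_inputs, fuse):
    """Model parallelism: each branch (one per modality) could run on a
    different worker; only the branch outputs need to be exchanged."""
    branch_outputs = [branch(x) for branch, x in zip(branches, modality_inputs)]
    return fuse(branch_outputs)
```

With equal-size shards, the averaged shard gradients match the full-batch gradient, so the cost of data parallelism lies in exchanging gradients for the whole model, whereas the model-parallel sketch exchanges only branch outputs. Which of the two generates less network traffic in practice depends on model and data sizes, which is the trade-off the paper evaluates.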

Publisher

MDPI AG

Subject

General Earth and Planetary Sciences

Link

https://www.mdpi.com/2072-4292/12/17/2670/pdf

Cited by 12 articles.

1. Development trends and countermeasures of China’s cloud artificial intelligence chip industry;2023 IEEE 14th International Symposium on Parallel Architectures, Algorithms and Programming (PAAP);2023-11-24

2. Distributed artificial intelligence: Taxonomy, review, framework, and reference architecture;Intelligent Systems with Applications;2023-05

3. Distributed Training of Large-Scale Deep Learning Models in Commodity Hardware;Inventive Systems and Control;2023

4. State of the Art: High-Performance and High-Throughput Computing for Remote Sensing Big Data;IEEE Geoscience and Remote Sensing Magazine;2022-12

5. An efficient algorithm for data parallelism based on stochastic optimization;Alexandria Engineering Journal;2022-12