Parallelizing Comprehensive Learning Particle Swarm Optimization by Open Computing Language on an Integrated Graphical Processing Unit-Reference-Cited by-同舟云学术

Parallelizing Comprehensive Learning Particle Swarm Optimization by Open Computing Language on an Integrated Graphical Processing Unit

Published:2020-07-31 Issue: Volume:2020 Page:1-17
ISSN:1076-2787
Container-title:Complexity
language:en
Short-container-title:Complexity

Author:

Yu Xiang¹^ORCID,Qiao Yu²,Li Qingpeng³,Xu Gang⁴,Kang Chuanxiong¹,Estevez Claudio⁵,Deng Chengzhi¹^ORCID,Wang Shengqian¹

Affiliation:

1. Provincial Key Laboratory for Water Information Cooperative Sensing and Intelligent Processing, Nanchang Institute of Technology, Nanchang, Jiangxi 330099, China

2. School of Mathematics and Information Science, Shaanxi Normal University, Xi’an, Shaanxi 710119, China

3. State Grid Nanchang Electric Power Supply Company, Nanchang, Jiangxi 330069, China

4. Department of Mathematics, Nanchang University, Nanchang, Jiangxi 330031, China

5. Department of Electrical Engineering, University of Chile, Santiago 8370451, Chile

Abstract

Comprehensive learning particle swarm optimization (CLPSO) is a powerful metaheuristic for global optimization. This paper studies parallelizing CLPSO by open computing language (OpenCL) on the integrated Intel HD Graphics 520 (IHDG520) graphical processing unit (GPU) with a low clock rate. We implement a coarse-grained all-GPU model that maps each particle to a separate work item. Two enhancement strategies, namely, generating and transferring random numbers from the central processor to the GPU as well as reducing the number of instructions in the kernel, are proposed to shorten the model’s execution time. This paper further investigates parallelizing deterministic optimization for implicit stochastic optimization of China’s Xiaowan Reservoir. The deterministic optimization is performed on an ensemble of 62 years’ historical inflow records with monthly time steps, is solved by CLPSO, and is parallelized by a coarse-grained multipopulation model extended from the all-GPU model. The multipopulation model involves a large number of work items. Because of the capacity limit for a buffer transferring data from the central processor to the GPU and the size of the global memory region, the random number generation strategy is modified by generating a small number of random numbers that can be flexibly exploited by the large number of work items. Experiments conducted on various benchmark functions and the case study demonstrate that our proposed all-GPU and multipopulation parallelization models are appropriate; and the multipopulation model achieves the consumption of significantly less execution time than the corresponding sequential model.

Funder

National Natural Science Foundation of China

Publisher

Hindawi Limited

Subject

Multidisciplinary,General Computer Science

Link

http://downloads.hindawi.com/journals/complexity/2020/6589658.pdf

Reference60 articles.

1. Online Parameter Optimization-Based Prediction for Converter Gas System by Parallel Strategies

2. Bio-inspired optimization for electromagnetic structure design using full-wave techniques on GPUs

3. Parallel hybrid PSO with CUDA for lD heat conduction equation

4. A New Cooperative Evolutionary Multi-Swarm Optimizer Algorithm Based on CUDA Architecture Applied to Engineering Optimization

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Traffic signal control using a cooperative EWMA-based multi-agent reinforcement learning;Applied Intelligence;2022-06-11