Abstract
In the process of Canny edge detection, a large number of high complexity calculations such as Gaussian filtering, gradient calculation, non-maximum suppression, and double threshold judgment need to be performed on the image, which takes up a lot of operation time, which is a great challenge to the real-time requirements of the algorithm. The traditional Canny edge detection technology mainly uses customized equipment such as DSP and FPGA, but it has some problems, such as long development cycle, difficult debugging, resource consumption, and so on. At the same time, the adopted CUDA platform has the problem of poor cross-platform. In order to solve this problem, a fine-grained parallel Canny edge detection method is proposed, which is optimized from three aspects: task partition, vector memory access, and NDRange optimization, and CPU-GPU collaborative parallelism is realized. At the same time, the parallel Canny edge detection methods based on multi-core CPU and CUDA architecture are designed. The experimental results show that OpenCL accelerated Canny edge detection algorithm (OCL_Canny) achieves 20.68 times acceleration ratio compared with CPU serial algorithm at 7452 × 8024 image resolution. At the image resolution of 3500 × 3500, the OCL_Canny algorithm achieves 3.96 times the acceleration ratio compared with the CPU multi-threaded Canny parallel algorithm. At 1024 × 1024 image resolution, the OCL_Canny algorithm achieves 1.21 times the acceleration ratio compared with the CUDA-based Canny parallel algorithm. The effectiveness and performance portability of the proposed Canny edge detection parallel algorithm are verified, and it provides a reference for the research of fast calculation of image big data.
Funder
National Natural Science Foundation of China
Key Scientific Research Project of Colleges and Universities in Henan Province
Natural Science Foundation of Shandong Province
Publisher
Public Library of Science (PLoS)
Reference40 articles.
1. Adaptive edge detection technique implemented on FPGA;S Taslimi;Iranian Journal of Science and Technology-Transactions of Electrical Engineering,2020
2. Multi-GPGPU based medical image processing in hip replacement;A Morar;Control Eng Appl Inf,2012
3. Edge detection of satellite image using fuzzy logic;R Dhivya;Cluster Comput,2019
4. Multi-GPU design and performance evaluation of homomorphic encryption on GPU clusters;A Al Badawi;IEEE T Parall Distr,2021
5. Characterizing deep neural networks on edge computing systems for object classification in 3D point clouds;C Wisultschew;IEEE Sens J,2022
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献