A Highly Parallel and Scalable Motion Estimation Algorithm with GPU for HEVC

Published:2017 Issue: Volume:2017 Page:1-15
ISSN:1058-9244
Container-title:Scientific Programming
language:en
Short-container-title:Scientific Programming

Author:

Xue Yun-gang¹, Su Hua-you¹, Ren Ju¹, Wen Mei¹, Zhang Chun-yuan¹, Xiao Li-quan¹

Affiliation:

1. School of Computer, National University of Defense Technology, Changsha 410073, China

Abstract

We propose a highly parallel and scalable motion estimation algorithm, named multilevel resolution motion estimation (MLRME), that combines the advantages of local full search and downsampling. Subsampling a video frame saves a large amount of computation, while the local full-search method exposes massive parallelism and makes full use of powerful modern many-core accelerators such as GPUs and the Intel Xeon Phi. We integrated the proposed MLRME into HM12.0, and the experimental results show that its encoding quality is close to that of the fast motion estimation in HEVC, declining by less than 1.5%. We also implemented MLRME with CUDA, which obtained a 30–60x speed-up over the serial algorithm on a single CPU. Specifically, the parallel implementation of MLRME on a GTX 460 GPU meets the real-time coding requirement, at about 25 fps for the 2560×1600 video format, while for 832×480 the performance is more than 100 fps.
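
The general idea of combining downsampling with a local full search can be illustrated with a small two-level sketch. This is not the authors' MLRME implementation: the abstract does not state the number of resolution levels, the downsampling filter, the matching cost, or the search radii, so the 2x2 averaging, the SAD cost, the two levels, and every function and parameter name below are assumptions chosen for illustration only.

```python
def downsample(frame):
    """Halve resolution by averaging each 2x2 pixel group (assumed filter)."""
    h, w = len(frame) // 2, len(frame[0]) // 2
    return [[(frame[2 * y][2 * x] + frame[2 * y][2 * x + 1]
              + frame[2 * y + 1][2 * x] + frame[2 * y + 1][2 * x + 1]) // 4
             for x in range(w)] for y in range(h)]


def sad(cur, ref, bx, by, dx, dy, size):
    """Sum of absolute differences; None if the candidate leaves the frame."""
    h, w = len(ref), len(ref[0])
    if not (0 <= bx + dx and bx + dx + size <= w
            and 0 <= by + dy and by + dy + size <= h):
        return None
    return sum(abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
               for y in range(size) for x in range(size))


def full_search(cur, ref, bx, by, size, center, radius):
    """Test every offset within `radius` of `center`; return (mv, cost).

    Each candidate is evaluated independently of the others, which is the
    property that makes a local full search easy to spread across threads.
    """
    best_mv, best_cost = None, None
    cx, cy = center
    for dy in range(cy - radius, cy + radius + 1):
        for dx in range(cx - radius, cx + radius + 1):
            cost = sad(cur, ref, bx, by, dx, dy, size)
            if cost is not None and (best_cost is None or cost < best_cost):
                best_mv, best_cost = (dx, dy), cost
    return best_mv, best_cost


def two_level_me(cur, ref, bx, by, size=8, coarse_radius=4, refine_radius=1):
    """Coarse full search at half resolution, then a small refinement.

    Assumes bx, by and size are even so the block maps cleanly to the
    half-resolution frame.
    """
    cur_lo, ref_lo = downsample(cur), downsample(ref)
    (mx, my), _ = full_search(cur_lo, ref_lo, bx // 2, by // 2,
                              size // 2, (0, 0), coarse_radius)
    # Scale the coarse vector back up and refine around it at full resolution.
    return full_search(cur, ref, bx, by, size, (2 * mx, 2 * my), refine_radius)
```

With these assumed defaults, the coarse level covers roughly a ±8 pixel window at full resolution while evaluating 81 candidates on a block with a quarter of the pixels, and the refinement adds only 9 full-resolution candidates. A direct ±8 full search would evaluate 289 full-resolution candidates per block.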

Funder

National Natural Science Foundation of China

Publisher

Hindawi Limited

Subject

Computer Science Applications,Software

Link

http://downloads.hindawi.com/journals/sp/2017/1431574.pdf
