Affiliation:
1. Hangzhou Dianzi University, Hangzhou, China
2. Ocean University of China, Qingdao, China
3. Harbin Institute of Technology, Shenzhen, China
4. Tsinghua Shenzhen International Graduate School, Shenzhen, China
5. Tsinghua University, Beijing, China
Abstract
The difficulty of no-reference image quality assessment (NR IQA) often lies in the lack of knowledge about the distortion in the image, which makes quality assessment blind and thus inefficient. To tackle such issue, in this article, we propose a novel scheme for precise NR IQA, which includes two successive steps, i.e., distortion identification and targeted quality evaluation. In the first step, we employ the well-known Inception-ResNet-v2 neural network to train a classifier that classifies the possible distortion in the image into the four most common distortion types, i.e., Gaussian white noise (WN), Gaussian blur (GB), jpeg compression (JPEG), and jpeg2000 compression (JP2K). Specifically, the deep neural network is trained on the large-scale Waterloo Exploration database, which ensures the robustness and high performance of distortion classification. In the second step, after determining the distortion type of the image, we then design a specific approach to quantify the image distortion level, which can estimate the image quality specially and more precisely. Extensive experiments performed on LIVE, TID2013, CSIQ, and Waterloo Exploration databases demonstrate that (1) the accuracy of our distortion classification is higher than that of the state-of-the-art distortion classification methods, and (2) the proposed NR IQA method outperforms the state-of-the-art NR IQA methods in quantifying the image quality.
Funder
National Key Research and Development Program of China
National Natural Science Foundation of China
China Postdoctoral Science Foundation
Zhejiang Province Natural Science Foundation of China
111 Project
Shenzhen Science and Technology Project
Guangdong Special Support
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Hardware and Architecture
Reference68 articles.
1. A Fast Uyghur Text Detector for Complex Background Images
2. Cross-modality bridging and knowledge transferring for image understanding;Yan Chenggang;IEEE Trans. Multimedia,2019
3. STAT: Spatial-temporal attention mechanism for video captioning;Yan Chenggang;IEEE Trans. Multimedia,2019
4. Measuring individual video QoE: A survey, and proposal for future directions using social media;Zhu Yi;ACM Trans. Multimedia Comput. Commun. Appl.,2018
Cited by
101 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献