Auxiliary Information Guided Self-attention for Image Quality Assessment-Reference-Cited by-同舟云学术

Auxiliary Information Guided Self-attention for Image Quality Assessment

Published:2024-01-11 Issue:4 Volume:20 Page:1-23
ISSN:1551-6857
Container-title:ACM Transactions on Multimedia Computing, Communications, and Applications
language:en
Short-container-title:ACM Trans. Multimedia Comput. Commun. Appl.

Author:

Yang Jifan¹^ORCID,Wang Zhongyuan¹^ORCID,Wang Guangcheng²^ORCID,Huang Baojin¹^ORCID,Yang Yuhong¹^ORCID,Tu Weiping¹^ORCID

Affiliation:

1. School of Computer, Wuhan University, China

2. School of Transportation and Civil Engineering, Nantong University, China

Abstract

Image quality assessment (IQA) is an important problem in computer vision with many applications. We propose a transformer-based multi-task learning framework for the IQA task. Two subtasks: constructing an auxiliary information error map and completing image quality prediction, are jointly optimized using a shared feature extractor. We use visual transformers (ViT) as a feature extractor for feature extraction and guide ViT to focus on image quality-related features by building auxiliary information error map subtask. In particular, we propose a fusion network that includes a channel focus module. Unlike the fusion methods commonly used in previous IQA methods, we use the fusion network, including the channel attention module, to fuse the auxiliary information error map features with the image features, which facilitates the model to mine the image quality features for more accurate image quality assessment. And by jointly optimizing the two subtasks, ViT focuses more on extracting image quality features and building a more precise mapping from feature representation to quality score. With slight adjustments to the model, our approach can be used in both no-reference (NR) and full-reference (FR) IQA environments. We evaluate the proposed method in multiple IQA databases, showing better performance than state-of-the-art FR and NR IQA methods.

Funder

National Natural Science Foundation of China

Guangdong-Macau Joint Laboratory for Advanced and Intelligent Computing

Guangdong High-Level Innovation Research Institute

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture

Link

https://dl.acm.org/doi/pdf/10.1145/3635716

Reference67 articles.

1. Deep Learning-based Distortion Sensitivity Prediction for Full-Reference Image Quality Assessment

2. Tunç O. Aydın, Rafal Mantiuk, and Hans-Peter Seidel. 2008. Extending quality metrics to full luminance range images. In Human Vision and Electronic Imaging XIII, Vol. 6806. International Society for Optics and Photonics, 109–118.

3. Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment

4. VSNR: A Wavelet-Based Visual Signal-to-Noise Ratio for Natural Images

5. Full-reference Screen Content Image Quality Assessment by Fusing Multilevel Structure Similarity