Affiliation:
1. Nokia Bell Labs, 1083 Budapest, Hungary
Abstract
Deep learning models, particularly transformer architectures, have revolutionized various computer vision tasks, including image classification. However, their performance under different types and levels of noise remains a crucial area of investigation. In this study, we explore the noise sensitivity of prominent transformer models trained on the ImageNet dataset. We systematically evaluate 22 transformer variants, ranging from state-of-the-art large-scale models to compact versions tailored for mobile applications, under five common types of image distortions. Our findings reveal diverse sensitivities across different transformer architectures, with notable variations in performance observed under additive Gaussian noise, multiplicative Gaussian noise, Gaussian blur, salt-and-pepper noise, and JPEG compression. Interestingly, we observe a consistent robustness of transformer models to JPEG compression, with top-5 accuracies exhibiting higher resilience to noise compared to top-1 accuracies. Furthermore, our analysis highlights the vulnerability of mobile-oriented transformer variants to various noise types, underscoring the importance of noise robustness considerations in model design and deployment for real-world applications. These insights contribute to a deeper understanding of transformer model behavior under noisy conditions and have implications for improving the robustness and reliability of deep learning systems in practical scenarios.
Reference64 articles.
1. Jenadeleh, M., Pedersen, M., and Saupe, D. (2020). Blind quality assessment of iris images acquired in visible light for biometric recognition. Sensors, 20.
2. Subjective annotation for a frame interpolation benchmark using artefact amplification;Men;Qual. User Exp.,2020
3. Delepoulle, S., Bigand, A., and Renaud, C. (2012, January 6–8). A no-reference computer-generated images quality metric and its application to denoising. Proceedings of the 2012 6th IEEE International Conference Intelligent Systems, Sofia, Bulgaria.
4. Saupe, D., Hahn, F., Hosu, V., Zingman, I., Rana, M., and Li, S. (2016, January 6–8). Crowd workers proven useful: A comparative study of subjective video quality assessment. Proceedings of the QoMEX 2016: 8th International Conference on Quality of Multimedia Experience, Lisbon, Portugal.
5. Subjective image quality assessment with boosted triplet comparisons;Men;IEEE Access,2021