Attention‐based hierarchical pyramid feature fusion structure for efficient face recognition-Reference-Cited by-同舟云学术

Attention‐based hierarchical pyramid feature fusion structure for efficient face recognition

Published:2023-04-11 Issue:8 Volume:17 Page:2399-2409
ISSN:1751-9659
Container-title:IET Image Processing
language:en
Short-container-title:IET Image Processing

Author:

Dai Yi¹,Sun Kai¹,Huang Wei¹^ORCID,Zhang Dawei²,Dai Gaojie²

Affiliation:

1. School of Electronic Information Engineering Inner Mongolia University Hohhot People's Republic of China

2. State Grid Yichuan County Electric Power Supply Branch Yichuan People's Republic of China

Abstract

AbstractDeep convolutional neural networks (CNN) have become the main method for face recognition (FR). To deploy deep CNN models on embedded and mobile devices, several lightweight FR models have been proposed. However, multi‐scale facial features are seldom considered in these approaches. To overcome this limitation, an attention‐based hierarchical pyramid feature fusion (AHPF) structure was proposed in this paper. Specifically, hierarchical multi‐scale features were directly extracted from the backbone based on its pyramidal hierarchy, and the bidirectional cross‐scale connection was used to better combine the high‐level global features with low‐level local features. In addition, instead of simple concatenation or summation, an attention‐based feature fusion mechanism was used to highlight the most recognizable facial patches, and to address the unequal contribution to the output during the fusing process. Based on the AHPF structure and efficient backbones, multiple sizes of lightweight FR models were presented, called HSFNet. After an extensive experimental evaluation involving 10 mainstream benchmarks, the proposed models consistently achieved state‐of‐the‐art FR performance compared to other lightweight FR models with same level of model complexity. With only 0.659M parameters and 94.94M FLOPs, our HSFNet‐05‐M exhibited a performance competitive with recent top‐ranked FR models containing up to 4M parameters and 500M FLOPs.

Funder

National Natural Science Foundation of China

Natural Science Foundation of Inner Mongolia

Publisher

Institution of Engineering and Technology (IET)

Subject

Electrical and Electronic Engineering,Computer Vision and Pattern Recognition,Signal Processing,Software

Reference47 articles.

1. He K. Zhang X. Ren S. Sun J.:Deep residual learning for image recognition. In:Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp.770–778. Las Vegas NV USA (2016)

2. Parkhi O.M. Vedaldi A. Zisserman A.:Deep face recognition. In:Proceedings of the British Machine Vision Conference pp.1–12. Swansea UK (2015)

3. Liu W. Wen Y. Yu Z. Li M. Raj B. Sphereface S.L.:Deep hypersphere embedding for face recognition. In:Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp.212–220. Honolulu HI USA (2017)

4. Deng J. Guo J. Xue N. Zafeiriou S.:Arcface: Additive angular margin loss for deep face recognition. In:Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp.4690–4699. Long Beach CA USA (2019)

5. Edge Intelligence: Empowering Intelligence to the Edge of Network

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The Effects of AI-Driven Face Restoration on Forensic Face Recognition;Applied Sciences;2024-04-29