Face-based age estimation using improved Swin Transformer with attention-based convolution

Author:

Shi Chaojun, Zhao Shiwei, Zhang Ke, Wang Yibo, Liang Longping

Abstract

Recently, Transformer models, which are based on the multi-head self-attention mechanism, have become a new direction in computer vision. Compared with convolutional neural networks, the Transformer uses self-attention to capture global contextual information and extracts stronger features by learning the relationships between different features, and it has achieved good results in many vision tasks. In face-based age estimation, facial patches that contain rich age-specific information are critical. This study proposes an attention-based convolution (ABC) age estimation framework, called improved Swin Transformer with ABC, which consists of two separate components: ABC and the Swin Transformer. ABC extracts facial patches containing rich age-specific information using a shallow convolutional network and a multi-head attention mechanism. The features obtained by ABC are then spliced with the flattened image in the Swin Transformer, and the combined sequence is input to the Swin Transformer to predict the age of the face in the image. By splicing the important regions that contain rich age-specific information into the original image, the framework makes full use of the Swin Transformer's ability to model long-range dependencies, that is, to extract stronger features by learning the dependencies between different features. ABC also introduces a diversity loss to guide the training of the self-attention mechanism, reducing the overlap between patches so that diverse and important patches are discovered. Extensive experiments show that the proposed framework outperforms several state-of-the-art methods on age estimation benchmark datasets.
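The abstract gives no formulas, so the following is only a rough sketch of two ideas it describes: appending attention-selected patch features to the flattened image tokens, and penalizing overlap between attention heads. The function names (`diversity_loss`, `splice_tokens`), the overlap measure (sum of element-wise minima), and the attention-weighted pooling are all assumptions made for illustration, not the authors' definitions. The real Swin Transformer also uses windowed, hierarchical attention, which this flat token list ignores.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def diversity_loss(attn_maps):
    """Mean pairwise overlap between the heads' attention distributions.

    Assumed form: overlap(a, b) = sum_i min(a_i, b_i), which is 1.0 for
    identical heads and near 0.0 when heads attend to different patches.
    """
    pairs = [(a, b) for i, a in enumerate(attn_maps) for b in attn_maps[i + 1:]]
    if not pairs:
        return 0.0
    overlaps = [sum(min(x, y) for x, y in zip(a, b)) for a, b in pairs]
    return sum(overlaps) / len(pairs)


def splice_tokens(image_tokens, attn_maps):
    """Append one attention-pooled feature per head to the token sequence.

    Assumption: each head's "important patch" feature is the
    attention-weighted average of the flattened image tokens.
    """
    dim = len(image_tokens[0])
    extra = [
        [sum(w * tok[d] for w, tok in zip(attn, image_tokens)) for d in range(dim)]
        for attn in attn_maps
    ]
    return image_tokens + extra


# Toy example: 4 patch tokens of dimension 2, and 2 attention heads.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
head_scores = [[5.0, 0.0, 0.0, 0.0], [0.0, 0.0, 5.0, 0.0]]
attn = [softmax(s) for s in head_scores]

sequence = splice_tokens(tokens, attn)        # 4 image tokens + 2 spliced tokens
distinct = diversity_loss(attn)               # heads focus on different patches
collapsed = diversity_loss([attn[0], attn[0]])  # heads focus on the same patch
```

In this toy setting the loss is small when the two heads attend to different patches and close to 1.0 when they collapse onto the same patch, which is the behavior a term that reduces overlap between patches would need.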

Publisher

Frontiers Media SA

Subject

General Neuroscience

Cited by 6 articles.

1. Swin-FER: Swin Transformer for Facial Expression Recognition;Applied Sciences;2024-07-14

2. Relative Age Position Learning for Face-Based Age Estimation;IEEE Access;2024

3. Age-Related Face Recognition Using Siamese Networks and Vision Transformers;Communications in Computer and Information Science;2024

4. Transformative Approach for Heart Rate Prediction from Face Videos Using Local and Global Multi-Head Self-Attention;Technologies;2023-12-22

5. Competitive-Driven Learning for Image Ordinal Classification;2023 3rd International Conference on Electronic Information Engineering and Computer Communication (EIECC);2023-12-22
