Learning Streamed Attention Network from Descriptor Images for Cross-Resolution 3D Face Recognition-Reference-Cited by-同舟云学术

Learning Streamed Attention Network from Descriptor Images for Cross-Resolution 3D Face Recognition

Published:2023-01-23 Issue:1s Volume:19 Page:1-20
ISSN:1551-6857
Container-title:ACM Transactions on Multimedia Computing, Communications, and Applications
language:en
Short-container-title:ACM Trans. Multimedia Comput. Commun. Appl.

Author:

Neto João Baptista Cardia¹,Ferrari Claudio²,Marana Aparecido Nilceu³,Berretti Stefano⁴,Del Bimbo Alberto⁴

Affiliation:

1. São Paulo State Technological College (FATEC), São Paulo, Brazil

2. Department of Architecture and Engineering, University of Parma and University of Florence, Firenze FI, Italy

3. RECOGNA Laboratory, São Paulo State University (UNESP), São Paulo, Brazil

4. MICC, University of Florence, Firenze FI, Italy

Abstract

In this article, we propose a hybrid framework for cross-resolution 3D face recognition which utilizes a Streamed Attention Network (SAN) that combines handcrafted features with Convolutional Neural Networks (CNNs). It consists of two main stages: first, we process the depth images to extract low-level surface descriptors and derive the corresponding Descriptor Images (DIs), represented as four-channel images. To build the DIs, we propose a variation of the 3D Local Binary Pattern (3DLBP) operator that encodes depth differences using a sigmoid function. Then, we design a CNN that learns from these DIs. The peculiarity of our solution consists in processing each channel of the input image separately, and fusing the contribution of each channel by means of both self- and cross-attention mechanisms. This strategy showed two main advantages over the direct application of Deep-CNN to depth images of the face; on the one hand, the DIs can reduce the diversity between high- and low-resolution data by encoding surface properties that are robust to resolution differences. On the other, it allows a better exploitation of the richer information provided by low-level features, resulting in improved recognition. We evaluated the proposed architecture in a challenging cross-dataset, cross-resolution scenario. To this aim, we first train the network on scanner-resolution 3D data. Next, we utilize the pre-trained network as feature extractor on low-resolution data, where the output of the last fully connected layer is used as face descriptor. Other than standard benchmarks, we also perform experiments on a newly collected dataset of paired high- and low-resolution 3D faces. We use the high-resolution data as gallery, while low-resolution faces are used as probe, allowing us to assess the real gap existing between these two types of data. Extensive experiments on low-resolution 3D face benchmarks show promising results with respect to state-of-the-art methods.

Funder

CAPES - Brazil

Petrobras/Fundunesp

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture

Link

https://dl.acm.org/doi/pdf/10.1145/3527158

Reference48 articles.

1. Efficient object tracking using hierarchical convolutional features model and correlation filters;Abbass Mohammed Y.;The Visual Computer,2021

2. Open-set single-sample face recognition in video surveillance using fuzzy ARTMAP

3. 3D Face Recognition Using Isogeodesic Stripes

4. A method for registration of 3-D shapes

5. Reconstructing High-Resolution Face Models From Kinect Depth Sequences

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Enhancing 2D-3D facial recognition accuracy of truncated-hiden faces using fused multi-model biometric deep features;Multimedia Tools and Applications;2024-05-09

2. 3D face recognition using image decomposition and POEM descriptor;Signal, Image and Video Processing;2024-04-17

3. Detection of Adversarial Facial Accessory Presentation Attacks Using Local Face Differential;ACM Transactions on Multimedia Computing, Communications, and Applications;2024-03-27

4. Human Detection and Tracking Based on YOLOv3 and DeepSORT;Communication and Intelligent Systems;2023

5. Biometric Recognition Systems: A Short Survey;Lecture Notes in Networks and Systems;2023