Impact of face swapping and data augmentation on sign language recognition


Perea-Trigo Marina, López-Ortiz Enrique J., Soria-Morillo Luis M., Álvarez-García Juan A., Vegas-Olmos J. J.


Abstract: This study addresses the challenge of improving communication between the deaf and hearing communities by exploring different sign language recognition (SLR) techniques. Privacy concerns and the need for validation by interpreters make large-scale sign language (SL) datasets difficult to create. The authors address this by presenting a new dataset for isolated Spanish sign language recognition, CALSE-1000, consisting of 5000 videos representing 1000 glosses, recorded with various signers and scenarios. The study also proposes using computer vision techniques, such as face swapping and affine transformations, to augment the SL dataset and improve the accuracy of an I3D model trained on the augmented data. The results show that including these augmentations during training improves top-1 accuracy by up to 11.7 points, top-5 accuracy by up to 8.8 points, and top-10 accuracy by up to 9 points. The same augmentations have the potential to improve the state of the art on other datasets and with other models. Furthermore, testing on a dataset in which faces are omitted confirms the importance of facial expressions to the model, and the analysis shows how face swapping can be used to add new anonymous signers without the costly and time-consuming process of recording.
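The top-1, top-5 and top-10 figures quoted in the abstract are top-k accuracies: a prediction counts as correct when the true gloss is among the k highest-scoring classes. The following is a minimal sketch of that metric in plain Python, not the authors' evaluation code; the function name `top_k_accuracy` and the toy scores are assumptions made for illustration.

```python
from typing import Sequence


def top_k_accuracy(
    scores: Sequence[Sequence[float]],
    labels: Sequence[int],
    k: int,
) -> float:
    """Fraction of samples whose true label is among the k highest-scoring classes.

    `scores[i][c]` is the model score for class `c` on sample `i`
    (hypothetical layout, chosen for this illustration).
    """
    if len(scores) != len(labels) or not labels:
        raise ValueError("scores and labels must be non-empty and the same length")
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted from highest to lowest score.
        ranked = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
        if label in ranked[:k]:
            hits += 1
    return hits / len(labels)


# Toy example with 3 classes and 4 samples (made-up numbers).
scores = [
    [0.1, 0.7, 0.2],
    [0.5, 0.3, 0.2],
    [0.2, 0.3, 0.5],
    [0.6, 0.1, 0.3],
]
labels = [1, 1, 0, 0]

top1 = top_k_accuracy(scores, labels, k=1)
top2 = top_k_accuracy(scores, labels, k=2)
print(f"top-1: {top1:.2f}, top-2: {top2:.2f}")
```

A gain "in points" is then a difference between two such accuracies expressed as percentages, for example an increase from 50.0% to 61.7% would be a gain of 11.7 points.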


Springer Science and Business Media LLC







