Semi-automation of gesture annotation by machine learning and human collaboration

Published:2022-02-25 Issue:3 Volume:56 Page:673-700
ISSN:1574-020X
Container-title:Language Resources and Evaluation
language:en
Short-container-title:Lang Resources & Evaluation

Author:

Ienaga Naoto, Cravotta Alice, Terayama Kei, Scotney Bryan W., Saito Hideo, Busà M. Grazia

Abstract

Gesture and multimodal communication researchers typically annotate video data manually, even though this can be very time-consuming. This work proposes a gesture detection method as a fundamental step towards a semi-automatic gesture annotation tool. The method can be applied to RGB videos and requires annotations of part of a video as input. It combines a pose estimation method with active learning. The experiment shows that if about 27% of the video is annotated, the remaining parts of the video can be annotated automatically with an F-score of at least 0.85. Users can first run the tool with a small number of annotations. If the predicted annotations for the remainder of the video are not satisfactory, they can add further annotations and run the tool again. The code has been released so that other researchers and practitioners can use the results of this research. The tool has been confirmed to work in conjunction with ELAN.
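To make the reported F-score concrete, here is a minimal sketch of how an F-score could be computed for binary gesture labels. It assumes per-frame labels (1 = gesture, 0 = no gesture); the abstract does not say at which granularity the score is computed, so the function name `frame_f_score` and the ten example frames are hypothetical and not taken from the paper or its released code.

```python
from typing import Sequence


def frame_f_score(truth: Sequence[int], predicted: Sequence[int]) -> float:
    """F-score for binary per-frame labels (1 = gesture, 0 = no gesture).

    Assumption: scoring is done frame by frame; the paper may score differently.
    """
    if len(truth) != len(predicted):
        raise ValueError("label sequences must have the same length")
    # Count true positives, false positives, and false negatives.
    tp = sum(1 for t, p in zip(truth, predicted) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(truth, predicted) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(truth, predicted) if t == 1 and p == 0)
    if tp == 0:
        # No correctly detected gesture frames: define the score as 0.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    # Harmonic mean of precision and recall.
    return 2 * precision * recall / (precision + recall)


# Hypothetical labels for ten frames, made up for illustration.
truth = [0, 1, 1, 1, 0, 0, 1, 1, 0, 0]
predicted = [0, 1, 1, 0, 0, 1, 1, 1, 0, 0]
score = frame_f_score(truth, predicted)
# Here precision and recall are both 4/5, so the F-score is 0.8 up to rounding.
```

In this toy case one gesture frame is missed and one non-gesture frame is falsely flagged, which already pulls the score below the 0.85 reported in the abstract.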

Funder

Japan Society for the Promotion of Science

Publisher

Springer Science and Business Media LLC

Subject

Library and Information Sciences, Linguistics and Language, Education, Language and Linguistics

Link

https://link.springer.com/content/pdf/10.1007/s10579-022-09586-4.pdf

References (75 articles)

1. Bressem, J., & Müller, C. (2014). The family of away gestures: Negation, refusal, and negative assessment. Body–language–communication: An International Handbook on Multimodality in Human Interaction, 2, 1592–1604. https://doi.org/10.1515/9783110302028.1592

2. Calbris, G. (2003). From cutting an object to a clear cut analysis: Gesture as the representation of a preconceptual schema linking concrete actions to abstract notions. Gesture, 3(1), 19–46. https://doi.org/10.1075/gest.3.1.03cal

3. Camgoz, N. C., Hadfield, S., Koller, O., & Bowden, R. (2016). Using convolutional 3D neural networks for user-independent continuous gesture recognition. In 2016 23rd International Conference on Pattern Recognition (ICPR), pp. 49–54. https://doi.org/10.1109/ICPR.2016.7899606