Context-Aware Automatic Sign Language Video Transcription in Psychiatric Interviews-Reference-Cited by-同舟云学术

Context-Aware Automatic Sign Language Video Transcription in Psychiatric Interviews

Published:2022-03-30 Issue:7 Volume:22 Page:2656
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Pikoulis Erion-Vasilis^ORCID,Bifis Aristeidis^ORCID,Trigka Maria^ORCID,Constantinopoulos Constantinos,Kosmopoulos Dimitrios

Abstract

Sign language (SL) translation constitutes an extremely challenging task when undertaken in a general unconstrained setup, especially in the absence of vast training datasets that enable the use of end-to-end solutions employing deep architectures. In such cases, the ability to incorporate prior information can yield a significant improvement in the translation results by greatly restricting the search space of the potential solutions. In this work, we treat the translation problem in the limited confines of psychiatric interviews involving doctor-patient diagnostic sessions for deaf and hard of hearing patients with mental health problems.To overcome the lack of extensive training data and be able to improve the obtained translation performance, we follow a domain-specific approach combining data-driven feature extraction with the incorporation of prior information drawn from the available domain knowledge. This knowledge enables us to model the context of the interviews by using an appropriately defined hierarchical ontology for the contained dialogue, allowing for the classification of the current state of the interview, based on the doctor’s question. Utilizing this information, video transcription is treated as a sentence retrieval problem. The goal is predicting the patient’s sentence that has been signed in the SL video based on the available pool of possible responses, given the context of the current exchange. Our experimental evaluation using simulated scenarios of psychiatric interviews demonstrate the significant gains of incorporating context awareness in the system’s decisions.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/22/7/2656/pdf

Reference52 articles.

1. World Federation of the Deafhttps://wfdeaf.org/our-work/

2. Interpreted writing center tutorials with college-level deaf students

3. Phonological and prosodic layering of nonmanuals in American Sign Language;Wilbur,2013

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Hybrid InceptionNet Based Enhanced Architecture for Isolated Sign Language Recognition;IEEE Access;2024

2. An Evaluation of Portuguese to Libras Translator Apps Applied to the Medical Context;Lecture Notes in Computer Science;2024

3. Procrustes-DTW: Dynamic Time Warping Variant for the Recognition of Sign Language Utterances;2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW);2023-06-04

4. Dataset Transformation System for Sign Language Recognition Based on Image Classification Network;Applied Sciences;2022-10-07