Video accessibility enhancement for hearing-impaired users

Author:

Hong Richang (1), Wang Meng (1), Yuan Xiao-Tong (2), Xu Mengdi (3), Jiang Jianguo (4), Yan Shuicheng (2), Chua Tat-Seng (2)

Affiliation:

1. Hefei University of Technology, Hefei, China

2. National University of Singapore, Singapore

3. National University of Singapore

4. Hefei University of Technology

Abstract

More than 66 million people suffer from hearing impairment, and the loss of audio information makes it difficult for them to understand video content. If scripts are available, captioning technology can help to a certain degree by displaying the scripts in sync with video playback. However, we show that existing captioning techniques are far from satisfactory in helping hearing-impaired audiences enjoy videos. In this article, we introduce a scheme that enhances video accessibility using a Dynamic Captioning approach, which draws on a rich set of technologies including face detection and recognition, visual saliency analysis, and text-speech alignment. Unlike existing methods, which can be categorized as static captioning, dynamic captioning places scripts at suitable positions to help the hearing-impaired audience better recognize the speaking characters. In addition, it progressively highlights the scripts word by word by aligning them with the speech signal, and it illustrates the variation of voice volume. In this way, the audience can better track the scripts and perceive the moods conveyed by the variation of volume. We implemented the technology on 20 video clips and conducted an in-depth study with 60 real hearing-impaired users. The results demonstrated the effectiveness and usefulness of the video accessibility enhancement scheme.
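
The word-by-word highlighting and volume illustration described in the abstract can be pictured with a minimal sketch. This is not the authors' implementation. It assumes that text-speech alignment has already produced a start and end time for each word, and the names AlignedWord, render_caption, and volume_level, as well as the bracket notation and the five-step volume scale, are hypothetical choices made only for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class AlignedWord:
    """One script word with its (assumed, precomputed) speech timing in seconds."""
    text: str
    start: float
    end: float


def highlighted_count(words: List[AlignedWord], t: float) -> int:
    """Count the words whose speech has started by playback time t."""
    return sum(1 for w in words if w.start <= t)


def render_caption(words: List[AlignedWord], t: float) -> str:
    """Render the caption at time t, marking already-spoken words with brackets."""
    n = highlighted_count(words, t)
    parts = [f"[{w.text}]" if i < n else w.text for i, w in enumerate(words)]
    return " ".join(parts)


def volume_level(rms: float, rms_max: float, levels: int = 5) -> int:
    """Map a signal energy value onto a discrete indicator level in 0..levels.

    The linear mapping and the number of levels are illustrative assumptions.
    """
    if rms_max <= 0:
        return 0
    ratio = min(max(rms / rms_max, 0.0), 1.0)  # clamp to [0, 1]
    return int(ratio * levels + 0.5)  # round half up


if __name__ == "__main__":
    script = [
        AlignedWord("How", 0.0, 0.3),
        AlignedWord("are", 0.3, 0.5),
        AlignedWord("you", 0.5, 0.9),
    ]
    for t in (0.1, 0.4, 0.8):
        print(f"t={t:.1f}s  {render_caption(script, t)}")
```

A player would call render_caption once per frame with the current playback time, and could use volume_level to size or color an indicator next to the caption.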

Funder

National Research Foundation-Prime Minister's office, Republic of Singapore

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications, Hardware and Architecture

References: 37 articles.

1. Captioned television for the deaf;Boyd J.;Am Ann Hear. Impaired,1972

Cited by 49 articles.

1. Eye Gaze Analysis Towards an AI System for Dynamic Content Layout;Image Analysis and Processing - ICIAP 2023 Workshops;2024

2. "It's Not an Issue of Malice, but of Ignorance";Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies;2023-09-27

3. Accessibility Research in Digital Audiovisual Media: What Has Been Achieved and What Should Be Done Next?;Proceedings of the 2023 ACM International Conference on Interactive Media Experiences;2023-06-12

4. Understanding How Deaf and Hard of Hearing Viewers Visually Explore Captioned Live TV News;20th International Web for All Conference;2023-04-30

5. Who is speaking: Unpacking In-text Speaker Identification Preference of Viewers who are Deaf and Hard of Hearing while Watching Live Captioned Television Program;20th International Web for All Conference;2023-04-30
