Affiliation:
1. The University of Hong Kong
2. University College London
Abstract
We propose a new method for improving the presentation of subtitles in video (e.g., TV and movies). With conventional subtitles, the viewer has to constantly look away from the main viewing area to read the subtitles at the bottom of the screen, which disrupts the viewing experience and causes unnecessary eyestrain. Our method places on-screen subtitles next to the respective speakers to allow the viewer to follow the visual content while simultaneously reading the subtitles. We use novel identification algorithms to detect the speakers based on audio and visual information. Then the placement of the subtitles is determined using global optimization. A comprehensive usability study indicated that our subtitle placement method outperformed both conventional fixed-position subtitling and another previous dynamic subtitling method in terms of enhancing the overall viewing experience and reducing eyestrain.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Hardware and Architecture
Cited by
34 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. “Caption It in an Accessible Way That Is Also Enjoyable”: Characterizing User-Driven Captioning Practices on TikTok;Proceedings of the CHI Conference on Human Factors in Computing Systems;2024-05-11
2. Unspoken Sound: Identifying Trends in Non-Speech Audio Captioning on YouTube;Proceedings of the CHI Conference on Human Factors in Computing Systems;2024-05-11
3. Eye Gaze Analysis Towards an AI System for Dynamic Content Layout;Lecture Notes in Computer Science;2024
4. Automated Conversion of Music Videos into Lyric Videos;Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology;2023-10-29
5. Focus on the Motion: Designing Adaptive Subtitles for Online Fitness Videos to Support Ubiquitous Exercises;2023 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct);2023-10-16