Having Difficulty Understanding Manuals? Automatically Converting User Manuals into Instructional Videos-Reference-Cited by-同舟云学术

Having Difficulty Understanding Manuals? Automatically Converting User Manuals into Instructional Videos

Published:2024-06-17 Issue:EICS Volume:8 Page:1-19
ISSN:2573-0142
Container-title:Proceedings of the ACM on Human-Computer Interaction
language:en
Short-container-title:Proc. ACM Hum.-Comput. Interact.

Author:

Liu Songsong¹^ORCID,Wang Shu¹^ORCID,Sun Kun¹^ORCID

Affiliation:

1. George Mason University, Fairfax, VA, USA

Abstract

While users tend to perceive instructional videos as an experience rather than a lesson with a set of instructions, instructional videos are more effective and appealing than textual user manuals and eliminate the ambiguity in text-based descriptions. However, most software vendors only offer document manuals that describe how to install and use their software, leading burden for non-professionals to comprehend the instructions. In this paper, we present a framework called M2V to generate instructional videos automatically based on the provided instructions and images in user manuals. M2V is a two-step framework. First, an action sequence is extracted from the given user manual via natural language processing and computer vision techniques. Second, M2V operates the software sequentially based on the extracted actions; meanwhile, the operation procedure is recorded into an instructional video. We evaluate the usability of automatically generated instructional videos via user studies and an online survey. The evaluation results show, with our toolkit, the generated instructional videos can better assist non-professional end users with the software operations. Moreover, more than 85% of survey participants prefer to use the instructional videos rather than the original user manuals.

Funder

ONR grant

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3660245

Reference44 articles.

1. Accessible skimming

2. C Al Blackwell. 1995. A good installation guide increases user satisfaction and reduces support costs. Technical communication 42, 1 (1995), 56--60.

3. Alexey Bochkovskiy, Chien-Yao Wang, and Hong-Yuan Mark Liao. 2020. Yolov4: Optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020).

4. Reinforcement learning for mapping instructions to actions

5. GUI testing using computer vision