Can Computers Outperform Humans in Detecting User Zone-Outs? Implications for Intelligent Interfaces

Published: 2022-01-16 Issue: 2 Volume: 29 Page: 1-33
ISSN: 1073-0516
Container-title: ACM Transactions on Computer-Human Interaction
Language: en
Short-container-title: ACM Trans. Comput.-Hum. Interact.

Author:

Bosch Nigel¹, D'Mello Sidney K.²

Affiliation:

1. School of Information Sciences and Department of Educational Psychology, University of Illinois at Urbana-Champaign, Champaign, IL

2. Department of Computer Science and Institute of Cognitive Science, University of Colorado Boulder, Boulder, CO

Abstract

The ability to identify whether a user is “zoning out” (mind wandering) from video has many HCI applications (e.g., distance learning, high-stakes vigilance tasks). However, it remains unknown how well humans can perform this task, how they compare to automatic computerized approaches, and how a fusion of the two might improve accuracy. We analyzed videos of users’ faces and upper bodies recorded 10 s prior to self-reported mind wandering (i.e., ground truth) while they engaged in a computerized reading task. We found that a state-of-the-art machine learning model had comparable accuracy to aggregated judgments of nine untrained human observers (area under receiver operating characteristic curve [AUC] = .598 versus .589). A fusion of the two (AUC = .644) outperformed each, presumably because each focused on complementary cues. Furthermore, adding more humans beyond 3–4 observers yielded diminishing returns. We discuss implications of human–computer fusion as a means to improve accuracy in complex tasks.
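
To make the evaluation described in the abstract concrete, the following is a minimal Python sketch of how AUC could be computed for a model, for aggregated observer judgments, and for a fusion of the two. It is an illustration under stated assumptions, not the authors' method: the abstract does not say how human judgments were aggregated or how the fusion was performed, so the mean-of-votes aggregation, the weighted-average fusion, the function names, and the toy data are all hypothetical. The toy numbers are invented and do not reproduce the reported AUCs.

```python
def auc(labels, scores):
    """Area under the ROC curve, computed as the fraction of
    (positive, negative) pairs ranked correctly; ties count as half."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def aggregate_observers(ratings):
    """Assumed aggregation: mean of binary judgments per video clip."""
    return [sum(clip) / len(clip) for clip in ratings]


def fuse(model_scores, human_scores, weight=0.5):
    """Assumed fusion: weighted average of model and human scores."""
    return [weight * m + (1.0 - weight) * h
            for m, h in zip(model_scores, human_scores)]


# Invented toy data: 1 = self-reported mind wandering, 0 = not.
labels = [1, 1, 1, 0, 0, 0]
model_scores = [0.9, 0.4, 0.6, 0.5, 0.2, 0.7]
# One row per clip, one binary judgment per (hypothetical) observer.
observer_ratings = [
    [1, 0, 1],
    [1, 1, 0],
    [0, 0, 1],
    [0, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]

human_scores = aggregate_observers(observer_ratings)
fused_scores = fuse(model_scores, human_scores)

print("model AUC:", round(auc(labels, model_scores), 3))
print("human AUC:", round(auc(labels, human_scores), 3))
print("fused AUC:", round(auc(labels, fused_scores), 3))
```

In this toy case the model and the observers misrank different clips, so averaging their scores ranks the clips better than either source alone, which is the kind of complementarity the abstract suggests.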

Funder

National Science Foundation

Publisher

Association for Computing Machinery (ACM)

Subject

Human-Computer Interaction

Link

https://dl.acm.org/doi/pdf/10.1145/3481889
