A Taxonomy of Property Measures to Unify Active Learning and Human-centered Approaches to Data Labeling-Reference-Cited by-同舟云学术

A Taxonomy of Property Measures to Unify Active Learning and Human-centered Approaches to Data Labeling

Published:2021-12-31 Issue:3-4 Volume:11 Page:1-42
ISSN:2160-6455
Container-title:ACM Transactions on Interactive Intelligent Systems
language:en
Short-container-title:ACM Trans. Interact. Intell. Syst.

Author:

Bernard Jürgen¹,Hutter Marco²,Sedlmair Michael³,Zeppelzauer Matthias⁴,Munzner Tamara¹

Affiliation:

1. University of British Columbia, Vancouver, Canada

2. TU Darmstadt, Darmstadt, Germany

3. University of Stuttgart, Stuttgart, Germany

4. St. Pölten University of Applied Sciences, St. Pölten, Austria

Abstract

Strategies for selecting the next data instance to label, in service of generating labeled data for machine learning, have been considered separately in the machine learning literature on active learning and in the visual analytics literature on human-centered approaches. We propose a unified design space for instance selection strategies to support detailed and fine-grained analysis covering both of these perspectives. We identify a concise set of 15 properties, namely measureable characteristics of datasets or of machine learning models applied to them, that cover most of the strategies in these literatures. To quantify these properties, we introduce Property Measures (PM) as fine-grained building blocks that can be used to formalize instance selection strategies. In addition, we present a taxonomy of PMs to support the description, evaluation, and generation of PMs across four dimensions: machine learning (ML) Model Output , Instance Relations , Measure Functionality , and Measure Valence . We also create computational infrastructure to support qualitative visual data analysis: a visual analytics explainer for PMs built around an implementation of PMs using cascades of eight atomic functions. It supports eight analysis tasks, covering the analysis of datasets and ML models using visual comparison within and between PMs and groups of PMs, and over time during the interactive labeling process. We iteratively refined the PM taxonomy, the explainer, and the task abstraction in parallel with each other during a two-year formative process, and show evidence of their utility through a summative evaluation with the same infrastructure. This research builds a formal baseline for the better understanding of the commonalities and differences of instance selection strategies, which can serve as the stepping stone for the synthesis of novel strategies in future work.

Funder

German Academic Exchange Service

Austrian Research Promotion Agency

Lower Austrian Research and Education Company

National Sciences and Engineering Research Council of Canada

Deutsche Forschungsgemeinschaft

Publisher

Association for Computing Machinery (ACM)

Subject

Artificial Intelligence,Human-Computer Interaction

Link

https://dl.acm.org/doi/pdf/10.1145/3439333

Reference144 articles.

1. ClustMe : A Visual Quality Measure for Ranking Monochrome Scatterplots based on Cluster Patterns

2. Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)

3. Task-Driven Comparison of Topic Models

4. Visual Methods for Analyzing Probabilistic Classification Data

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Unpacking Human-AI interactions: From Interaction Primitives to a Design Space;ACM Transactions on Interactive Intelligent Systems;2024-08-02

2. Towards an understanding and explanation for mixed-initiative artificial scientific text detection;Information Visualization;2024-04-07

3. Understanding Novice's Annotation Process For 3D Semantic Segmentation Task With Human-In-The-Loop;Proceedings of the 29th International Conference on Intelligent User Interfaces;2024-03-18

4. VideoPro: A Visual Analytics Approach for Interactive Video Programming;IEEE Transactions on Visualization and Computer Graphics;2024-01

5. Label Engineering Methods for ML Systems;Lecture Notes in Networks and Systems;2024