Affiliation:
1. University of North Carolina at Chapel Hill, Chapel Hill, NC
2. Tong Ji University, Shanghai, P.R. China
Abstract
Large and high-dimensional real-world datasets are being gathered across a wide range of application disciplines to enable data-driven decision making. Interactive data visualization can play a critical role in allowing domain experts to select and analyze data from these large collections. However, there is a critical mismatch between the very large number of dimensions in complex real-world datasets and the much smaller number of dimensions that can be concurrently visualized using modern techniques. This gap in dimensionality can result in high levels of selection bias that go unnoticed by users. The bias can in turn threaten the very validity of any subsequent insights. This article describes Adaptive Contextualization (AC), a novel approach to interactive visual data selection that is specifically designed to combat the invisible introduction of selection bias. The AC approach (1) monitors and models a user’s visual data selection activity, (2) computes metrics over that model to quantify the amount of selection bias after each step, (3) visualizes the metric results, and (4) provides interactive tools that help users assess and avoid bias-related problems. This article expands on an earlier article presented at ACM IUI 2016 [16] by providing a more detailed review of the AC methodology and additional evaluation results.
Funder
National Science Foundation
Data Fellow award from the National Consortium for Data Science
Publisher
Association for Computing Machinery (ACM)
Subject
Artificial Intelligence,Human-Computer Interaction
Reference43 articles.
1. Rapid-Learning System for Cancer Care
2. The Revised CONSORT Statement for Reporting Randomized Trials: Explanation and Elaboration
3. M. Ankerst S. Berchtold and D. A. Keim. 1998. Similarity clustering of dimensions for an enhanced visualization of multidimensional data. 52--60. M. Ankerst S. Berchtold and D. A. Keim. 1998. Similarity clustering of dimensions for an enhanced visualization of multidimensional data. 52--60.
4. L. Bavoil S. P. Callahan P. J. Crossno J. Freire C. E. Scheidegger C. T. Silva and H. T. Vo. 2005. VisTrails: Enabling interactive multiple-view visualizations. In IEEE Visualization. 135--142. L. Bavoil S. P. Callahan P. J. Crossno J. Freire C. E. Scheidegger C. T. Silva and H. T. Vo. 2005. VisTrails: Enabling interactive multiple-view visualizations. In IEEE Visualization. 135--142.
5. Improving the Quality of Reporting of Randomized Controlled Trials
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献