Affiliation:
1. MIT
2. University of Illinois (UIUC)
3. Google
Abstract
Data analysts often build visualizations as the first step in their analytical workflow. However, when working with high-dimensional datasets, identifying visualizations that show relevant or desired trends in data can be laborious. We propose S
ee
DB, a visualization recommendation engine to facilitate fast visual analysis: given a subset of data to be studied, S
ee
DB intelligently explores the space of visualizations, evaluates promising visualizations for trends, and recommends those it deems most "useful" or "interesting". The two major obstacles in recommending interesting visualizations are (a)
scale
: evaluating a large number of candidate visualizations while responding within interactive time scales, and (b)
utility
: identifying an appropriate metric for assessing interestingness of visualizations. For the former, S
ee
DB introduces
pruning optimizations
to quickly identify high-utility visualizations and
sharing optimizations
to maximize sharing of computation across visualizations. For the latter, as a first step, we adopt a deviation-based metric for visualization utility, while indicating how we may be able to generalize it to other factors influencing utility. We implement S
ee
DB as a middleware layer that can run on top of any DBMS. Our experiments show that our framework can identify interesting visualizations with high accuracy. Our optimizations lead to
multiple orders of magnitude speedup
on relational row and column stores and provide recommendations at interactive time scales. Finally, we demonstrate via a user study the effectiveness of our deviation-based utility metric and the value of recommendations in supporting visual analytics.
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Cited by
160 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献