Abstract
AbstractHigh-dimensional data are a key study object for both machine learning (ML) and information visualization. On the visualization side, dimensionality reduction (DR) methods, also called projections, are the most suited techniques for visual exploration of large and high-dimensional datasets. On the ML side, high-dimensional data are generated and processed by classifiers and regressors, and these techniques increasingly require visualization for explanation and exploration. In this paper, we explore how both fields can help each other in achieving their respective aims. In more detail, we present both examples that show how DR can be used to understand and engineer better ML models (seeing helps learning) and also applications of DL for improving the computation of direct and inverse projections (learning helps seeing). We also identify existing limitations of DR methods used to assist ML and of ML techniques applied to improve DR. Based on the above, we propose several high-impact directions for future work that exploit the analyzed ML-DR synergy.
Publisher
Springer Science and Business Media LLC
Reference89 articles.
1. Munzner T. Visualization analysis and design: principles, techniques, and practice. Boca Raton: CRC Press; 2014.
2. Telea AC. Data visualization—principles and practice. 2nd ed. Abingdon: CRC Press/Taylor and Francis; 2014.
3. Liu S, Maljovec D, Wang B, Bremer P-T, Pascucci V. Visualizing high-dimensional data: advances in the past decade. IEEE TVCG. 2015;23(3):1249–68.
4. Yates A, Webb A, Sharpnack M, Chamberlin H, Huang K, Machiraju R. Visualizing multidimensional data with glyph SPLOMs. CGF. 2014;33(3):301–10.
5. Lehmann DJ, Albuquerque G, Eisemann M, Magnor M, Theisel H. Selecting coherent and relevant plots in large scatterplot matrices. Comput Graph Forum. 2012;31(6):1895–908.