Visual Analytics for Explainable and Trustworthy Machine Learning

Author:

,Chatzimparmpas AngelosORCID,

Abstract

The deployment of artificial intelligence solutions and machine learning research has exploded in popularity in recent years, with numerous types of models proposed to interpret and predict patterns and trends in data from diverse disciplines. However, as the complexity of these models grows, it becomes increasingly difficult for users to evaluate and rely on the model results, since their inner workings are mostly hidden in black boxes, which are difficult to trust in critical decision-making scenarios. While automated methods can partly handle these problems, recent research findings suggest that their combination with innovative methods developed within information visualization and visual analytics can lead to further insights gained from models and, consequently, improve their predictive ability and enhance trustworthiness in the entire process. Visual analytics is the area of research that studies the analysis of vast and intricate information spaces by combining statistical and machine learning models with interactive visual interfaces. By following this methodology, human experts can better understand such spaces and apply their domain expertise in the process of building and improving the underlying models. The primary goals of this dissertation are twofold, focusing on (1) methodological aspects, by conducting qualitative and quantitative meta-analyses to support the visualization research community in making sense of its literature and to highlight unsolved challenges, as well as (2) technical solutions, by developing visual analytics approaches for various machine learning models, such as dimensionality reduction and ensemble learning methods. Regarding the first goal, we define, categorize, and examine in depth the means for visual coverage of the different trust levels at each stage of a typical machine learning pipeline and establish a design space for novel visualizations in the area. Regarding the second goal, we discuss multiple visual analytics tools and systems implemented by us to facilitate the underlying research on the various stages of the machine learning pipeline, i.e., data processing, feature engineering, hyperparameter tuning, understanding, debugging, refining, and comparing models. Our approaches are data-agnostic, but mainly target tabular data with meaningful attributes in diverse domains, such as health care and finance. The applicability and effectiveness of this work were validated with case studies, usage scenarios, expert interviews, user studies, and critical discussions of limitations and alternative designs. The results of this dissertation provide new avenues for visual analytics research in explainable and trustworthy machine learning.

Publisher

Linnaeus University

Reference744 articles.

1. [1]Mostafa M. Abbas, Michaël Aupetit, Michael Sedlmair, and Halima Bensmail. ClustMe: A visual quality measure for ranking monochrome scatterplots based on cluster patterns. Computer Graphics Forum, 38(3):225- 236, June 2019. doi:10.1111/cgf.13684.

2. [2] David Abramov, Jasmine Otto, Mahika Dubey, Cassia Artanegara, Pierre Boutillier, Walter Fontana, and Angus G. Forbes. RuleVis: Constructing patterns and rules for rule-based models. In Proceedings of the IEEE Visualization Conference, VIS '19, pages 191-195. IEEE, 2019. doi:10.1109/VISUAL.2019.8933596.

3. [3] Tameem Adel, Zoubin Ghahramani, and Adrian Weller. Discovering interpretable representations for both deep generative and discriminative models. In Proceedings of the 35th International Conference on Machine Learning, ICML '18, pages 50-59. PMLR, 2018. URL: http://proceedings.mlr.press/v80/adel18a.html.

4. [4] Charu C. Aggarwal. An introduction to outlier analysis. In Outlier Analysis, pages 1-34. Springer, 2017. doi:10.1007/978-3-319-47578-3_1.

5. [5] Zafar Ahmed, Patrick Yost, Amy McGovern, and Chris Weaver. Steerable clustering for visual analysis of ecosystems. In Proceedings of the EuroVis Workshop on Visual Analytics, EuroVA '11. The Eurographics Association, 2011. doi:10.2312/PE/EuroVAST/EuroVA11/049-052.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3