Toward interpretable and actionable data analysis with explanations and causality

Published:2022-08 Issue:12 Volume:15 Page:3812-3820
ISSN:2150-8097
Container-title:Proceedings of the VLDB Endowment
language:en
Short-container-title:Proc. VLDB Endow.

Author:

Roy Sudeepa¹

Affiliation:

1. Duke University

Abstract

We live in a world dominated by data, where users from different fields routinely collect, study, and make decisions supported by data. To aid these users, the current trend in data analysis is to design tools that allow large-scale analytics, sophisticated predictive models, and beautiful visualizations. At this exciting time when both data and analytics tools are widely accessible to users, treating analyses as magical black boxes can painfully mislead users and make troubleshooting frustratingly time-consuming. For instance, although the perils of interpreting correlations inferred by predictive models as causation are well-documented, making such a distinction can be tricky for many users who do not have formal training in computer science or statistics. In this paper, we give an overview of our research toward bridging this gap along two main thrusts: explanations and causality. Explanations support a primary goal of data analysis: empowering users to interpret the results of an analysis and troubleshoot the process. Causality complements explanations by supporting prescriptive or actionable analytics with counterfactuals and interventions, thereby supporting sound decision making. In these thrusts, we explore the symbiotic relationship between core database techniques and complementary techniques from machine learning and statistics via interdisciplinary collaborations, and apply them in domains such as computer science education, law, and health.

Publisher

Association for Computing Machinery (ACM)

Subject

General Earth and Planetary Sciences; Water Science and Technology; Geography, Planning and Development

Link

https://dl.acm.org/doi/pdf/10.14778/3554821.3554902

References: 73 articles.

1. R. Agrawal and R. Srikant. Fast algorithms for mining association rules in large databases. In J. B. Bocca, M. Jarke, and C. Zaniolo, editors, VLDB'94, Proceedings of 20th International Conference on Very Large Data Bases, September 12--15, 1994, Santiago de Chile, Chile, pages 487--499. Morgan Kaufmann, 1994.

2. Almost Matching Exactly Lab, Duke University. https://almost-matching-exactly.github.io/.

3. Provenance for aggregate queries

4. M. U. Awan, Y. Liu, M. Morucci, S. Roy, C. Rudin, and A. Volfovsky. Interpretable almost matching exactly with instrumental variables. In Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, UAI 2019, Tel Aviv, Israel, July 22--25, 2019, page 410, 2019.

5. Proceedings of Machine Learning Research;Awan M. U.,2020

Cited by 2 articles.

1. On Data-Aware Global Explainability of Graph Neural Networks;Proceedings of the VLDB Endowment;2023-07

2. Why Not Yet: Fixing a Top-k Ranking that is Not Fair to Individuals;Proceedings of the VLDB Endowment;2023-05