Affiliation:
1. The University of Chicago
2. IBM Research AI
Abstract
In data science problems, understanding the data is a crucial first step. However, it can be challenging and time-intensive for a data scientist who is not an expert in the relevant domain. Several downstream tasks, such as feature engineering and data curation, depend on understanding the semantics of the data. In this demonstration, we present ADE (Automated Data Explanation), a novel system that uses a maximum likelihood estimation approach through ensembles to automatically label and explain relational data by taking advantage of openly available semantic knowledge bases, web tables, and Wikipedia. It helps a user understand the concepts behind individual columns and their relationships, provides an abstract summary of the overall data, and supplies additional context not present in the data. It reduces the need for cumbersome search queries or expert consultation, and it can also receive inputs or corrections from a user, making it a mixed-initiative automation system.
Publisher
Association for Computing Machinery (ACM)
Subject
General Earth and Planetary Sciences; Water Science and Technology; Geography, Planning and Development
Cited by
1 article.
1. Empirical Evidence on Conversational Control of GUI in Semantic Automation;Proceedings of the 29th International Conference on Intelligent User Interfaces;2024-03-18