Affiliation:
1. Univ. of Toronto
2. IBM Almaden Research Center
Abstract
Clio is a system for managing and facilitating the complex tasks of heterogeneous data transformation and integration. In Clio, we have collected together a powerful set of data management techniques that have proven invaluable in tackling these difficult problems. In this paper, we present the underlying themes of our approach and present a brief case study.
Publisher
Association for Computing Machinery (ACM)
Subject
Information Systems,Software
Cited by
178 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Discovering Functional Dependencies through Hitting Set Enumeration;Proceedings of the ACM on Management of Data;2024-03-12
2. Discovering Similarity Inclusion Dependencies;Proceedings of the ACM on Management of Data;2023-05-26
3. Fast Discovery of Inclusion Dependencies with Desbordante;2023 33rd Conference of Open Innovations Association (FRUCT);2023-05-24
4. EulerFD: An Efficient Double-Cycle Approximation of Functional Dependencies;2023 IEEE 39th International Conference on Data Engineering (ICDE);2023-04
5. Semantic Metadata Requirements for Data Warehousing from a Dimensional Modeling Perspective;Proceedings of the 24th International Conference on Enterprise Information Systems;2022