Affiliation:
1. University of Michigan
2. Megagon Labs
3. Google, Inc.
4. University of Florida
5. Columbia University
Abstract
In 2008, we wrote about WebTables, an effort to exploit the large and diverse set of structured databases casually published online in the form of HTML tables. The past decade has seen a flurry of research and commercial activities around the WebTables project itself, as well as the broad topic of informal online structured data. In this paper, we
1
will review the WebTables project, and try to place it in the broader context of the decade of work that followed. We will also show how the progress over the past ten years sets up an exciting agenda for the future, and will draw upon many corners of the data management community.
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Cited by
47 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Searching Data Lakes for Nested and Joined Data;Proceedings of the VLDB Endowment;2024-07
2. The Web Data Commons Schema.org Table Corpora;Companion Proceedings of the ACM Web Conference 2024;2024-05-13
3. NPEL: Neural Paired Entity Linking in Web Tables;ACM Transactions on Asian and Low-Resource Language Information Processing;2024-03-19
4. Opportunities and Challenges in Data-Centric AI;IEEE Access;2024
5. Text to Data;Natural Language Interfaces to Databases;2023-11-25