Affiliation:
1. University of Stavanger, Stavanger, Norway
Abstract
Tables are powerful and popular tools for organizing and manipulating data. A vast number of tables can be found on the Web, which represent a valuable knowledge resource. The objective of this survey is to synthesize and present two decades of research on web tables. In particular, we organize existing literature into six main categories of information access tasks: table extraction, table interpretation, table search, question answering, knowledge base augmentation, and table augmentation. For each of these tasks, we identify and describe seminal approaches, present relevant resources, and point out interdependencies among the different tasks.
Publisher
Association for Computing Machinery (ACM)
Subject
Artificial Intelligence,Theoretical Computer Science
Cited by
71 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Large Language Models for Tabular Data: Progresses and Future Directions;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10
2. A Large Scale Test Corpus for Semantic Table Search;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10
3. Semantic Annotation of Russian-Language Tables Based on a Pre-Trained Language Model;2024 Ivannikov Memorial Workshop (IVMEM);2024-05-17
4. The Web Data Commons Schema.org Table Corpora;Companion Proceedings of the ACM Web Conference 2024;2024-05-13
5. Table Illustrator: Puzzle-based interactive authoring of plain tables;Proceedings of the CHI Conference on Human Factors in Computing Systems;2024-05-11