Abstract
An important but very time-consuming part of the research process is literature review. An already large and still growing body of publications, together with a steadily increasing publication rate, continues to worsen the situation. Consequently, automating this task as far as possible is desirable. Experimental results of systems are key insights of high importance during literature review and are usually represented in the form of tables. Our pipeline KIETA exploits these tables to contribute to the endeavor of automation by extracting them and their contained knowledge from scientific publications. The pipeline is split into multiple steps to guarantee modularity and analyzability, and it remains agnostic to the specific scientific domain up until the knowledge extraction step, which is based on an ontology. Additionally, a dataset of corresponding articles has been manually annotated with information regarding table and knowledge extraction. Experiments show promising results that signal the possibility of an automated system, while also indicating the limits of extracting knowledge from tables without any context.
Funder
Julius-Maximilians-Universität Würzburg
Publisher
Springer Science and Business Media LLC