Abstract
Abstract
We develop a new model for automatic extraction of reported measurement values from the astrophysical literature, utilizing modern natural language processing techniques. We use this model to extract measurements present in the abstracts of the approximately 248,000 astrophysics articles from the arXiv repository, yielding a database containing over 231,000 astrophysical numerical measurements. Furthermore, we present an online interface (Numerical Atlas) to allow users to query and explore this database, based on parameter names and symbolic representations, and download the resulting data sets for their own research uses. To illustrate potential use cases, we then collect values for nine different cosmological parameters using this tool. From these results, we can clearly observe the historical trends in the reported values of these quantities over the past two decades and see the impacts of landmark publications on our understanding of cosmology.
Funder
UKRI ∣ Science and Technology Facilities Council
Publisher
American Astronomical Society
Subject
Space and Planetary Science,Astronomy and Astrophysics