Classification of Astrophysics Journal Articles with Machine Learning to Identify Data for NED-Reference-Cited by-同舟云学术

Classification of Astrophysics Journal Articles with Machine Learning to Identify Data for NED

Published:2022-01-01 Issue:1031 Volume:134 Page:014501
ISSN:0004-6280
Container-title:Publications of the Astronomical Society of the Pacific
language:
Short-container-title:PASP

Author:

Chen Tracy X.^ORCID,Ebert Rick^ORCID,Mazzarella Joseph M.^ORCID,Frayer Cren,Terek Scott,Chan Ben H. P.,Cook David^ORCID,Lo Tak,Schmitz Marion^ORCID,Wu Xiuqin^ORCID

Abstract

Abstract The NASA/IPAC Extragalactic Database (NED) is a comprehensive online service that combines fundamental multi-wavelength information for known objects beyond the Milky Way and provides value-added, derived quantities and tools to search and access the data. The contents and relationships between measurements in the database are continuously augmented and revised to stay current with astrophysics literature and new sky surveys. The conventional process of distilling and extracting data from the literature involves human experts to review the journal articles and determine if an article is of extragalactic nature, and if so, what types of data it contains. This is both labor intensive and unsustainable, especially given the ever-increasing number of publications each year. We present here a machine learning (ML) approach developed and integrated into the NED production pipeline to help automate the classification of journal article topics and their data content for inclusion into NED. We show that this ML application can successfully reproduce the classifications of a human expert to an accuracy of over 90% in a fraction of the time it takes a human, allowing us to focus human expertise on tasks that are more difficult to automate.

Publisher

IOP Publishing

Subject

Space and Planetary Science,Astronomy and Astrophysics

Link

https://iopscience.iop.org/article/10.1088/1538-3873/ac3c36/pdf

Reference10 articles.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Best Practices for Data Publication in the Astronomical Literature;The Astrophysical Journal Supplement Series;2022-05-01