Practical galaxy morphology tools from deep supervised representation learning

Author:

Walmsley Mike1ORCID,Scaife Anna M M12ORCID,Lintott Chris3ORCID,Lochner Michelle45,Etsebeth Verlon4,Géron Tobias3ORCID,Dickinson Hugh6,Fortson Lucy78,Kruk Sandor910,Masters Karen L11ORCID,Mantha Kameswara Bharadwaj78,Simmons Brooke D12ORCID

Affiliation:

1. Jodrell Bank Centre for Astrophysics, Department of Physics & Astronomy, University of Manchester, Oxford Road, Manchester M13 9PL, UK

2. The Alan Turing Institute, 96 Euston Road, London NW1 2DB, UK

3. Oxford Astrophysics, Department of Physics, University of Oxford, Denys Wilkinson Building, Keble Road, Oxford OX1 3RH, UK

4. Department of Physics and Astronomy, University of the Western Cape, Bellville, Cape Town 7535, South Africa

5. South African Radio Astronomy Observatory (SARAO), The Park, Park Road, Pinelands, Cape Town 7405, South Africa

6. School of Physical Sciences, The Open University, Milton Keynes, Kents Hill MK7 6AA, UK

7. Minnesota Institute for Astrophysics, University of Minnesota, 116 Church St SE, Minneapolis, MN 55455, USA

8. School of Physics and Astronomy, University of Minnesota, 116 Church St SE, Minneapolis, MN 55455, USA

9. Max-Planck-Institut für extraterrestrische Physik, Giessenbachstrasse 1, D-85748 Garching bei München, Germany

10. European Space Agency, ESTEC, Keplerlaan 1, NL-2201 AZ Noordwijk, the Netherlands

11. Departments of Physics and Astronomy, Haverford College, 370 Lancaster Avenue, Haverford, PA 19041, USA

12. Department of Physics, Lancaster University, Bailrigg, Lancaster LA1 4YB, UK

Abstract

ABSTRACT Astronomers have typically set out to solve supervised machine learning problems by creating their own representations from scratch. We show that deep learning models trained to answer every Galaxy Zoo DECaLS question learn meaningful semantic representations of galaxies that are useful for new tasks on which the models were never trained. We exploit these representations to outperform several recent approaches at practical tasks crucial for investigating large galaxy samples. The first task is identifying galaxies of similar morphology to a query galaxy. Given a single galaxy assigned a free text tag by humans (e.g. ‘#diffuse’), we can find galaxies matching that tag for most tags. The second task is identifying the most interesting anomalies to a particular researcher. Our approach is 100 per cent accurate at identifying the most interesting 100 anomalies (as judged by Galaxy Zoo 2 volunteers). The third task is adapting a model to solve a new task using only a small number of newly labelled galaxies. Models fine-tuned from our representation are better able to identify ring galaxies than models fine-tuned from terrestrial images (ImageNet) or trained from scratch. We solve each task with very few new labels; either one (for the similarity search) or several hundred (for anomaly detection or fine-tuning). This challenges the longstanding view that deep supervised methods require new large labelled data sets for practical use in astronomy. To help the community benefit from our pretrained models, we release our fine-tuning code zoobot. Zoobot is accessible to researchers with no prior experience in deep learning.

Funder

Alan Turing Institute

National Science Foundation

National Research Foundation

Publisher

Oxford University Press (OUP)

Subject

Space and Planetary Science,Astronomy and Astrophysics

Cited by 14 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. The ROAD to discovery: Machine-learning-driven anomaly detection in radio astronomy spectrograms;Astronomy & Astrophysics;2023-12

2. Galaxy mergers in Subaru HSC-SSP: A deep representation learning approach for identification, and the role of environment on merger incidence;Astronomy & Astrophysics;2023-11

3. Deep supervised hashing for fast retrieval of radio image cubes;2023 XXXVth General Assembly and Scientific Symposium of the International Union of Radio Science (URSI GASS);2023-08-19

4. Similar Image Retrieval using Autoencoder. I. Automatic Morphology Classification of Galaxies;Publications of the Astronomical Society of the Pacific;2023-08-01

5. Guided Retraining to Enhance the Detection of Difficult Android Malware;Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis;2023-07-12

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3