A versatile framework for analyzing galaxy image data by incorporating Human-in-the-loop in a large vision model*
-
Published:2024-09-01
Issue:9
Volume:48
Page:095001
-
ISSN:1674-1137
-
Container-title:Chinese Physics C
-
language:
-
Short-container-title:Chinese Phys. C
Author:
Fu 傅 Ming-Xiang 溟翔,Song 宋 Yu 宇,Lv 吕 Jia-Meng 佳蒙,Cao 曹 Liang 亮,Jia 贾 Peng 鹏,Li 李 Nan 楠,Li 李 Xiang-Ru 乡儒,Liu 刘 Ji-Feng 继峰,Luo 罗 A-Li 阿理,Qiu 邱 Bo 波,Shen 沈 Shi-Yin 世银,Tu 屠 Liang-Ping 良平,Wang 王 Li-Li 丽丽,Wei 卫 Shou-Lin 守林,Yang 杨 Hai-Feng 海峰,Yi 衣 Zhen-Ping 振萍,Zou 邹 Zhi-Qiang 志强
Abstract
Abstract
The exponential growth of astronomical datasets provides an unprecedented opportunity for humans to gain insight into the Universe. However, effectively analyzing this vast amount of data poses a significant challenge. In response, astronomers are turning to deep learning techniques, but these methods are limited by their specific training sets, leading to considerable duplicate workloads. To overcome this issue, we built a framework for the general analysis of galaxy images based on a large vision model (LVM) plus downstream tasks (DST), including galaxy morphological classification, image restoration, object detection, parameter extraction, and more. Considering the low signal-to-noise ratios of galaxy images and the imbalanced distribution of galaxy categories, we designed our LVM to incorporate a Human-in-the-loop (HITL) module, which leverages human knowledge to enhance the reliability and interpretability of processing galaxy images interactively. The proposed framework exhibits notable few-shot learning capabilities and versatile adaptability for all the abovementioned tasks on galaxy images in the DESI Legacy Imaging Surveys. In particular, for the object detection task, which was trained using 1000 data points, our DST in the LVM achieved an accuracy of 96.7%, while ResNet50 plus Mask R-CNN reached an accuracy of 93.1%. For morphological classification, to obtain an area under the curve (AUC) of ~0.9, LVM plus DST and HITL only requested 1/50 of the training sets that ResNet18 requested. In addition, multimodal data can be integrated, which creates possibilities for conducting joint analyses with datasets spanning diverse domains in the era of multi-messenger astronomy.
Funder
National Natural Science Foundation of China
National Key Research and Development Program of China
Square Kilometre Array (SKA) Project
China Manned Space Project
CAS Project for Young Scientists in Basic Research