Unsupervised classification of SDSS galaxy spectra-Reference-Cited by-同舟云学术

Unsupervised classification of SDSS galaxy spectra

Published:2021-05 Issue: Volume:649 Page:A53
ISSN:0004-6361
Container-title:Astronomy & Astrophysics
language:
Short-container-title:A&A

Author:

Fraix-Burnet D.^ORCID,Bouveyron C.,Moultaka J.

Abstract

Context. Defining templates of galaxy spectra is useful to quickly characterise new observations and organise databases from surveys. These templates are usually built from a pre-defined classification based on other criteria. Aims. We present an unsupervised classification of 702 248 spectra of galaxies and quasars with redshifts smaller than 0.25 that were retrieved from the Sloan Digital Sky Survey (SDSS) database, release 7. Methods. The spectra were first corrected for redshift, then wavelet-filtered to reduce the noise, and finally binned to obtain about 1437 wavelengths per spectrum. The unsupervised clustering algorithm Fisher-EM, relying on a discriminative latent mixture model, was applied on these corrected spectra. The full set and several subsets of 100 000 and 300 000 spectra were analysed. Results. The optimum number of classes given by a penalised likelihood criterion is 86 classes, of which the 37 most populated gather 99% of the sample. These classes are established from a subset of 302 214 spectra. Using several cross-validation techniques we find that this classification agrees with the results obtained on the other subsets with an average misclassification error of about 15%. The large number of very small classes tends to increase this error rate. In this paper, we do an initial quick comparison of our classes with literature templates. Conclusions. This is the first time that an automatic, objective and robust unsupervised classification is established on such a large number of galaxy spectra. The mean spectra of the classes can be used as templates for a large majority of galaxies in our Universe.

Funder

3IA Côte d’Azur Investment in the Future project managed by the National Research Agency

Publisher

EDP Sciences

Subject

Space and Planetary Science,Astronomy and Astrophysics

Link

https://www.aanda.org/10.1051/0004-6361/202040046/pdf

Reference33 articles.

1. Classification parameters for the emission-line spectra of extragalactic objects

2. The weirdest SDSS galaxies: results from an outlier detection algorithm

3. Assessing a mixture model for clustering with the integrated completed likelihood