Unobserved classes and extra variables in high-dimensional discriminant analysis-Reference-Cited by-同舟云学术

Unobserved classes and extra variables in high-dimensional discriminant analysis

Published:2022-03 Issue:1 Volume:16 Page:55-92
ISSN:1862-5347
Container-title:Advances in Data Analysis and Classification
language:en
Short-container-title:Adv Data Anal Classif

Author:

Fop Michael^ORCID,Mattei Pierre-Alexandre,Bouveyron Charles,Murphy Thomas Brendan

Abstract

AbstractIn supervised classification problems, the test set may contain data points belonging to classes not observed in the learning phase. Moreover, the same units in the test data may be measured on a set of additional variables recorded at a subsequent stage with respect to when the learning sample was collected. In this situation, the classifier built in the learning phase needs to adapt to handle potential unknown classes and the extra dimensions. We introduce a model-based discriminant approach, Dimension-Adaptive Mixture Discriminant Analysis (D-AMDA), which can detect unobserved classes and adapt to the increasing dimensionality. Model estimation is carried out via a full inductive approach based on an EM algorithm. The method is then embedded in a more general framework for adaptive variable selection and classification suitable for data of large dimensions. A simulation study and an artificial experiment related to classification of adulterated honey samples are used to validate the ability of the proposed framework to deal with complex situations.

Funder

Science Foundation Ireland

Agence Nationale de la Recherche

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computer Science Applications,Statistics and Probability

Link

https://link.springer.com/content/pdf/10.1007/s11634-021-00474-3.pdf

Reference60 articles.

1. Bagnall A, Lines J, Bostrom A, Large J, Keogh E (2017) The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances. Data Min Knowl Disc 31(3):606–660

2. Bao B-K, Liu G, Hong R, Yan S, Xu C (2013) General subspace learning with corrupted training data via graph embedding. IEEE Trans Image Process 22(11):4380–4393

3. Baudry J-P, Celeux G (2015) EM for mixtures Initialization requires special care. Stat Comput 25(4):713–726

4. Bazell D, Miller DJ (2005) Class discovery in galaxy classification. Astrophys J 618(2):723

5. Bensmail H, Celeux G (1996) Regularized Gaussian discriminant analysis through eigenvalue decomposition. J Am Stat Assoc 91:1743–1748

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Variational inference for semiparametric Bayesian novelty detection in large datasets;Advances in Data Analysis and Classification;2023-12-04