Variational inference for semiparametric Bayesian novelty detection in large datasets-Reference-Cited by-同舟云学术

Variational inference for semiparametric Bayesian novelty detection in large datasets

Published:2023-12-04 Issue: Volume: Page:
ISSN:1862-5347
Container-title:Advances in Data Analysis and Classification
language:en
Short-container-title:Adv Data Anal Classif

Author:

Benedetti Luca,Boniardi Eric,Chiani Leonardo^ORCID,Ghirri Jacopo^ORCID,Mastropietro Marta^ORCID,Cappozzo Andrea^ORCID,Denti Francesco^ORCID

Abstract

AbstractAfter being trained on a fully-labeled training set, where the observations are grouped into a certain number of known classes, novelty detection methods aim to classify the instances of an unlabeled test set while allowing for the presence of previously unseen classes. These models are valuable in many areas, ranging from social network and food adulteration analyses to biology, where an evolving population may be present. In this paper, we focus on a two-stage Bayesian semiparametric novelty detector, also known as Brand, recently introduced in the literature. Leveraging on a model-based mixture representation, Brand allows clustering the test observations into known training terms or a single novelty term. Furthermore, the novelty term is modeled with a Dirichlet Process mixture model to flexibly capture any departure from the known patterns. Brand was originally estimated using MCMC schemes, which are prohibitively costly when applied to high-dimensional data. To scale up Brand applicability to large datasets, we propose to resort to a variational Bayes approach, providing an efficient algorithm for posterior approximation. We demonstrate a significant gain in efficiency and excellent classification performance with thorough simulation studies. Finally, to showcase its applicability, we perform a novelty detection analysis using the openly-available dataset, a large collection of satellite imaging spectra, to search for novel soil types.

Funder

Università Cattolica del Sacro Cuore

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computer Science Applications,Statistics and Probability

Link

https://link.springer.com/content/pdf/10.1007/s11634-023-00569-z.pdf

Reference35 articles.

1. Aliverti E, Russo M (2022) Stratified stochastic variational inference for high-dimensional network factor model. J Comput Graph Stat 31(2):502–511. https://doi.org/10.1080/10618600.2021.1984929, arXiv:2006.14217

2. Blei DM, Jordan MI (2006) Variational inference for Dirichlet process mixtures. Bayesian Anal 1(1):121–144. http://www.cs.berkeley.edu/$sim$blei/

3. Blei DM, Kucukelbir A, McAuliffe JD (2017) Variational inference: a review for statisticians. J Am Stat Assoc 112(518):859–877. https://doi.org/10.1080/01621459.2017.1285773

4. Boudt K, Rousseeuw PJ, Vanduffel S et al (2020) The minimum regularized covariance determinant estimator. Stat Comput 30(1):113–128. https://doi.org/10.1007/s11222-019-09869-x, arXiv:1701.07086

5. Bouveyron C (2014) Adaptive mixture discriminant analysis for supervised learning with unobserved classes. J Classif 31(1):49–84. https://doi.org/10.1007/s00357-014-9147-x. (link.springer.com/content/pdf/10.1007/s00357-014-9147-x.pdf)