Optimal Sampling of Parametric Families: Implications for Machine Learning

Published:2020-01 Issue:1 Volume:32 Page:261-279
ISSN:0899-7667
Container-title:Neural Computation
language:en

Author:

Huber Adrian E. G.¹, Anumula Jithendar¹, Liu Shih-Chii¹

Affiliation:

1. Institute of Neuroinformatics, University of Zurich and ETH Zurich, Zurich 8057, Switzerland

Abstract

It is well known in machine learning that models trained on a training set generated by a probability distribution function perform far worse on test sets generated by a different probability distribution function. In the limit, it is feasible that a continuum of probability distribution functions might have generated the observed test set data; a desirable property of a learned model in that case is its ability to describe most of the probability distribution functions from the continuum equally well. This requirement naturally leads to sampling methods from the continuum of probability distribution functions that lead to the construction of optimal training sets. We study the sequential prediction of Ornstein-Uhlenbeck processes that form a parametric family. We find empirically that a simple deep network trained on optimally constructed training sets using the methods described in this letter can be robust to changes in the test set distribution.

Publisher

MIT Press - Journals

Subject

Cognitive Neuroscience, Arts and Humanities (miscellaneous)

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/neco_a_01251

Reference 20 articles.

1. The Strong Ergodic Theorem for Densities: Generalized Shannon-McMillan-Breiman Theorem

2. Predictors for the first-order autoregressive process

Cited by 1 article.

1. Wavelet-GAN: A GPR Noise and Clutter Removal Method Based on Small Real Datasets;IEEE Transactions on Geoscience and Remote Sensing;2024