Information-Theoretic Generalization Bounds for Meta-Learning and Applications-Reference-Cited by-同舟云学术

Information-Theoretic Generalization Bounds for Meta-Learning and Applications

Published:2021-01-19 Issue:1 Volume:23 Page:126
ISSN:1099-4300
Container-title:Entropy
language:en
Short-container-title:Entropy

Author:

Jose Sharu Theresa^ORCID,Simeone Osvaldo^ORCID

Abstract

Meta-learning, or “learning to learn”, refers to techniques that infer an inductive bias from data corresponding to multiple related tasks with the goal of improving the sample efficiency for new, previously unobserved, tasks. A key performance measure for meta-learning is the meta-generalization gap, that is, the difference between the average loss measured on the meta-training data and on a new, randomly selected task. This paper presents novel information-theoretic upper bounds on the meta-generalization gap. Two broad classes of meta-learning algorithms are considered that use either separate within-task training and test sets, like model agnostic meta-learning (MAML), or joint within-task training and test sets, like reptile. Extending the existing work for conventional learning, an upper bound on the meta-generalization gap is derived for the former class that depends on the mutual information (MI) between the output of the meta-learning algorithm and its input meta-training data. For the latter, the derived bound includes an additional MI between the output of the per-task learning procedure and corresponding data set to capture within-task uncertainty. Tighter bounds are then developed for the two classes via novel individual task MI (ITMI) bounds. Applications of the derived bounds are finally discussed, including a broad class of noisy iterative algorithms for meta-learning.

Funder

H2020 European Research Council

Publisher

MDPI AG

Subject

General Physics and Astronomy

Link

https://www.mdpi.com/1099-4300/23/1/126/pdf

Reference54 articles.

1. Understanding Machine Learning: From Theory to Algorithms;Shalev-Shwartz,2014

2. Pattern Recognition and Machine Learning;Bishop,2006

3. A Brief Introduction to Machine Learning for Engineers

4. Learning to Learn: Introduction and Overview;Thrun,1998

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Information-Theoretic Characterizations of Generalization Error for the Gibbs Algorithm;IEEE Transactions on Information Theory;2024-01

2. Meta-Learning for Wireless Communications: A Survey and a Comparison to GNNs;IEEE Open Journal of the Communications Society;2024

3. Meta-Learning based efficient framework for diagnosing rare disorders: A comprehensive survey;AIP Conference Proceedings;2024

4. Exactly Tight Information-Theoretic Generalization Error Bound for the Quadratic Gaussian Problem;IEEE Journal on Selected Areas in Information Theory;2024

5. Lossless Transformations and Excess Risk Bounds in Statistical Inference;Entropy;2023-09-28