Toward Training Recurrent Neural Networks for Lifelong Learning-Reference-Cited by-同舟云学术

Toward Training Recurrent Neural Networks for Lifelong Learning

Published:2020-01 Issue:1 Volume:32 Page:1-35
ISSN:0899-7667
Container-title:Neural Computation
language:en
Short-container-title:Neural Computation

Author:

Sodhani Shagun¹,Chandar Sarath¹,Bengio Yoshua²

Affiliation:

1. Mila, University of Montréal, Montreal, Quebec H3T 1J4, Canada

2. Mila, University of Montréal, Montreal, Quebec H3T 1J4, Canada, and CIFAR

Abstract

Catastrophic forgetting and capacity saturation are the central challenges of any parametric lifelong learning system. In this work, we study these challenges in the context of sequential supervised learning with an emphasis on recurrent neural networks. To evaluate the models in the lifelong learning setting, we propose a curriculum-based, simple, and intuitive benchmark where the models are trained on tasks with increasing levels of difficulty. To measure the impact of catastrophic forgetting, the model is tested on all the previous tasks as it completes any task. As a step toward developing true lifelong learning systems, we unify gradient episodic memory (a catastrophic forgetting alleviation approach) and Net2Net (a capacity expansion approach). Both models are proposed in the context of feedforward networks, and we evaluate the feasibility of using them for recurrent networks. Evaluation on the proposed benchmark shows that the unified model is more suitable than the constituent models for lifelong learning setting.

Publisher

MIT Press - Journals

Subject

Cognitive Neuroscience,Arts and Humanities (miscellaneous)

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/neco_a_01246

Reference33 articles.

1. Curriculum learning

Cited by 33 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The role of lifelong machine learning in bridging the gap between human and machine learning: A scientometric analysis;WIREs Data Mining and Knowledge Discovery;2024-01-10

2. A 2D image 3D reconstruction function adaptive denoising algorithm;PeerJ Computer Science;2023-10-03

3. Artificial intelligence in psychiatry research, diagnosis, and therapy;Asian Journal of Psychiatry;2023-09

4. Biologically-Inspired Continual Learning of Human Motion Sequences;ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2023-06-04

5. Class-Incremental Learning on Multivariate Time Series Via Shape-Aligned Temporal Distillation;ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2023-06-04