Measuring Catastrophic Forgetting in Neural Networks-Reference-Cited by-同舟云学术

Measuring Catastrophic Forgetting in Neural Networks

Published:2018-04-29 Issue:1 Volume:32 Page:
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Kemker Ronald,McClure Marc,Abitino Angelina,Hayes Tyler,Kanan Christopher

Abstract

Deep neural networks are used in many state-of-the-art systems for machine perception. Once a network is trained to do a specific task, e.g., bird classification, it cannot easily be trained to do new tasks, e.g., incrementally learning to recognize additional bird species or learning an entirely different task such as flower recognition. When new tasks are added, typical deep neural networks are prone to catastrophically forgetting previous tasks. Networks that are capable of assimilating new information incrementally, much like how humans form new memories over time, will be more efficient than re-training the model from scratch each time a new task needs to be learned. There have been multiple attempts to develop schemes that mitigate catastrophic forgetting, but these methods have not been directly compared, the tests used to evaluate them vary considerably, and these methods have only been evaluated on small-scale problems (e.g., MNIST). In this paper, we introduce new metrics and benchmarks for directly comparing five different mechanisms designed to mitigate catastrophic forgetting in neural networks: regularization, ensembling, rehearsal, dual-memory, and sparse-coding. Our experiments on real-world images and sounds show that the mechanism(s) that are critical for optimal performance vary based on the incremental training paradigm and type of data being used, but they all demonstrate that the catastrophic forgetting problem is not yet solved.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 148 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Online learning and continuous model upgrading with data streams through the Kafka-ML framework;Future Generation Computer Systems;2024-11

2. A cognition-driven framework for few-shot class-incremental learning;Neurocomputing;2024-10

3. LLM-Commentator: Novel fine-tuning strategies of large language models for automatic commentary generation using football event data;Knowledge-Based Systems;2024-09

4. Assessment of catastrophic forgetting in continual credit card fraud detection;Expert Systems with Applications;2024-09

5. Catastrophic Forgetting in Deep Learning: A Comprehensive Taxonomy;Journal of the Brazilian Computer Society;2024-08-06