Catastrophic Forgetting in Deep Learning: A Comprehensive Taxonomy
-
Published:2024-08-06
Issue:1
Volume:30
Page:
-
ISSN:1678-4804
-
Container-title:Journal of the Brazilian Computer Society
-
language:
-
Short-container-title:JBCS
Author:
Aleixo Everton LimaORCID, Colonna Juan G.ORCID, Cristo MarcoORCID, Fernandes EverlandioORCID
Abstract
Deep Learning models have achieved remarkable performance in tasks such as image classification or generation, often surpassing human accuracy. However, they can struggle to learn new tasks and update their knowledge without access to previous data, leading to a significant loss of accuracy known as Catastrophic Forgetting (CF). This phenomenon was first observed by McCloskey and Cohen in 1989 and remains an active research topic. Incremental learning without forgetting is widely recognized as a crucial aspect in building better AI systems, as it allows models to adapt to new tasks without losing the ability to perform previously learned ones. This article surveys recent studies that tackle CF in modern Deep Learning models that use gradient descent as their learning algorithm. Although several solutions have been proposed, a definitive solution or consensus on assessing CF is yet to be established. The article provides a comprehensive review of recent solutions, proposes a taxonomy to organize them, and identifies research gaps in this area.
Publisher
Sociedade Brasileira de Computacao - SB
Reference204 articles.
1. Adel, T., Zhao, H., and Turner, R. E. (2020). Continual learning with adaptive weights (claw). In International Conference on Learning Representations. DOI: 10.48550/arXiv.1911.0951. 2. Ahn, H., Kwak, J., Lim, S., Bang, H., Kim, H., and Moon, T. (2021). Ss-il: Separated softmax for incremental learning. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 844-853. Available online [link]. 3. Aljundi, R., Babiloni, F., Elhoseiny, M., Rohrbach, M., and Tuytelaars, T. (2018). Memory aware synapses: Learning what (not) to forget. In Ferrari, V., Hebert, M., Sminchisescu, C., and Weiss, Y., editors, Computer Vision - ECCV 2018 - 15th European Conference, Munich, Germany, September 8-14, 2018, Proceedings, Part III, volume 11207 of Lecture Notes in Computer Science, pages 144-161. Springer. Available online [link]. 4. Aljundi, R., Chakravarty, P., and Tuytelaars, T. (2017). Expert gate: Lifelong learning with a network of experts. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 3366-3375. Available online [link]. 5. Ashfahani, A. and Pratama, M. (2019). Autonomous deep learning: Continual learning approach for dynamic environments. In Berger-Wolf, T. Y. and Chawla, N. V., editors, Proceedings of the 2019 SIAM International Conference on Data Mining, SDM 2019, Calgary, Alberta, Canada, May 2-4, 2019, pages 666-674. SIAM. DOI: 10.1137/1.9781611975673.75.
|
|