Regularisation of neural networks by enforcing Lipschitz continuity-Reference-Cited by-同舟云学术

Regularisation of neural networks by enforcing Lipschitz continuity

Published:2020-12-06 Issue:2 Volume:110 Page:393-416
ISSN:0885-6125
Container-title:Machine Learning
language:en
Short-container-title:Mach Learn

Author:

Gouk Henry^ORCID,Frank Eibe,Pfahringer Bernhard,Cree Michael J.

Abstract

AbstractWe investigate the effect of explicitly enforcing the Lipschitz continuity of neural networks with respect to their inputs. To this end, we provide a simple technique for computing an upper bound to the Lipschitz constant—for multiple p-norms—of a feed forward neural network composed of commonly used layer types. Our technique is then used to formulate training a neural network with a bounded Lipschitz constant as a constrained optimisation problem that can be solved using projected stochastic gradient methods. Our evaluation study shows that the performance of the resulting models exceeds that of models trained with other common regularisers. We also provide evidence that the hyperparameters are intuitive to tune, demonstrate how the choice of norm for computing the Lipschitz constant impacts the resulting model, and show that the performance gains provided by our method are particularly noticeable when only a small amount of training data is available.

Funder

University of Edinburgh

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

http://link.springer.com/content/pdf/10.1007/s10994-020-05929-w.pdf

Reference39 articles.

1. Arjovsky, M., Chintala, S., & Bottou, L. (2017). Wasserstein GAN. In Proceedings of the 34th international conference on machine learning.

2. Balan, R., Singh, M., & Zou, D. (2017). Lipschitz properties for deep convolutional networks. arXiv:1701.05217.

3. Bartlett, P. L. (1998). The sample complexity of pattern classification with neural networks: The size of the weights is more important than the size of the network. IEEE Transactions on Information Theory, 44(2), 525–536.

4. Bartlett, P. L., Foster, D. J., & Telgarsky, M. J. (2017). Spectrally-normalized margin bounds for neural networks. In Advances in neural information processing systems (vol. 30).

5. Bergstra, J., Komer, B., Eliasmith, C., Yamins, D., & Cox, D. D. (2015). Hyperopt: A Python library for model selection and hyperparameter optimization. Computational Science & Discovery, 8(1), 014008.

Cited by 115 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. RobustCheck: A Python package for black-box robustness assessment of image classifiers;SoftwareX;2024-09

2. Quaternion Convolutional Neural Networks: Current Advances and Future Directions;Advances in Applied Clifford Algebras;2024-08-28

3. Online Drift Detection with Maximum Concept Discrepancy;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24

4. Robustness and exploration of variational and machine learning approaches to inverse problems: An overview;GAMM-Mitteilungen;2024-08-07

5. Machine-learning-coined noise induces energy-saving synchrony;Physical Review E;2024-07-25