On the Convergence Properties of a Stochastic Trust-Region Method with Inexact Restoration-Reference-Cited by-同舟云学术

On the Convergence Properties of a Stochastic Trust-Region Method with Inexact Restoration

Published:2022-12-28 Issue:1 Volume:12 Page:38
ISSN:2075-1680
Container-title:Axioms
language:en
Short-container-title:Axioms

Author:

Bellavia Stefania,Morini Benedetta^ORCID,Rebegoldi Simone^ORCID

Abstract

We study the convergence properties of SIRTR, a stochastic inexact restoration trust-region method suited for the minimization of a finite sum of continuously differentiable functions. This method combines the trust-region methodology with random function and gradient estimates formed by subsampling. Unlike other existing schemes, it forces the decrease of a merit function by combining the function approximation with an infeasibility term, the latter of which measures the distance of the current sample size from its maximum value. In a previous work, the expected iteration complexity to satisfy an approximate first-order optimality condition was given. Here, we elaborate on the convergence analysis of SIRTR and prove its convergence in probability under suitable accuracy requirements on random function and gradient estimates. Furthermore, we report the numerical results obtained on some nonconvex classification test problems, discussing the impact of the probabilistic requirements on the selection of the sample sizes.

Funder

INdAM GNCS project “Ottimizzazione adattiva per il machine learning”

Mobility Project “Second order methods for optimization problems in Machine Learning”

IEA CNRS project entitled “VaMOS”

Publisher

MDPI AG

Subject

Geometry and Topology,Logic,Mathematical Physics,Algebra and Number Theory,Analysis

Link

https://www.mdpi.com/2075-1680/12/1/38/pdf

Reference31 articles.

1. Bishop, C.M. (2006). Pattern Recognition and Machine Learning, Springer.

2. Goodfellow, I., Bengio, Y., and Courville, A. (2016). Deep Learning, MIT Press.

3. Optimization Methods for Large-Scale Machine Learning;Bottou;SIAM Rev.,2018

4. Kingma, D.P., and Ba, J. (2015, January 7–9). Adam: A Method for Stochastic Optimization. Proceedings of the 3rd International Conference on Learning Representations, San Diego, CA, USA.

5. Minimizing Finite Sums with the Stochastic Average Gradient;Schmidt;Math. Program.,2017

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A New Adaptive Accelerated Levenberg–Marquardt Method for Solving Nonlinear Equations and Its Applications in Supply Chain Problems;Symmetry;2023-02-24