Variance Counterbalancing for Stochastic Large-scale Learning-Reference-Cited by-同舟云学术

Variance Counterbalancing for Stochastic Large-scale Learning

Published:2020-08 Issue:05 Volume:29 Page:2050010
ISSN:0218-2130
Container-title:International Journal on Artificial Intelligence Tools
language:en
Short-container-title:Int. J. Artif. Intell. Tools

Author:

Lagari Pola Lydia¹,Tsoukalas Lefteri H.¹,Lagaris Isaac E.²

Affiliation:

1. School of Nuclear Engineering, Purdue University, West Lafayette, IN 47907, USA

2. Department of Computer Science and Engineering, University of Ioannina, Ioannina 45500, Greece

Abstract

Stochastic Gradient Descent (SGD) is perhaps the most frequently used method for large scale training. A common example is training a neural network over a large data set, which amounts to minimizing the corresponding mean squared error (MSE). Since the convergence of SGD is rather slow, acceleration techniques based on the notion of “Mini-Batches” have been developed. All of them however, mimicking SGD, impose diminishing step-sizes as a means to inhibit large variations in the MSE objective. In this article, we introduce random sets of mini-batches instead of individual mini-batches. We employ an objective function that minimizes the average MSE and its variance over these sets, eliminating so the need for the systematic step size reduction. This approach permits the use of state-of-the-art optimization methods, far more efficient than the gradient descent, and yields a significant performance enhancement.

Publisher

World Scientific Pub Co Pte Lt

Subject

Artificial Intelligence,Artificial Intelligence

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218213020500104

Reference9 articles.

1. A Stochastic Approximation Method

2. Optimization Methods for Large-Scale Machine Learning

3. The Merlin control language for strategic optimization

4. Artificial neural networks with an infinite number of nodes

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. SOMO-VCB: A Matlab® software for single-objective and multi-objective optimization for variance counterbalancing in stochastic learning;Software Impacts;2023-09

2. Single-objective and multi-objective optimization for variance counterbalancing in stochastic learning;Applied Soft Computing;2023-07

3. Automated Glaucoma Detection Using Deep Convolutional Neural Networks;2023-04-17