An Optimal Algorithm for Online Non-Convex Learning-Reference-Cited by-同舟云学术

An Optimal Algorithm for Online Non-Convex Learning

Published:2019-01-17 Issue:1 Volume:46 Page:41-43
ISSN:0163-5999
Container-title:ACM SIGMETRICS Performance Evaluation Review
language:en
Short-container-title:SIGMETRICS Perform. Eval. Rev.

Author:

Yang Lin¹,Deng Lei¹,Hajiesmaili Mohammad H.²,Tan Cheng³,Wong Wing Shing³

Affiliation:

1. Chinese University of Hong Kong, Hong Kong, Hong Kong

2. Johns Hopkins University, Baltimore, MD, USA

3. Chinese University of Hong Kong, Hong Kong , Hong Kong

Abstract

In many online learning paradigms, convexity plays a central role in the derivation and analysis of online learning algorithms. The results, however, fail to be extended to the non-convex settings, which are necessitated by tons of recent applications. The Online Non-Convex Learning problem generalizes the classic Online Convex Optimization framework by relaxing the convexity assumption on the cost function (to a Lipschitz continuous function) and the decision set. The state-of-the-art result for ønco demonstrates that the classic Hedge algorithm attains a sublinear regret of O(√T log T). The regret lower bound for øco, however, is Omega(√T), and to the best of our knowledge, there is no result in the context of the ønco problem achieving the same bound. This paper proposes the Online Recursive Weighting algorithm with regret of O(√T), matching the tight regret lower bound for the øco problem, and fills the regret gap between the state-of-the-art results in the online convex and non-convex optimization problems.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture,Software

Link

https://dl.acm.org/doi/pdf/10.1145/3292040.3219635

Reference11 articles.

1. P. Auer N. Cesa-Bianchi Y. Freund and R. E. Schapire. The nonstochastic multiarmed bandit problem. SIAM journal on computing 32(1):48--77 2002. 10.1137/S0097539701398375 P. Auer N. Cesa-Bianchi Y. Freund and R. E. Schapire. The nonstochastic multiarmed bandit problem. SIAM journal on computing 32(1):48--77 2002. 10.1137/S0097539701398375

2. Online linear optimization and adaptive routing

3. Nonconvex Online Support Vector Machines

4. Introduction to Online Convex Optimization

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Regrets of proximal method of multipliers for online non-convex optimization with long term constraints;Journal of Global Optimization;2022-06-21