1. Adaptive subgradient methods for online learning and stochastic optimization;duchi;J Mach Learn Res,2011
2. SCAFFOLD: Stochastic controlled averaging for federated learning;karimireddy,2019
3. Adaptive methods for nonconvex optimization;zaheer;Proc Adv Neural Inf Process Syst (NeurIPS),2018
4. Adam: A method for stochastic optimization;kingma,2014
5. Learning multiple layers of features from tiny images;krizhevsky,2009