1. Agarwal A, Dekel O, Xiao L (2010) Optimal algorithms for online convex optimization with multi-point bandit feedback. In: Colt. Citeseer, pp 28–40
2. Agarwal A, Foster DP, Hsu DJ, Kakade SM, Rakhlin A (2011) Stochastic convex optimization with bandit feedback. Adv Neural Inf Process Syst 24:1–9
3. Akhavan A, Chzhen E, Pontil M, Tsybakov AB (2022) A gradient estimator via l1-randomization for online zero-order optimization with two point feedback. arXiv preprint arXiv:2205.13910
4. Akhavan A, Pontil M, Tsybakov A (2020) Exploiting higher order smoothness in derivative-free optimization and continuous bandits. Adv Neural Inf Process Syst 33:9017–9027
5. Akhavan A, Pontil M, Tsybakov A (2021) Distributed zero-order optimization under adversarial noise. Adv Neural Inf Process Syst 34:10209–10220