1. On difference convexity of locally Lipschitz functions;Bačák;Optimization,2011
2. Variance reduction for Markov chains with application to MCMC;Belomestny;Statistics and Computing,2020
3. Berrada, L., Zisserman, A., & Kumar, M. P. (2017). Trusting SVM for Piecewise Linear CNNs. In International conference on learning representations.
4. On explicit L2-convergence rate estimate for underdamped langevin dynamics;Cao,2019
5. Entropy-sgd: Biasing gradient descent into wide valleys;Chaudhari;Journal of Statistical Mechanics: Theory and Experiment,2019