1. Optimistic posterior sampling for reinforcement learning: worst‐case regret bounds;Agrawal S.;Advances in Neural Information Processing Systems,2017
2. Tensor decompositions for learning latent variable models;Anandkumar A.;Journal of Machine Learning Research,2014
3. Anandkumar A. Hsu D. &Kakade S. M.(2012).A method of moments for mixture models and hidden Markov models. In Mannor Shie Srebro Nathan & Williamson Robert C. (Eds.) Conference on learning theory(Vol. 23 pp.33.1–33.34) PMLR.
4. The Nonstochastic Multiarmed Bandit Problem