1. Natural gradient works efficiently in learning;Amari;Neural Comput.,1998
2. Analytic subordination theory of operator-valued free additive convolution and the solution of a general random matrix problem;Belinschi;J. Reine Angew. Math.,2013
3. Probability and Measure;Billingsley,2008
4. Online Learning and Stochastic Approximations;Bottou,1998
5. Pattern Recognition and Machine Learning;Christopher,2016