1. Majorization, doubly stochastic matrices, and comparison of eigenvalues
2. Sanjeev Arora , Noah Golowich , Nadav Cohen , and Wei Hu . 2019 . A convergence analysis of gradient descent for deep linear neural networks . In 7th International Conference on Learning Representations, ICLR 2019. Sanjeev Arora, Noah Golowich, Nadav Cohen, and Wei Hu. 2019. A convergence analysis of gradient descent for deep linear neural networks. In 7th International Conference on Learning Representations, ICLR 2019.
3. Andreas Arvanitogeorgos . 2003. ¯ An introduction to Lie groups and the geometry of homogeneous spaces . Vol. 22 . American Mathematical Soc . Andreas Arvanitogeorgos. 2003. ¯ An introduction to Lie groups and the geometry of homogeneous spaces. Vol. 22. American Mathematical Soc.
4. Jimmy Ba and Brendan Frey. 2013. Adaptive Dropout for training deep neural networks. In Advances in Neural Information Processing Systems. 3084--3092. Jimmy Ba and Brendan Frey. 2013. Adaptive Dropout for training deep neural networks. In Advances in Neural Information Processing Systems. 3084--3092.
5. Learning deep linear neural networks: Riemannian gradient flows and convergence to global minimizers