1. Optimal whitening and decorrelation;A Kessy;The American Statistician,2018
2. Efficient backprop;Y Lecun;Neural networks: Tricks of the trade,2002
3. Batch normalization: Accelerating deep network training by reducing internal covariate shift;S Ioffe;International conference on machine learning,2015