1. SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
2. Adafactor: Adaptive learning rates with sublinear memory cost. Shazeer. ICML.
3. Batch normalization: Accelerating deep network training by reducing internal covariate shift. Ioffe. ICML.
4. Searching for activation functions. Ramachandran. arXiv preprint, 2017.
5. Long Short-Term Memory