1. Adam: A method for stochastic optimization;kingma;ArXiv Preprint,2014
2. DeepAR: Probabilistic forecasting with autoregressive recurrent networks
3. Weight normalization: A simple reparameterization to accelerate training of deep neural networks;salimans;ArXiv Preprint,2016
4. Con-volutional sequence to sequence learning;gehring;International Conference on Machine Learning,0
5. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling;bai;ArXiv Preprint,2018