1. Learning long-term dependencies with gradient descent is difficult;IEEE Transactions on Neural Networks,1994
2. On the properties of neural machine translation: encoder-decoder approaches;arXiv preprint arXiv:1409.1259,2014
3. Empirical evaluation of gated recurrent neural networks on sequence modeling;arXiv preprint arXiv:1412.3555,2014
4. ARIMA models to predict next-day electricity prices;IEEE Transactions on Power Systems,2003
5. Learning to forget: continual prediction with LSTM;Neural Computation,2000