1. Patient subtyping via time-aware LSTM networks;I M Baytas;Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,2017
2. Learning long-term dependencies with gradient descent is difficult;Y Bengio;IEEE Transactions on Neural Networks,1994
3. A theoretical analysis of feature pooling in visual recognition;Y L Boureau;Proceedings of the 27th International Conference on Machine Learning (ICML-10),2010
4. Recurrent neural networks for multivariate time series with missing values;Z Che;Scientific Reports,2018