1. OnRL
2. Self-clocked rate adaptation for multime-dia;johansson;Technical Report Internet Engineering Task Force (IETF),2017
3. Learning to navigate in complex environments;mirowski;ArXiv Preprint,2016
4. Deep recurrent q-learning for partially observable mdps;hausknecht;ArXiv Preprint,2015