1. Dynamic weights in multi-objective deep reinforcement learning;Abels,2019
2. Reinforcement learning-based bus holding for high-frequency services;Alesiani,2018
3. Dynamic control of complex transit systems;Argote-Cabanero;Transp. Res. B,2015
4. A Markovian decision process;Bellman;J. Math. Mech.,1957
5. Training stochastic model recognition algorithms as networks can lead to maximum mutual information estimation of parameters;Bridle;Adv. Neural Inf. Process. Syst.,1989