1. Throughput-optimal wireless scheduling with regulated inter-service times
2. Learning combinatorial optimization algorithms over graphs;dai;Proc 31st Int Conf Neural Inf Process Syst,2017
3. Combinatorial optimization with graph convolutional networks and guided tree search;li;Proc 32nd Int Conf Neural Inf Process Syst,2018
4. Maximum a posteriori policy optimisation;abdolmaleki;Proc Int Conf Learn Represent,2018
5. Addressing function approximation error in actor-critic methods;fujimoto;Proc 35th Int Conf Mach Learn (ICML),2018