1. Solving Rubik's Cube with a Robot Hand;Akkaya,2019
2. Layer Normalization;Ba,2016
3. Neuronlike adaptive elements that can solve difficult learning control problems;Barto;IEEE transactions on systems, man, and cybernetics,1983
4. Amrl: aggregated memory for reinforcement learning;Beck,2019
5. Acting optimally in partially observable stochastic domains;Cassandra;Aaai,1994