1. Ilge Akkaya Marcin Andrychowicz Maciek Chociej Mateusz Litwin Bob Mc-Grew Arthur Petron Alex Paino Matthias Plappert Glenn Powell Raphael Ribas etal 2019. Solving rubik's cube with a robot hand. arXiv preprint arXiv:1910.07113 (2019). Ilge Akkaya Marcin Andrychowicz Maciek Chociej Mateusz Litwin Bob Mc-Grew Arthur Petron Alex Paino Matthias Plappert Glenn Powell Raphael Ribas et al. 2019. Solving rubik's cube with a robot hand. arXiv preprint arXiv:1910.07113 (2019).
2. James Bradbury , Roy Frostig , Peter Hawkins , Matthew James Johnson , Chris Leary, Dougal Maclaurin, George Necula, Adam Paszke, Jake VanderPlas, Skye Wanderman-Milne, and Qiao Zhang. 2018 . JAX: composable transformations of Python +NumPy programs. http://github.com/google/jax James Bradbury, Roy Frostig, Peter Hawkins, Matthew James Johnson, Chris Leary, Dougal Maclaurin, George Necula, Adam Paszke, Jake VanderPlas, Skye Wanderman-Milne, and Qiao Zhang. 2018. JAX: composable transformations of Python+NumPy programs. http://github.com/google/jax
3. Felix Chalumeau , Raphael Boige , Bryan Lim , Valentin Macé , Maxime Allard , Arthur Flajolet , Antoine Cully , and Thomas Pierrot . 2022. Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery. arXiv preprint arXiv:2210.03516 ( 2022 ). Felix Chalumeau, Raphael Boige, Bryan Lim, Valentin Macé, Maxime Allard, Arthur Flajolet, Antoine Cully, and Thomas Pierrot. 2022. Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery. arXiv preprint arXiv:2210.03516 (2022).
4. Konstantinos Chatzilygeroudis , Antoine Cully , Vassilis Vassiliades , and Jean-Baptiste Mouret . 2021. Quality-Diversity Optimization: a novel branch of stochastic optimization . In Black Box Optimization, Machine Learning, and No-Free Lunch Theorems . Springer , 109--135. Konstantinos Chatzilygeroudis, Antoine Cully, Vassilis Vassiliades, and Jean-Baptiste Mouret. 2021. Quality-Diversity Optimization: a novel branch of stochastic optimization. In Black Box Optimization, Machine Learning, and No-Free Lunch Theorems. Springer, 109--135.
5. Xinyue Chen , Che Wang , Zijian Zhou , and Keith Ross . 2021. Randomized ensembled double q-learning: Learning fast without a model. arXiv preprint arXiv:2101.05982 ( 2021 ). Xinyue Chen, Che Wang, Zijian Zhou, and Keith Ross. 2021. Randomized ensembled double q-learning: Learning fast without a model. arXiv preprint arXiv:2101.05982 (2021).