Author:
Parker Jones Oiwi,Mitchell Alexander L.,Yamada Jun,Merkt Wolfgang,Geisert Mathieu,Havoutis Ioannis,Posner Ingmar
Abstract
AbstractSensorimotor control of complex, dynamic systems such as humanoids or quadrupedal robots is notoriously difficult. While artificial systems traditionally employ hierarchical optimisation approaches or black-box policies, recent results in systems neuroscience suggest that complex behaviours such as locomotion and reaching are correlated with limit cycles in the primate motor cortex. A recent result suggests that, when applied to a learned latent space, oscillating patterns of activation can be used to control locomotion in a physical robot. While reminiscent of limit cycles observed in primate motor cortex, these dynamics are unsurprising given the cyclic nature of the robot’s behaviour (walking). In this preliminary investigation, we consider how a similar approach extends to a less obviously cyclic behaviour (reaching). This has been explored in prior work using computational simulations. But simulations necessarily make simplifying assumptions that do not necessarily correspond to reality, so do not trivially transfer to real robot platforms. Our primary contribution is to demonstrate that we can infer and control real robot states in a learnt representation using oscillatory dynamics during reaching tasks. We further show that the learned latent representation encodes interpretable movements in the robot’s workspace. Compared to robot locomotion, the dynamics that we observe for reaching are not fully cyclic, as they do not begin and end at the same position of latent space. However, they do begin to trace out the shape of a cycle, and, by construction, they are driven by the same underlying oscillatory mechanics.
Funder
UKRI/EPSRC Programme Grant
EPSRC CDT
EPSRC
UKRI/EPSRC RAIN
ORCA
Hubs and the EU H2020
Publisher
Springer Science and Business Media LLC
Reference26 articles.
1. Mitchell, A. L. et al. Next steps: Learning a disentangled gait representation for versatile quadruped locomotion. In IEEE International Conference on Robotics and Automation (ICRA) (2022).
2. Churchland, M. M. et al. Neural population dynamics during reaching. Nature 487, 51–56 (2012).
3. Mitchell, A. L. et al. First steps: Latent-space control with semantic constraints for quadruped locomotion. In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2020), pp. 5343–5350.
4. Kingma, D. & Welling, M. Auto-encoding variational bayes. In International Conference on Learning Representations (ICLR) (2014).
5. Rezende, D. J., Mohamed, S. & Wierstra, D. Stochastic backpropagation and approximate inference in deep generative models. In International Conference on Machine Learning (ICML) (2014).