Abstract
AbstractLearning sensorimotor trajectories through flexible neural representations is fundamental for robots as it facilitates the building of motor skills as well as equipping them with the ability to represent the world as predictable temporal events. Recent advances in deep learning led to the development of powerful learning from demonstration (LfD) systems such as Conditional Neural Movement Primitives (CNMPs). CNMPs can robustly represent skills as movement distributions and allow them to be ‘recalled’ by conditioning the movement on a few observation points. In this study, we focus on improving CNMPs to achieve a higher resource economy by adopting a divide-and-conquer approach. We propose a novel neural architecture called Coupled CNMP (C-CNMP), that couples the latent spaces of a pair of CNMPs that splits a given sensorimotor trajectory into segments whose learning is undertaken by smaller sub-networks. Therefore, each sub-network needs to deal with a less complex trajectory making the learning less resource-hungry. With systematic simulations on a controlled trajectory data set, we show that the overhead brought by the coupling introduced in our model is well offset by the resource and performance gain obtained. To be specific, with CNMP model as the baseline, it is shown that the proposed model is able to learn to generate trajectories in the data set with a lower trajectory error measured as the mean absolute difference between the generated trajectory and the ground truth. Importantly, our model can perform well with relatively limited resources, i.e., with less number of neural network parameters compared to the baseline. To show that the findings from the controlled data set well-transfer to robot data, we use robot joint data in an LfD setting and compare the learning performance of the proposed model with the baseline model at equal complexity levels. The simulation experiments show that with also the robot joint data, the proposed model, C-CNMP, learns to generate the joint trajectories with significantly less error than the baseline model. Overall, our study improves the state of the art in sensorimotor trajectory learning and exemplifies how divide-and-conquer approaches can benefit deep learning architectures for resource economy.
Funder
Japan Society for the Promotion of Science
New Energy and Industrial Technology Development Organization
Özyeğin University
Publisher
Springer Science and Business Media LLC