Symmetry-aware Neural Architecture for Embodied Visual Navigation-Reference-Cited by-同舟云学术

Symmetry-aware Neural Architecture for Embodied Visual Navigation

Published:2023-10-25 Issue: Volume: Page:
ISSN:0920-5691
Container-title:International Journal of Computer Vision
language:en
Short-container-title:Int J Comput Vis

Author:

Liu Shuang^ORCID,Suganuma Masanori,Okatani Takayuki

Abstract

AbstractThe existing methods for addressing visual navigation employ deep reinforcement learning as the standard tool for the task. However, they tend to be vulnerable to statistical shifts between the training and test data, resulting in poor generalization over novel environments that are out-of-distribution from the training data. In this study, we attempt to improve the generalization ability by utilizing the inductive biases available for the task. Employing the active neural SLAM that learns policies with the advantage actor-critic method as the base framework, we first point out that the mappings represented by the actor and the critic should satisfy specific symmetries. We then propose a network design for the actor and the critic to inherently attain these symmetries. Specifically, we use G-convolution instead of the standard convolution and insert the semi-global polar pooling layer, which we newly design in this study, in the last section of the critic network. Our method can be integrated into existing methods that utilize intermediate goals and 2D occupancy maps. Experimental results show that our method improves generalization ability by a good margin over visual exploration and object goal navigation, which are two main embodied visual navigation tasks.

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Computer Vision and Pattern Recognition,Software

Link

https://link.springer.com/content/pdf/10.1007/s11263-023-01909-4.pdf

Reference69 articles.

1. Anderson, P., Chang, A., Chaplot, D. S., et al. (2018). On evaluation of embodied navigation agents. arXiv preprint arXiv:1807.06757

2. Beeching, E., Dibangoye, J., Simonin, O., et al. (2020). Egomap: Projective mapping and structured egocentric memory for deep rl. In: Joint European conference on machine learning and knowledge discovery in databases, Springer, pp 525–540

3. Bonin-Font, F., Ortiz, A., & Oliver, G. (2008). Visual navigation for mobile robots: A survey. Journal of intelligent and robotic systems, 53(3), 263–296.

4. Cadena, C., Carlone, L., Carrillo, H., et al. (2016). Past, present, and future of simultaneous localization and mapping: Toward the robust-perception age. IEEE Transactions on robotics, 32(6), 1309–1332.

5. Calimeri, F., Marzullo, A., Stamile, C., et al. (2017). Biomedical data augmentation using generative adversarial neural networks. In: International conference on artificial neural networks, Springer, pp 626–634