Navigating to objects in the real world

Author:

Gervet Theophile1ORCID,Chintala Soumith2,Batra Dhruv23ORCID,Malik Jitendra24ORCID,Chaplot Devendra Singh2

Affiliation:

1. Carnegie Mellon University, Pittsburgh, PA, USA.

2. Meta AI Research, Menlo Park, CA, USA.

3. Georgia Institute of Technology, Atlanta, GA, USA.

4. University of California, Berkeley, CA, USA.

Abstract

Semantic navigation is necessary to deploy mobile robots in uncontrolled environments such as homes or hospitals. Many learning-based approaches have been proposed in response to the lack of semantic understanding of the classical pipeline for spatial navigation, which builds a geometric map using depth sensors and plans to reach point goals. Broadly, end-to-end learning approaches reactively map sensor inputs to actions with deep neural networks, whereas modular learning approaches enrich the classical pipeline with learning-based semantic sensing and exploration. However, learned visual navigation policies have predominantly been evaluated in sim, with little known about what works on a robot. We present a large-scale empirical study of semantic visual navigation methods comparing representative methods with classical, modular, and end-to-end learning approaches across six homes with no prior experience, maps, or instrumentation. We found that modular learning works well in the real world, attaining a 90% success rate. In contrast, end-to-end learning does not, dropping from 77% sim to a 23% real-world success rate because of a large image domain gap between sim and reality. For practitioners, we show that modular learning is a reliable approach to navigate to objects: Modularity and abstraction in policy design enable sim-to-real transfer. For researchers, we identify two key issues that prevent today’s simulators from being reliable evaluation benchmarks—a large sim-to-real gap in images and a disconnect between sim and real-world error modes—and propose concrete steps forward.

Publisher

American Association for the Advancement of Science (AAAS)

Subject

Artificial Intelligence,Control and Optimization,Computer Science Applications,Mechanical Engineering

Reference117 articles.

1. P. Anderson A. Chang D. S. Chaplot A. Dosovitskiy S. Gupta V. Koltun J. Kosecka J. Malik R. Mottaghi M. Savva A. R. Zamir On evaluation of embodied navigation agents. arXiv:1807.06757 [cs.AI] (18 July 2018).

2. H. P. Moravec Obstacle avoidance and navigation in the real world by a seeing robot rover thesis Stanford University Palo Alto CA (1980).

3. R. Chatila J.-P. Laumond “Position referencing and consistent world modeling for mobile robots ” in Proceedings of the 1985 IEEE International Conference on Robotics and Automation St. Louis MO 25 to 28 March 1985 (IEEE 1985); vol. 2 pp. 138–145.

4. Sonar-based real-world mapping and navigation

5. Using occupancy grids for mobile robot perception and navigation

Cited by 28 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3