Prospective Role of Foundation Models in Advancing Autonomous Vehicles

Author:

Wu Jianhua1,Gao Bingzhao12,Gao Jincheng1,Yu Jianhao1,Chu Hongqing1,Yu Qiankun3,Gong Xun4,Chang Yi4,Tseng H. Eric5,Chen Hong67ORCID,Chen Jie27

Affiliation:

1. School of Automotive Studies, Tongji University, Shanghai 201804, China.

2. Frontiers Science Center for Intelligent Autonomous Systems, Tongji University, Shanghai 201210, China.

3. SAIC Intelligent Technology, Shanghai 201805, China.

4. College of Artificial Intelligence, Jilin University, Changchun 130012, China.

5. Research and Advanced Engineering, Ford Motor Company, Dearborn, MI 48124, USA.

6. College of Electronic and Information Engineering, Tongji University, Shanghai 201804, China.

7. National Key Laboratory of Autonomous Intelligent Unmanned Systems, Shanghai 201210, China.

Abstract

With the development of artificial intelligence and breakthroughs in deep learning, large-scale foundation models (FMs), such as generative pre-trained transformer (GPT), Sora, etc., have achieved remarkable results in many fields including natural language processing and computer vision. The application of FMs in autonomous driving holds considerable promise. For example, they can contribute to enhancing scene understanding and reasoning. By pre-training on rich linguistic and visual data, FMs can understand and interpret various elements in a driving scene, and provide cognitive reasoning to give linguistic and action instructions for driving decisions and planning. Furthermore, FMs can augment data based on the understanding of driving scenarios to provide feasible scenes of those rare occurrences in the long tail distribution that are unlikely to be encountered during routine driving and data collection. The enhancement can subsequently lead to improvement in the accuracy and reliability of autonomous driving systems. Another testament to the potential of FMs’ applications lies in world models, exemplified by the DREAMER series, which showcases the ability to comprehend physical laws and dynamics. Learning from massive data under the paradigm of self-supervised learning, world models can generate unseen yet plausible driving environments, facilitating the enhancement in the prediction of road users’ behaviors and the off-line training of driving strategies. In this paper, we synthesize the applications and future trends of FMs in autonomous driving. By utilizing the powerful capabilities of FMs, we strive to tackle the potential issues stemming from the long-tail distribution in autonomous driving, consequently advancing overall safety in this domain.

Funder

National Natural Science Foundation of China

Publisher

American Association for the Advancement of Science (AAAS)

Reference222 articles.

1. A Survey of Autonomous Driving: Common Practices and Emerging Technologies

2. A survey of deep learning techniques for autonomous driving;Grigorescu S;J Field Robot,2020

3. Chen L Wu P Chitta K Jaeger B Geiger A Li H. End-to-end autonomous driving: Challenges and frontiers. arXiv. 2023. https://doi.org/10.48550/arXiv.2306.16927

4. Recent advancements in end-to-end autonomous driving using deep learning: A survey;Chib PS;IEEE Trans Intell Veh,2023

5. Bommasani R Hudson DA Adeli E Altman R Arora S von Arx S Bernstein MS Bohg J Bosselut A Brunskill E et al. On the opportunities and risks of foundation models. arXiv. 2021. https://doi.org/10.48550/arXiv.2108.07258

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3