Supply-Demand-aware Deep Reinforcement Learning for Dynamic Fleet Management

Author:

Zheng Bolong1ORCID,Ming Lingfeng1,Hu Qi1,Lü Zhipeng1,Liu Guanfeng2,Zhou Xiaofang3

Affiliation:

1. Huazhong University of Science and Technology, Wuhan, China

2. Macquarie University, Sydney, Australia

3. Hong Kong University of Science and Technology, Kowloon, Hong Kong

Abstract

Online ride-hailing platforms have reduced significantly the amounts of the time that taxis are idle and that passengers spend on waiting. As a key component of these platforms, the fleet management problem can be naturally modeled as a Markov Decision Process, which enables us to use the deep reinforcement learning. However, existing studies are proposed based on simplified problem settings that fail to model the complicated supply-dynamics and restrict the performance in the real traffic environment. In this article, we propose a supply-demand-aware deep reinforcement learning algorithm for taxi dispatching, where we use a deep Q-network with action sampling policy, called AS-DQN, to learn an optimal dispatching policy. Furthermore, we utilize a dueling network architecture, called AS-DDQN, to improve the performance of AS-DQN. Extensive experiments on real-world datasets offer insight into the performance of our model and show that it is capable of outperforming the baseline approaches.

Funder

NSFC

Hubei Natural Science Foundation

Fundamental Research Funds for the Central Universities

Publisher

Association for Computing Machinery (ACM)

Subject

Artificial Intelligence,Theoretical Computer Science

Reference46 articles.

1. 2021. Didi. Retrieved from https://www.xiaojukeji.com.

2. 2021. Uber. Retrieved from https://www.uber.com.

3. Lei Bai, Lina Yao, Can Li, Xianzhi Wang, and Can Wang. 2020. Adaptive graph convolutional recurrent network for traffic forecasting. In NeurIPS.

4. Online Vehicle Routing: The Edge of Optimization in Large-Scale Applications

Cited by 10 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. A Clustering-Based Multi-Agent Reinforcement Learning Framework for Finer-Grained Taxi Dispatching;IEEE Transactions on Intelligent Transportation Systems;2024-09

2. DyPS: Dynamic Parameter Sharing in Multi-Agent Reinforcement Learning for Spatio-Temporal Resource Allocation;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24

3. Learned Unmanned Vehicle Scheduling for Large-Scale Urban Logistics;IEEE Transactions on Intelligent Transportation Systems;2024-07

4. Spatio-temporal Idle Routing for Green Mobility;2024 25th IEEE International Conference on Mobile Data Management (MDM);2024-06-24

5. Agent Guidance in Autonomous Mobility on Demand Systems: An Approach Utilizing Priority Double Deep-Q-Networks;2024 IEEE 14th Annual Computing and Communication Workshop and Conference (CCWC);2024-01-08

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3