Affiliation:
1. Huazhong University of Science and Technology, Wuhan, China
2. Macquarie University, Sydney, Australia
3. Hong Kong University of Science and Technology, Kowloon, Hong Kong
Abstract
Online ride-hailing platforms have significantly reduced taxi idle time and passenger waiting time. As a key component of these platforms, the fleet management problem can be naturally modeled as a Markov Decision Process, which enables the use of deep reinforcement learning. However, existing studies rely on simplified problem settings that fail to capture the complicated supply-demand dynamics, which limits their performance in real traffic environments. In this article, we propose a supply-demand-aware deep reinforcement learning algorithm for taxi dispatching, in which a deep Q-network with an action sampling policy, called AS-DQN, learns an optimal dispatching policy. Furthermore, we adopt a dueling network architecture, called AS-DDQN, to improve on the performance of AS-DQN. Extensive experiments on real-world datasets offer insight into the performance of our model and show that it outperforms the baseline approaches.
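To make the two ingredients named in the abstract concrete, the following is a minimal illustrative sketch of (a) the dueling Q-value aggregation and (b) greedy action selection over a sampled subset of feasible dispatch actions. It is not the paper's implementation: the network sizes, the random weights, and the helper names (`dueling_q`, `sampled_greedy_action`) are all assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

STATE_DIM, NUM_ACTIONS, HIDDEN = 8, 10, 16

# Randomly initialized weights stand in for a trained network.
W1 = rng.normal(scale=0.1, size=(STATE_DIM, HIDDEN))
w_v = rng.normal(scale=0.1, size=(HIDDEN, 1))            # value stream V(s)
w_a = rng.normal(scale=0.1, size=(HIDDEN, NUM_ACTIONS))  # advantage stream A(s, a)

def dueling_q(state):
    """Dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean_a A(s, a)."""
    h = np.tanh(state @ W1)
    v = h @ w_v          # shape (1,)
    a = h @ w_a          # shape (NUM_ACTIONS,)
    return v + a - a.mean(keepdims=True)

def sampled_greedy_action(state, feasible, k=4):
    """Action sampling: score only k candidates drawn from the feasible
    set (e.g. nearby zones with idle taxis) and pick the best of them."""
    candidates = rng.choice(feasible, size=min(k, len(feasible)), replace=False)
    q = dueling_q(state)
    return int(candidates[np.argmax(q[candidates])])

state = rng.normal(size=STATE_DIM)      # a placeholder supply-demand state
feasible = np.arange(NUM_ACTIONS)       # here, every dispatch action is feasible
action = sampled_greedy_action(state, feasible)
print("chosen dispatch action:", action)
```

In a full training loop, the chosen action would be executed in the traffic simulator and the transition stored for Q-learning updates; the subtraction of the mean advantage is the standard identifiability trick of dueling architectures.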
Funder
NSFC
Hubei Natural Science Foundation
Fundamental Research Funds for the Central Universities
Publisher
Association for Computing Machinery (ACM)
Subject
Artificial Intelligence, Theoretical Computer Science
Cited by
10 articles.