Prediction of high-speed train delay propagation based on causal text information

Published:2022-09-12 Issue:1 Volume:31 Page:89-106
ISSN:2662-4745
Container-title:Railway Engineering Science
language:en
Short-container-title:Rail. Eng. Science

Author:

Liu Qianyi, Wang Shengjie, Li Zhongcan, Li Li, Zhang Jun, Wen Chao

Abstract

Delay-causing text data contain valuable information, such as the specific reason for a delay and the location and time of the disturbance, which can support train delay prediction and help guide train control more efficiently. Based on train operation data and delay-causing data from the Wuhan–Guangzhou high-speed railway, this study applies natural language processing algorithms to the delay-causing text data and integrates train operating-environment information with delay-causing text information to develop a cause-based train delay propagation prediction model. The Word2vec model is first used to vectorize the delay-causing text description after word segmentation. The mean model or the term frequency-inverse document frequency (TF-IDF)-weighted model is then used to generate the delay-causing sentence vector from the original word vectors. Afterward, the train operating-environment features and the delay-causing sentence vector are input into the extreme gradient boosting (XGBoost) regression algorithm to develop a delay propagation prediction model. In this work, 4 text feature processing methods and 8 regression algorithms are considered. The results demonstrate that the XGBoost regression algorithm achieves the highest prediction accuracy when using the text features processed by the continuous bag of words and mean models. Compared with a prediction model that considers only the train operating-environment features, integrating the delay-causing feature significantly improves prediction accuracy across multiple regression algorithms.
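The two sentence-vector constructions named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the segmented descriptions, the 2-dimensional word vectors (standing in for trained Word2vec output), the smoothed IDF formula, and the two operating-environment features are all hypothetical choices made for illustration.

```python
import math
from collections import Counter

# Hypothetical, already-segmented delay-cause descriptions (toy data).
docs = [
    ["signal", "failure", "station"],
    ["strong", "wind", "speed", "restriction"],
    ["signal", "failure", "section"],
]

# Toy 2-dimensional word vectors standing in for trained Word2vec output.
word_vecs = {
    "signal": [1.0, 0.0],
    "failure": [0.0, 1.0],
    "station": [1.0, 1.0],
    "section": [2.0, 0.0],
    "strong": [0.5, 0.5],
    "wind": [0.0, 2.0],
    "speed": [1.0, 3.0],
    "restriction": [3.0, 1.0],
}
DIM = 2


def mean_vector(tokens):
    """Mean model: unweighted average of the word vectors."""
    total = [0.0] * DIM
    for token in tokens:
        for i, x in enumerate(word_vecs[token]):
            total[i] += x
    return [x / len(tokens) for x in total]


def idf_table(corpus):
    """Smoothed IDF (an assumed variant): log((1 + N) / (1 + df)) + 1."""
    n = len(corpus)
    df = Counter(token for doc in corpus for token in set(doc))
    return {token: math.log((1 + n) / (1 + c)) + 1.0 for token, c in df.items()}


def tfidf_vector(tokens, idf):
    """TF-IDF-weighted model: average of word vectors weighted by tf * idf."""
    tf = Counter(tokens)
    weights = {t: (tf[t] / len(tokens)) * idf[t] for t in tf}
    norm = sum(weights.values())
    total = [0.0] * DIM
    for token, w in weights.items():
        for i, x in enumerate(word_vecs[token]):
            total[i] += w * x
    return [x / norm for x in total]


idf = idf_table(docs)
mean_sv = mean_vector(docs[0])
tfidf_sv = tfidf_vector(docs[0], idf)

# Hypothetical operating-environment features, concatenated with the
# sentence vector to form one input row for a regressor.
env_features = [5.0, 2.0]
model_input = env_features + mean_sv
```

In this toy corpus, "station" appears in only one description, so the TF-IDF-weighted vector leans toward it more than the plain mean does. A row such as `model_input` is the kind of combined feature vector that would then be passed to a regression model such as XGBoost.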

Funder

National Natural Science Foundation of China

Research and development project of China National Railway Group Co., Ltd

China Railway Chengdu Group Co. Ltd

Publisher

Springer Science and Business Media LLC

Subject

Electrical and Electronic Engineering,Computer Science Applications,Mechanical Engineering,Transportation,Computational Mechanics

Link

https://link.springer.com/content/pdf/10.1007/s40534-022-00286-x.pdf

Reference: 23 articles.

1. Wen C, Li Z, Huang P et al (2020) Cause-specific investigation of primary delays of Wuhan–Guangzhou HSR. Trans Lett 12(7):1–14

2. Wen C, Li Z, Lessan J et al (2017) Statistical investigation on train primary delay based on real records: evidence from Wuhan–Guangzhou HSR. Int J Rail Transp 5(3):170–189

3. Ye Y, Zhu B, Huang P et al (2022) OORNet: a deep learning model for on-board condition monitoring and fault diagnosis of out-of-round wheels of high-speed trains. Measurement 199. https://doi.org/10.1016/j.measurement.2022.111268

4. Kecman P, Goverde RMP (2015) Online data-driven adaptive prediction of train event times. IEEE Trans Intell Transp Syst 16(1):465–474

5. Kecman P, Corman F, Meng L (2015) Train delay evolution as a stochastic process. In: the 6th International Conference on Railway Operations Modelling and Analysis, Tokyo, pp 007–1–19

Cited by 5 articles.

1. Railway network delay evolution: A heterogeneous graph neural network approach;Applied Soft Computing;2024-07

2. Impact of feature cross in hybrid optimization based convolutional neural networks for train delay prediction;Multimedia Tools and Applications;2024-04-09

3. Data‐driven train delay prediction incorporating dispatching commands: An XGBoost‐metaheuristic framework;IET Intelligent Transport Systems;2023-12

4. Dynamic train dwell time forecasting: a hybrid approach to address the influence of passenger flow fluctuations;Railway Engineering Science;2023-06-07

5. A review of data-driven approaches to predict train delays;Transportation Research Part C: Emerging Technologies;2023-03