Research on Text Fault Recognition for On-board Equipment of C3 Train Control System Based on Integrated XGBoost Algorithm
Author:
Yue Lili1,
Liu Luyue1,
Li Maoqing1,
Xiao Baodi12,
Wu Xiaochun1
Affiliation:
1. School of Automation and Electrical Engineering, Lanzhou Jiaotong University , Lanzhou 730070 , China
2. Beijing Consen Traffic Technology Co., Ltd. , Beijing 101318 , China
Abstract
Abstract
The robust guarantee of train control on-board equipment is inextricably linked to the safe functioning of a high-speed train. A fault diagnostic model of on-board equipment is built utilizing the integrated learning XGBoost (eXtreme Gradient Boosting) algorithm to help technicians assess the malfunction category of high-speed train control on-board equipment accurately and rapidly. XGBoost algorithm iterates multiple decision tree models to improve the accuracy of fault diagnosis by lifting the predicted residual and adding regular terms. To begin, the text features were extracted using the improved TF-IDF (Term Frequency–Inverse Document Frequency) approach, and 24 fault feature words were chosen and converted into weight word vectors. Secondly, considering the imbalanced fault categories in the data set, ADASYN (Adaptive Synthetic sampling) adaptive synthetically oversampling technique was used to synthesize a few category fault samples. Finally, the data samples were split into training and test sets based on the fault text data of CTCS-3 train control on-board equipment recorded by Guangzhou Railway Group maintenance personnel. The XGBoost model was utilized to realize the automatic fault location of the test set after optimized parameter tuning through grid search. Compared with other methods, the evaluation index of the XGBoost model was significantly improved. The diagnostic accuracy reached 95.43%, which verifies the effectiveness of the method in text fault diagnosis.
Publisher
Oxford University Press (OUP)
Subject
Engineering (miscellaneous),Safety, Risk, Reliability and Quality,Control and Systems Engineering