Abstract
A variety of Chinese textual operational text data has been recorded during the operation and maintenance of the high-speed railway catenary system. Such defect text records can facilitate defect detection and defect severity analysis if mined efficiently and accurately. Therefore, in this context, this paper focuses on a specific problem in defect text mining, which is to efficiently extract defect-relevant information from catenary defect text records and automatically identify catenary defect severity. The specific task is transformed into a machine learning problem for defect text classification. First, we summarize the characteristics of catenary defect texts and construct a text dataset. Second, we use BERT to learn defect texts and generate word embedding vectors with contextual features, fed into the classification model. Third, we developed a deep text categorization network (DTCN) to distinguish the catenary defect level, considering the contextualized semantic features. Finally, the effectiveness of our proposed method (BERT-DTCN) is validated using a catenary defect textual dataset collected from 2016 to 2018 in the China Railway Administration in Chengdu, Lanzhou, and Hengshui. Moreover, BERT-DTCN outperforms several competitive methods in terms of accuracy, precision, recall, and F1-score value.
Funder
Natural Science Foundation of Sichuan Province
National Natural Science Foundation of China
National Key R&D Program of China
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference40 articles.
1. Predicting Wind-Caused Floater Intrusion Risk for Overhead Contact Lines Based on Bayesian Neural Network with Spatiotemporal Correlation Analysis;Wang;Reliab. Eng. Syst. Saf.,2022
2. Data-Driven Lightning-Related Failure Risk Prediction of Overhead Contact Lines Based on Bayesian Network with Spatiotemporal Fragility Model;Wang;Reliab. Eng. Syst. Saf.,2023
3. Gao, S., Wang, J., Yu, L., Zhang, D., Zhan, R., Kou, L., and Chen, K. (2022). A Multilayer Bayesian Network Approach-Based Predictive Probabilistic Risk Assessment for Overhead Contact Lines Under External Weather Conditions. IEEE Trans. Transp. Electrif., 1–18.
4. Automatic Detection and Monitoring System of Pantograph-Catenary in China’s High-Speed Railways;Gao;IEEE Trans. Instrum. Meas.,2021
5. Short Text Mining Framework with Specific Design for Operation and Maintenance of Power Equipment;Wang;CSEE J. Power Energy Syst.,2021
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Weather-Related Failure Risk Prediction of Overhead Contact Lines Based on Deep Gaussian Process;Proceedings of the 2023 7th International Conference on High Performance Compilation, Computing and Communications;2023-06-17