An Expressway ETC Missing Data Restoration Model Considering Multi-Attribute Features
Author:
Zou Fumin1, Zhou Zhaoyi1, Cai Qiqin12ORCID, Guo Feng13, Zhang Xinyi1
Affiliation:
1. Fujian Key Laboratory for Automotive Electronics and Electric Drive, Fujian University of Technology, Fuzhou 350118, China 2. School of Mechanical Engineering and Automation, Huaqiao University, Xiamen 361021, China 3. College of Computer and Data Science, Fuzhou University, Fuzhou 350108, China
Abstract
Electronic toll collection (ETC) data mining has become one of the hotspots in the research of intelligent expressway extension applications. Ensuring the integrity of ETC data stands as a critical measure in upholding data quality. ETC data are typical structured data, and although deep learning holds great potential in the ETC data restoration field, its applications in structured data are still in the early stages. To address these issues, we propose an expressway ETC missing transaction data restoration model considering multi-attribute features (MAF). Initially, we employ an entity embedding neural network (EENN) to automatically learn the representation of categorical features in multi-dimensional space, subsequently obtaining embedding vectors from networks that have been adequately trained. Then, we use long short-term memory (LSTM) neural networks to extract the changing patterns of vehicle speeds across several continuous sections. Ultimately, we merge the processed features with other features as input, using a three-layer multilayer perceptron (MLP) to complete the ETC data restoration. To validate the effectiveness of the proposed method, we conducted extensive tests using real ETC datasets and compared it with methods commonly used for structured data restoration. The experimental results demonstrate that the proposed method significantly outperforms others in restoration accuracy on two different datasets. Specifically, our sample data size reached around 400,000 entries. Compared to the currently best method, our method improved the restoration accuracy by 19.06% on non-holiday ETC datasets. The MAE and RMSE values reached optimal levels of 12.394 and 23.815, respectively. The fitting degree of the model to the dataset also reached its peak (R2 = 0.993). Meanwhile, the restoration stability of our method on holiday datasets increased by 5.82%. An ablation experiment showed that the EENN and LSTM modules contributed 7.60% and 9% to the restoration accuracy, as well as 4.68% and 7.29% to the restoration stability. This study indicates that the proposed method not only significantly improves the quality of ETC data but also meets the timeliness requirements of big data mining analysis.
Funder
Renewable Energy Technology Research institution of Fujian University of Technology Ningde, China 2020 Fujian Province “Belt and Road” Technology Innovation Platform Provincial Candidates for the Hundred, Thousand and Ten Thousand Talent of Fujian Patent Grant project Horizontal projects Municipal level science and technology projects Fujian Provincial Department of Science and Technology Foreign Cooperation Project Open Fund project
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference45 articles.
1. Guo, F., Zou, F., Luo, S., Liao, L., Wu, J., Yu, X., and Zhang, C. (2022). The fast detection of abnormal ETC data based on an improved DTW algorithm. Electronics, 11. 2. A method for estimating traffic operation parameters based on toll data;Lai;Chin. J. Highw.,2022 3. Research on speed distribution and vehicle classification of mountain highways based on electronic non-stop toll data;Xu;Transp. Syst. Eng. Inf.,2022 4. Traffic flow estimation based on gantry ticket data considering multitype vehicle impact;Yao;J. Transp. Eng. Part A Syst.,2021 5. Visualizing potential transportation demand from ETC log analysis using ELK stack;Tsung;IEEE Internet Things J.,2020
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|