Impact of Data Loss on Multi-Step Forecast of Traffic Flow in Urban Roads Using K-Nearest Neighbors-Reference-Cited by-同舟云学术

Impact of Data Loss on Multi-Step Forecast of Traffic Flow in Urban Roads Using K-Nearest Neighbors

Published:2022-09-07 Issue:18 Volume:14 Page:11232
ISSN:2071-1050
Container-title:Sustainability
language:en
Short-container-title:Sustainability

Author:

Mallek Amin^ORCID,Klosa Daniel,Büskens Christof

Abstract

Data-driven models have recently proved to be a very powerful tool to extract relevant information from different kinds of datasets. However, datasets are often subject to multiple anomalies, including the loss of important parts of entries. In the context of intelligent transportation, we examine in this paper the impact of data loss on the behavior of one of the frequently used approaches to address this kind of problems in the literature, namely, the k-nearest neighbors model. The method designed herein is set to perform multi-step traffic flow forecasts in urban roads. In our study, we deploy non-prepossessed real data recorded by seven inductive loop detectors and delivered by the Traffic Management Center (VMZ) of Bremen (Germany). Firstly, we measure the performance of the model on a complete dataset of 11 weeks. The same dataset is then used to artificially create 50 incomplete datasets with different gap sizes and completeness levels. Afterwards, in order to reconstruct these datasets, we propose three computationally-low techniques, which proved through empirical testing to be efficient in reproducing missing entries. Thereafter, the performance of the E-KNN model is assessed under the original dataset, incomplete and filled-in datasets. Although the accuracy of E-KNN under incomplete and reconstructed datasets depends on gap lengths and completeness levels, under original dataset, the model proves to deliver six-step forecasts with an accuracy of 83% on average over 3 weeks of the test set, which also translates to a less than one car per minute error.

Publisher

MDPI AG

Subject

Management, Monitoring, Policy and Law,Renewable Energy, Sustainability and the Environment,Geography, Planning and Development,Building and Construction

Link

https://www.mdpi.com/2071-1050/14/18/11232/pdf

Reference33 articles.

1. Short-term traffic flow prediction using seasonal ARIMA model with limited input data

2. STARIMA-based traffic prediction with time-varying lags;Duan;Proceedings of the 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC),2016

3. Combining kohonen maps with arima time series models to forecast traffic flow

4. A combined method for short-term traffic flow prediction based on recurrent neural network

5. Short-Term Traffic Flow Prediction Using the Modified Elman Recurrent Neural Network Optimized Through a Genetic Algorithm

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A decentralized trust inference approach with intelligence to improve data collection quality for mobile crowd sensing;Information Sciences;2023-10

2. High-Level K-Nearest Neighbors (HLKNN): A Supervised Machine Learning Model for Classification Analysis;Electronics;2023-09-10

3. Low Cost Evolutionary Neural Architecture Search (LENAS) Applied to Traffic Forecasting;Machine Learning and Knowledge Extraction;2023-07-28

4. Data-Driven Analysis of Fatal Urban Traffic Accident Characteristics and Safety Enhancement Research;Sustainability;2023-02-10