POLIDriving: A Public-Access Driving Dataset for Road Traffic Safety Analysis
-
Published:2024-07-19
Issue:14
Volume:14
Page:6300
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Marcillo Pablo1ORCID, Arciniegas-Ayala Cristian1ORCID, Valdivieso Caraguay Ángel Leonardo1ORCID, Sanchez-Gordon Sandra1ORCID, Hernández-Álvarez Myriam1ORCID
Affiliation:
1. Departamento de Informática y Ciencias de la Computación, Escuela Politécnica Nacional, Ladrón de Guevara E11-25 y Andalucía, Edificio de Sistemas, Quito 170525, Ecuador
Abstract
The problems with current driving datasets are their exclusivity to autonomous driving applications and their limited diversity in terms of sources of information and number of attributes. Thus, this paper presents a novel driving dataset that contains information from several heterogeneous sources and targets road traffic safety applications. We used an acquisition module based on software and hardware to collect information from a vehicle scanner and a health monitor. This module also consumes information from a weather web service and databases on traffic accidents and road geometric characteristics. For the acquisition sessions, drivers of different ages and genders drove vehicles on two routes at different day hours in different weather conditions. POLIDriving contains around 18 h of driving data, more than 61k observations, and 32 attributes. Unlike the other related datasets that include information on vehicle and road conditions, POLIDriving also includes information on the driver, weather conditions, traffic accidents, and road geometric characteristics. The dataset was tested in learning models to predict the risk levels of suffering a traffic accident. Hence, we built two learning models: Gradient Boosting Machine (GBM) and Multilayer Perceptron (MLP). GBM reached an accuracy value of 95.6%, and MLP reached an accuracy of 98.6%. Undoubtedly, POLIDriving will contribute greatly to the research on traffic accident prevention by providing a novel, numerous, and diverse driving dataset.
Funder
Escuela Politécnica Nacional
Reference25 articles.
1. World Health Organization (2023). WHO Global Status Report on Road Safety 2023, WHO. 2. Marcillo, P., Valdivieso Caraguay, Á.L., and Hernández-Álvarez, M. (2022). A Systematic Literature Review of Learning-Based Traffic Accident Prediction Models Based on Heterogeneous Sources. Appl. Sci., 12. 3. Geyer, J., Kassahun, Y., Mahmudi, M., Ricou, X., Durgesh, R., Chung, A.S., Hauswald, L., Pham, V.H., Mühlegg, M., and Dorn, S. (2020). A2d2: Audi autonomous driving dataset. arXiv. 4. Vision meets robotics: The kitti dataset;Geiger;Int. J. Robot. Res.,2013 5. Yu, F., Chen, H., Wang, X., Xian, W., Chen, Y., Liu, F., Madhavan, V., and Darrell, T. (2020, January 14–19). Bdd100k: A diverse driving dataset for heterogeneous multitask learning. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.
|
|