Affiliation:
1. Egnatia Odos S.A., Public Company of Maintenance & Construction, Thessaloniki, Greece
2. Department of Civil and Environmental Engineering, University of Massachusetts, Amherst, MA, USA
Abstract
Road crashes are a prevalent public health issue across the globe. The objective of this research was to develop a methodology for accurately classifying high-risk crash locations. The hypothesis of this study was that readily obtained roadway indicators can be used along with machine learning techniques to categorize locations as high crash-risk. A database containing 5,383 locations was created during 2012 to 2015 as part of the Hellenic National Road Safety Project and used to develop three binary machine learning models to classify high crash-risk locations based on roadway indicators. The three models were random forest, gradient boosting, and extra trees. This research used features engineering to reduce the number of indicators in the model, and the synthetic minority oversampling technique to address imbalances in the dataset between the minority (high crash-risk locations identified using crash reports) and majority classes (medium to low crash-risk locations identified based on local police testimonies, site inspections, and geometry analysis). Although all three models performed similarly, the extra trees model outperformed the other two on a range of performance metrics, including the area under the precision–recall curve and the F1-score. The findings revealed that design speeds, pavement markings, signage presence, and pavement condition were the most influential factors affecting roadway safety. The contribution of this research is in the development of a transferable methodology for classifying high crash-risk locations in addition to revealing key indicators for crash-risk potential, which in turn can inform cost-effective data collection and maintenance activities.
Reference50 articles.
1. CDC. Road Traffic Injuries and Deaths—a Global Problem. Centers for Disease Control and Prevention, 2023. https://www.cdc.gov/injury/features/global-road-safety/index.html. Accessed December 14, 2020.
2. WHO. Road Traffic Injuries [Fact Sheet]. World Health Organization, 2021. https://www.who.int/news-room/fact-sheets/detail/road-traffic-injuries
3. A data mining approach to characterize road accident locations
4. Identifying significant predictors of injury severity in traffic accidents using a series of artificial neural networks
5. A Data Mining Approach to Identify Key Factors of Traffic Injury Severity