Modeling Pedestrian Injury Severity: A Case Study of Using Extreme Gradient Boosting Vs Random Forest in Feature Selection

Author:

Wu Zhenxi1,Misra Aditi2ORCID,Bao Shan13ORCID

Affiliation:

1. University of Michigan Transportation Research Institute, Ann Arbor, MI

2. 2Department of Civil Engineering, College of Engineering, Design and Computing, University of Colorado Denver, Denver, CO

3. Department of Industrial and Manufacturing Systems Engineering, University of Michigan-Dearborn, Dearborn, MI

Abstract

Walking and bicycling are lauded for their negative net carbon impact and for their health benefits. However, national crash statistics suggest that pedestrians are disproportionately harmed in any vehicle–pedestrian conflict situation. Although automated transportation in the future is anticipated to increase overall safety, multiple incidents involving automated vehicles have been reported recently, indicating that the technology needs more training on real-world scenarios and conflicts. This research is motivated by the need for contextual data and related levels of harm in potential conflict scenarios in mixed traffic and we use a national police reported crash dataset, CRSS, to address this need. Our study uses a new gradient boosting algorithm, XGBoost, to identify important features among a host of seemingly significant variables. We compare the performance of XGBoost with the more frequently used random forest method and find that XGBoost is more reliable and robust for handling an unbalanced and sparse dataset like crash data, and the features extracted are more aligned to findings from previous research on the topic. We also compare feature importance between NASS-GES and CRSS—two national crash databases with different sampling strategies but the same objective—and find that sampling strategy influences selection of feature importance. We further use the features extracted using XGBoost in a multiclass logistic regression to quantify the effect of these features on different levels of pedestrian injury. Our findings indicate that speed limit, light conditions, pre-crash movements, and location of pedestrian are important contributors to crash severity, along with driver distraction and impairment.

Publisher

SAGE Publications

Subject

Mechanical Engineering,Civil and Structural Engineering

Reference22 articles.

1. National Highway Traffic Safety Administration. 2019 Pedestrians Traffic Safety Fact Sheet. https://crashstats.nhtsa.dot.gov/Api/Public/ViewPublication/813079. Retrieved July 2021.

2. Hendricks D. L., Fell J. C., Freedman M., et al. The Relative Frequency of Unsafe Driving Acts in Serious Traffic Crashes. Summary Technical Report. U.S. Department of Transportation, National Highway Traffic Safety Administration, Traffic Safety Programs, Office of Research and Traffic Records, Washington D.C., 1999. https://one.nhtsa.gov/people/injury/research/udashortrpt/index.html

3. Tesla Says Autopilot Makes Its Cars Safer. Crash Victims Say It Kills. The New York Times, July 5, 2021. https://www.nytimes.com/2021/07/05/business/tesla-autopilot-lawsuits-safety.html

4. A clustering regression approach: A comprehensive injury severity analysis of pedestrian–vehicle crashes in New York, US and Montreal, Canada

5. Investigating the risk factors associated with pedestrian injury severity in Illinois

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3