Ensemble Learning with Pre-Trained Transformers for Crash Severity Classification: A Deep NLP Approach

Published: 2024-06-30 Issue: 7 Volume: 17 Page: 284
ISSN: 1999-4893
Container-title: Algorithms
Language: en
Short-container-title: Algorithms

Author:

Jaradat Shadi¹², Nayak Richi²³, Paz Alexander⁴, Elhenawy Mohammed¹

Affiliation:

1. Centre for Accident Research & Road Safety, Queensland University of Technology, Brisbane, QLD 4000, Australia

2. Centre of Data Science, Queensland University of Technology, Brisbane, QLD 4000, Australia

3. School of Computer Science, Queensland University of Technology, Brisbane, QLD 4000, Australia

4. School of Civil Engineering, Queensland University of Technology, Brisbane, QLD 4000, Australia

Abstract

Transfer learning has gained significant traction in natural language processing due to the emergence of state-of-the-art pre-trained language models (PLMs). Unlike traditional word embedding methods such as TF-IDF and Word2Vec, PLMs are context-dependent and outperform conventional techniques when fine-tuned for specific tasks. This paper proposes an innovative hard voting classifier to enhance crash severity classification by combining machine learning and deep learning models with various word embedding techniques, including BERT, RoBERTa, Word2Vec, and TF-IDF. Our study involves two comprehensive experiments using motorists’ crash data from the Missouri State Highway Patrol. The first experiment evaluates the performance of three machine learning models, namely XGBoost (XGB), random forest (RF), and naive Bayes (NB), each paired with TF-IDF, Word2Vec, and BERT feature extraction techniques. Additionally, BERT and RoBERTa are fine-tuned with a Bidirectional Long Short-Term Memory (Bi-LSTM) classification model. All models are initially evaluated on the original dataset. The second experiment repeats the evaluation using an augmented dataset to address the severe data imbalance. The results from the original dataset show strong performance for all models in the “Fatal” and “Personal Injury” classes but poor classification of the minority “Property Damage” class. In the augmented dataset, while the models continued to excel with the majority classes, only XGB/TF-IDF and BERT-LSTM showed improved performance for the minority class. The ensemble model outperformed individual models in both datasets, achieving an F1 score of 99% for “Fatal” and “Personal Injury” and 62% for “Property Damage” on the augmented dataset. These findings suggest that ensemble models, combined with data augmentation, are highly effective for crash severity classification and potentially other textual classification tasks.
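
The hard voting step named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `hard_vote`, the tie-breaking rule (the earliest-listed model wins), and the example predictions are all assumptions made for illustration. Only the class labels and the idea of combining several base models by majority vote come from the abstract.

```python
from collections import Counter


def hard_vote(predictions_per_model):
    """Return the majority label for each sample.

    predictions_per_model: list of equal-length label lists, one per model.
    Ties go to the label predicted by the earliest-listed model
    (an assumption; the abstract does not state a tie-breaking rule).
    """
    if not predictions_per_model:
        raise ValueError("need predictions from at least one model")
    n_samples = len(predictions_per_model[0])
    if any(len(p) != n_samples for p in predictions_per_model):
        raise ValueError("all models must predict the same number of samples")

    voted = []
    for sample_votes in zip(*predictions_per_model):
        counts = Counter(sample_votes)
        top = max(counts.values())
        # Pick the first label, in model order, that reaches the top count.
        voted.append(next(v for v in sample_votes if counts[v] == top))
    return voted


# Hypothetical predictions from three base models on four crash narratives.
xgb_tfidf = ["Fatal", "Personal Injury", "Property Damage", "Fatal"]
rf_word2vec = ["Fatal", "Personal Injury", "Personal Injury", "Personal Injury"]
bert_bilstm = ["Fatal", "Fatal", "Property Damage", "Property Damage"]

ensemble = hard_vote([xgb_tfidf, rf_word2vec, bert_bilstm])
print(ensemble)
```

In the last sample every model predicts a different class, so the vote falls back to the first model's label. A real ensemble might instead break ties by model confidence or validation score.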

Publisher

MDPI AG

Link

https://www.mdpi.com/1999-4893/17/7/284/pdf

References (59 articles)

1. Oestergaard, F., Beck Kinman, S., and Ravn Pedersen, S. (2013). Control your data or drown trying. I.B.M. Nordic Blog, Available online: https://www.ibm.com/blogs/nordic-msp/control-your-data-or-drown-trying/.

2. Hotho (2005). A brief survey of text mining. J. Lang. Technol. Comput. Linguist.

3. Ramos, J. (2003, January 23–24). Using TF-IDF to determine word relevance in document queries. Proceedings of the First Instructional Conference On Machine Learning, Los Angeles, CA, USA.

4. Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., and Dean, J. (2013, January 5–10). Distributed representations of words and phrases and their compositionality. Proceedings of the 26th International Conference on Neural Information Processing Systems, Lake Tahoe, NV, USA.

5. Devlin, J. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv.

Cited by 1 article.

1. Multitask Learning for Crash Analysis: A Fine-Tuned LLM Framework Using Twitter Data. Smart Cities, 2024-09-01.