A deep learning approach for robust traffic accident information extraction from online chinese news

Author:

Ling Yancheng1,Ma Zhenliang2ORCID,Dong Xiaoxian1,Weng Xiaoxiong1

Affiliation:

1. School of Civil Engineering and Transportation South China University of Technology Guangzhou China

2. Department of Civil and Architectural Engineering KTH Royal Institute of Technology Stockholm Sweden

Abstract

AbstractRoad traffic accidents are the leading causes of injuries and fatalities. Understanding the traffic accident occurrence pattern and its contributing factors are prerequisites for effective traffic safety management. The paper proposes a deep learning approach for traffic accident recognition and information extraction from online Chinese news to extract and organize traffic accidents automatically. The approach consists of three modules, including automated news collection, news classification, and traffic accident information extraction. The automated news collection module crawls news from online sources, cleans and organizes it into a general news database with different categories of news. The news classification module robustly recognizes the traffic accident news from all types of news by fusing the sentence‐wise and context‐wise semantic news information. The accident information extraction module extracts the key attributes of traffic accidents (e.g. causes, times, locations) from news text using the SoftLexicon‐BiLSTM‐CRF method. The proposed approach is validated by comparing it with state‐of‐the‐art text mining methods using Chinese news data crawled online. The results show that the approach can achieve a high information extraction performance in terms of precision, recall, and F1‐score. It improves the performance of the best benchmark model (BiLSTM‐CRF) by 18.8% in precision and 12.08% in F1‐score. In addition, the potential value of the automatically extracted accident data is illustrated from online news in complementing traditional authority accident data to drive more effective traffic safety management in practice.

Publisher

Institution of Engineering and Technology (IET)

Reference79 articles.

1. WHO launches second global status report on road safety: Table 1

2. SCISOR: extracting information from on-line news

3. An overview of online fake news: Characterization, detection, and discussion

4. Chaulagain B. Bhatt B. Panday S.P. Shakya A. Newar D.K.P. Pandey R.K.:Casualty information extraction and analysis from news. In:Proceedings of the International ISCRAM Conference vol.2019 pp.1002–1011.IEEE Piscataway(2019)

5. Arulanandam R. Savarimuthu B.T.R. Purvis M.A.:Extracting crime information from online newspaper articles. In:Proceedings of the Second Australasian Web Conference (AWC 2014) pp.31–38. (2014)

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3