Affiliation:
1. Department of Computer Science and Technology, Suzhou College of Information Technology, Suzhou, China
2. School of Electrical and Information, Zhenjiang College, Zhenjiang, China
Abstract
The rapid development of the transportation industry has brought potential safety hazards. To address driving safety, applying artificial intelligence to safe driving behavior recognition can effectively reduce the accident rate and economic losses. Because driving monitoring video sequences contain interference such as mixed spatiotemporal background signals, the recognition accuracy for small targets such as human eyes is low. In this paper, an improved dual-stream convolutional network is proposed to recognize safe driving behavior. Based on convolutional neural networks (CNNs), an attention mechanism (AM) is integrated into a long short-term memory (LSTM) neural network structure, and a hybrid dual-stream AM-LSTM convolutional network channel is designed. The spatial stream channel uses a CNN to extract spatial feature values from the video images and replaces traditional pooling with pyramid pooling, which normalizes the scale transformation. The temporal stream channel applies a single-shot multibox detector (SSD) algorithm to pairs of adjacent frames in the video sequence to detect small objects such as faces and eyes. AM-LSTM is then used to fuse and classify the dual-stream information. A self-built driving behavior video image set is constructed. ROC, accuracy, and loss function experiments are carried out on the FDDB database, the VOT100 data set, and the self-built video image set, respectively. Compared with CNN, SSD, IDT, and dual-stream recognition methods, the proposed method improves the accuracy rate by at least 1.4%, and the average absolute error in four video sequences improves by more than 2%. In addition, on the self-built image set, the recognition rate for dozing reaches 68.3%, which is higher than that of the other methods. The experimental results show that the method has good recognition accuracy and practical application value.
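The pyramid pooling step mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `spatial_pyramid_pool`, the grid levels `(1, 2, 4)`, and the use of max pooling are assumptions chosen for illustration. It only shows the general idea that pooling over a fixed set of grids yields a feature vector of the same length regardless of the input size.

```python
import numpy as np


def spatial_pyramid_pool(feature_map, levels=(1, 2, 4)):
    """Max-pool a (C, H, W) feature map over n x n grids and concatenate.

    Hypothetical helper: `levels` and max pooling are illustrative
    assumptions, not values taken from the paper.
    """
    channels, height, width = feature_map.shape
    pooled = []
    for n in levels:
        # Integer bin edges that cover the full axis even when the
        # size is not divisible by n.
        row_edges = np.linspace(0, height, n + 1).astype(int)
        col_edges = np.linspace(0, width, n + 1).astype(int)
        for i in range(n):
            for j in range(n):
                r0 = row_edges[i]
                c0 = col_edges[j]
                # Guarantee every cell contains at least one element.
                r1 = max(row_edges[i + 1], r0 + 1)
                c1 = max(col_edges[j + 1], c0 + 1)
                cell = feature_map[:, r0:r1, c0:c1]
                pooled.append(cell.max(axis=(1, 2)))
    # Output length is C * sum(n * n for n in levels).
    return np.concatenate(pooled)


rng = np.random.default_rng(0)
small = rng.standard_normal((8, 13, 17))
large = rng.standard_normal((8, 32, 24))
vec_small = spatial_pyramid_pool(small)
vec_large = spatial_pyramid_pool(large)
```

Both `vec_small` and `vec_large` have the same length, 8 × (1 + 4 + 16), even though the two inputs differ in height and width, which is the property that lets a fixed-size classifier follow a variable-size feature map.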
Funder
National Natural Science Foundation of China
Subject
Electrical and Electronic Engineering, General Computer Science, Signal Processing
Cited by
5 articles.
1. Design and application of airport face image detection system;International Conference on Remote Sensing Technology and Survey Mapping (RSTSM 2024);2024-05-16
2. Accounting Intelligent Monitoring System Based on Data Analysis Algorithm;2023 International Conference on Ambient Intelligence, Knowledge Informatics and Industrial Electronics (AIKIIE);2023-11-02
3. Pseudo 3D Pose Recognition Network;IEEE Access;2023
4. A New Multiface Target Detection Algorithm for Students in Class Based on Bayesian Optimized YOLOv3 Model;Journal of Electrical and Computer Engineering;2022-01-04
5. Applying the AdaBoost Face Detection Algorithm to Detect Inattentive States;2021 4th International Conference on Information Systems and Computer Aided Education;2021-09-24