Vehicle Classification Algorithm Based on Improved Vision Transformer-Reference-Cited by-同舟云学术

Vehicle Classification Algorithm Based on Improved Vision Transformer

Published:2024-07-30 Issue:8 Volume:15 Page:344
ISSN:2032-6653
Container-title:World Electric Vehicle Journal
language:en
Short-container-title:WEVJ

Author:

Dong Xinlong¹^ORCID,Shi Peicheng¹,Tang Yueyue¹,Yang Li¹,Yang Aixi²^ORCID,Liang Taonian³

Affiliation:

1. School of Mechanical and Automotive Engineering, Anhui Polytechnic University, Wuhu 241000, China

2. Polytechnic Institute, Zhejiang University, Hangzhou 310015, China

3. Chery New Energy Automobile Co., Ltd., Wuhu 241000, China

Abstract

Vehicle classification technology is one of the foundations in the field of automatic driving. With the development of deep learning technology, visual transformer structures based on attention mechanisms can represent global information quickly and effectively. However, due to direct image segmentation, local feature details and information will be lost. To solve this problem, we propose an improved vision transformer vehicle classification network (IND-ViT). Specifically, we first design a CNN-In D branch module to extract local features before image segmentation to make up for the loss of detail information in the vision transformer. Then, in order to solve the problem of misdetection caused by the large similarity of some vehicles, we propose a sparse attention module, which can screen out the discernible regions in the image and further improve the detailed feature representation ability of the model. Finally, this paper uses the contrast loss function to further increase the intra-class consistency and inter-class difference of classification features and improve the accuracy of vehicle classification recognition. Experimental results show that the accuracy of the proposed model on the datasets of vehicle classification BIT-Vehicles, CIFAR-10, Oxford Flower-102, and Caltech-101 is higher than that of the original vision transformer model. Respectively, it increased by 1.3%, 1.21%, 7.54%, and 3.60%; at the same time, it also met a certain real-time requirement to achieve a balance of accuracy and real time.

Funder

Yangtze River Delta Science and Technology Innovation Community Joint Research Project

Natural Science Foundation of Anhui Province

Anhui Provincial Key Research and Development Plan

Publisher

MDPI AG

Link

https://www.mdpi.com/2032-6653/15/8/344/pdf

Reference35 articles.

1. Intelligent traffic monitoring systems for vehicle classification: A survey;Won;IEEE Access,2020

2. Wang, P., Ouyang, T., Zhao, S., Wang, X., Ni, Z., and Fan, Y. (2024). Intelligent Vehicle Formation System Based on Information Interaction. World Electr. Veh. J., 15.

3. Dai, Z., Guan, Z., Chen, Q., Xu, Y., and Sun, F. (2024). Enhanced Object Detection in Autonomous Vehicles through LiDAR—Camera Sensor Fusion. World Electr. Veh. J., 15.

4. Shi, D., Chu, F., Cai, Q., Wang, Z., Lv, Z., and Wang, J. (2024). Research on a Path Tracking Control Strategy for Autonomous Vehicles Based on State Parameter Identification. World Electr. Veh. J., 15.

5. AI-enhanced blockchain technology: A review of advancements and opportunities;Ressi;J. Netw. Comput. Appl.,2024