Author:
Shaifee Mohammad Javad,Chywl Brendan,Li Francis,Wong Alexander
Abstract
Object detection is considered one of the most challenging problemsin this field of computer vision, as it involves the combinationof object classification and object localization within a scene. Recently,deep neural networks (DNNs) have been demonstrated toachieve superior object detection performance compared to otherapproaches, with YOLOv2 (an improved You Only Look Once model)being one of the state-of-the-art in DNN-based object detectionmethods in terms of both speed and accuracy. Although YOLOv2can achieve real-time performance on a powerful GPU, it still remainsvery challenging for leveraging this approach for real-timeobject detection in video on embedded computing devices withlimited computational power and limited memory. In this paper,we propose a new framework called Fast YOLO, a fast You OnlyLook Once framework which accelerates YOLOv2 to be able toperform object detection in video on embedded devices in a realtimemanner. First, we leverage the evolutionary deep intelligenceframework to evolve the YOLOv2 network architecture and producean optimized architecture (referred to as O-YOLOv2 here) that has2.8X fewer parameters with just a 2% IOU drop. To further reducepower consumption on embedded devices while maintaining performance,a motion-adaptive inference method is introduced intothe proposed Fast YOLO framework to reduce the frequency ofdeep inference with O-YOLOv2 based on temporal motion characteristics.Experimental results show that the proposed Fast YOLOframework can reduce the number of deep inferences by an averageof 38.13%, and an average speedup of 3.3X for objectiondetection in video compared to the original YOLOv2, leading FastYOLO to run an average of 18FPS on a Nvidia Jetson TX1 embeddedsystem.
Subject
Industrial and Manufacturing Engineering
Cited by
191 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. EDet-BTR: EfficientDet-based brain tumor recognition from the magnetic resonance imaging;Biomedical Signal Processing and Control;2024-10
2. Spatial and Temporal Detection With Attention for Real-Time Video Analytics at Edges;IEEE Transactions on Mobile Computing;2024-10
3. MSPV3D: Multi-Scale Point-Voxels 3D Object Detection Net;Remote Sensing;2024-08-26
4. LIVER ULTRASOUND IMAGING LESION DETECTION BASED ON YOLO;International Journal of Engineering Technologies and Management Research;2024-07-24
5. Breaking reCAPTCHAv2;2024 IEEE 48th Annual Computers, Software, and Applications Conference (COMPSAC);2024-07-02