An IoT System Using Deep Learning to Classify Camera Trap Images on the Edge-Reference-Cited by-同舟云学术

An IoT System Using Deep Learning to Classify Camera Trap Images on the Edge

Published:2022-01-13 Issue:1 Volume:11 Page:13
ISSN:2073-431X
Container-title:Computers
language:en
Short-container-title:Computers

Author:

Zualkernan Imran^ORCID,Dhou Salam^ORCID,Judas Jacky,Sajun Ali Reza^ORCID,Gomez Brylle Ryan,Hussain Lana Alhaj

Abstract

Camera traps deployed in remote locations provide an effective method for ecologists to monitor and study wildlife in a non-invasive way. However, current camera traps suffer from two problems. First, the images are manually classified and counted, which is expensive. Second, due to manual coding, the results are often stale by the time they get to the ecologists. Using the Internet of Things (IoT) combined with deep learning represents a good solution for both these problems, as the images can be classified automatically, and the results immediately made available to ecologists. This paper proposes an IoT architecture that uses deep learning on edge devices to convey animal classification results to a mobile app using the LoRaWAN low-power, wide-area network. The primary goal of the proposed approach is to reduce the cost of the wildlife monitoring process for ecologists, and to provide real-time animal sightings data from the camera traps in the field. Camera trap image data consisting of 66,400 images were used to train the InceptionV3, MobileNetV2, ResNet18, EfficientNetB1, DenseNet121, and Xception neural network models. While performance of the trained models was statistically different (Kruskal–Wallis: Accuracy H(5) = 22.34, p < 0.05; F1-score H(5) = 13.82, p = 0.0168), there was only a 3% difference in the F1-score between the worst (MobileNet V2) and the best model (Xception). Moreover, the models made similar errors (Adjusted Rand Index (ARI) > 0.88 and Adjusted Mutual Information (AMU) > 0.82). Subsequently, the best model, Xception (Accuracy = 96.1%; F1-score = 0.87; F1-Score = 0.97 with oversampling), was optimized and deployed on the Raspberry Pi, Google Coral, and Nvidia Jetson edge devices using both TenorFlow Lite and TensorRT frameworks. Optimizing the models to run on edge devices reduced the average macro F1-Score to 0.7, and adversely affected the minority classes, reducing their F1-score to as low as 0.18. Upon stress testing, by processing 1000 images consecutively, Jetson Nano, running a TensorRT model, outperformed others with a latency of 0.276 s/image (s.d. = 0.002) while consuming an average current of 1665.21 mA. Raspberry Pi consumed the least average current (838.99 mA) with a ten times worse latency of 2.83 s/image (s.d. = 0.036). Nano was the only reasonable option as an edge device because it could capture most animals whose maximum speeds were below 80 km/h, including goats, lions, ostriches, etc. While the proposed architecture is viable, unbalanced data remain a challenge and the results can potentially be improved by using object detection to reduce imbalances and by exploring semi-supervised learning.

Publisher

MDPI AG

Subject

Computer Networks and Communications,Human-Computer Interaction

Link

https://www.mdpi.com/2073-431X/11/1/13/pdf

Reference73 articles.

1. Snap happy: camera traps are an effective sampling tool when compared with alternative methods

2. Camera‐trapping version 3.0: current constraints and future priorities for development

3. Backpropagation Applied to Handwritten Zip Code Recognition

4. Deep Learning Guinea Pig Image Classification Using Nvidia DIGITS and GoogLeNet;Zmudzinski,2018

Cited by 28 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Reliable and efficient integration of AI into camera traps for smart wildlife monitoring based on continual learning;Ecological Informatics;2024-11

2. Recognition of European mammals and birds in camera trap images using deep neural networks;IET Computer Vision;2024-07-03

3. Wildlife Real-Time Detection in Complex Forest Scenes Based on YOLOv5s Deep Learning Network;Remote Sensing;2024-04-11

4. Object classification and visualization with edge artificial intelligence for a customized camera trap platform;Ecological Informatics;2024-03

5. Improved Wildlife Recognition through Fusing Camera Trap Images and Temporal Metadata;Diversity;2024-02-23