The Challenge of Data Annotation in Deep Learning—A Case Study on Whole Plant Corn Silage-Reference-Cited by-同舟云学术

The Challenge of Data Annotation in Deep Learning—A Case Study on Whole Plant Corn Silage

Published:2022-02-18 Issue:4 Volume:22 Page:1596
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Rasmussen Christoffer Bøgelund^ORCID,Kirk Kristian,Moeslund Thomas B.^ORCID

Abstract

Recent advances in computer vision are primarily driven by the usage of deep learning, which is known to require large amounts of data, and creating datasets for this purpose is not a trivial task. Larger benchmark datasets often have detailed processes with multiple stages and users with different roles during annotation. However, this can be difficult to implement in smaller projects where resources can be limited. Therefore, in this work we present our processes for creating an image dataset for kernel fragmentation and stover overlengths in Whole Plant Corn Silage. This includes the guidelines for annotating object instances in respective classes and statistics of gathered annotations. Given the challenging image conditions, where objects are present in large amounts of occlusion and clutter, the datasets appear appropriate for training models. However, we experience annotator inconsistency, which can hamper evaluation. Based on this we argue the importance of having an evaluation form independent of the manual annotation where we evaluate our models with physically based sieving metrics. Additionally, instead of the traditional time-consuming manual annotation approach, we evaluate Semi-Supervised Learning as an alternative, showing competitive results while requiring fewer annotations. Specifically, given a relatively large supervised set of around 1400 images we can improve the Average Precision by a number of percentage points. Additionally, we show a significantly large improvement when using an extremely small set of just over 100 images, with over 3× in Average Precision and up to 20 percentage points when estimating the quality.

Funder

Innovation Fund Denmark

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/22/4/1596/pdf

Reference50 articles.

1. Maize Silage Kernel Fragment Estimation Using Deep Learning-Based Object Recognition in Non-Separated Kernel/Stover RGB Images

2. Anchor tuning in Faster R-CNN for measuring corn silage physical characteristics

3. Objects365: A Large-Scale, High-Quality Dataset for Object Detection

4. ImageNet Large Scale Visual Recognition Challenge

Cited by 21 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Computer-Simulated Virtual Image Datasets to Train Machine Learning Models for Non-Invasive Fish Detection in Recirculating Aquaculture;Sensors;2024-09-07

2. Assessing inclusion and representativeness on digital platforms for health education: Evidence from YouTube;Journal of Biomedical Informatics;2024-09

3. Continuous unsupervised domain adaptation using stabilized representations and experience replay;Neurocomputing;2024-09

4. Diagnosis of Citrus Greening Using Artificial Intelligence: A Faster Region-Based Convolutional Neural Network Approach with Convolution Block Attention Module-Integrated VGGNet and ResNet Models;Plants;2024-06-13

5. Revolutionizing Agriculture: Embracing Modern Strategies for the Management of Coffee Leaf Rust Disease;IoT and AI in Agriculture;2024