Do We Train on Test Data? Purging CIFAR of Near-Duplicates-Reference-Cited by-同舟云学术

Do We Train on Test Data? Purging CIFAR of Near-Duplicates

Published:2020-06-02 Issue:6 Volume:6 Page:41
ISSN:2313-433X
Container-title:Journal of Imaging
language:en
Short-container-title:J. Imaging

Author:

Barz Björn^ORCID,Denzler Joachim^ORCID

Abstract

The CIFAR-10 and CIFAR-100 datasets are two of the most heavily benchmarked datasets in computer vision and are often used to evaluate novel methods and model architectures in the field of deep learning. However, we find that 3.3% and 10% of the images from the test sets of these datasets have duplicates in the training set. These duplicates are easily recognizable by memorization and may, hence, bias the comparison of image recognition techniques regarding their generalization capability. To eliminate this bias, we provide the “fair CIFAR” (ciFAIR) dataset, where we replaced all duplicates in the test sets with new images sampled from the same domain. The training set remains unchanged, in order not to invalidate pre-trained models. We then re-evaluate the classification performance of various popular state-of-the-art CNN architectures on these new test sets to investigate whether recent research has overfitted to memorizing data instead of learning abstract concepts. We find a significant drop in classification accuracy of between 9% and 14% relative to the original performance on the duplicate-free test set. We make both the ciFAIR dataset and pre-trained models publicly available and furthermore maintain a leaderboard for tracking the state of the art.

Funder

Deutsche Forschungsgemeinschaft

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Graphics and Computer-Aided Design,Computer Vision and Pattern Recognition,Radiology, Nuclear Medicine and imaging

Link

https://www.mdpi.com/2313-433X/6/6/41/pdf

Reference24 articles.

1. ImageNet Large Scale Visual Recognition Challenge

2. Wide Residual Networks

Cited by 26 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Exploring Classifiers with Differentiable Decision Boundary Maps;Computer Graphics Forum;2024-06

2. Deep Learning for Image Classification: A Review;Lecture Notes in Electrical Engineering;2024

3. A Sentiment Analysis Benchmark for Automated Machine Learning Applications;2023 International Conference on Machine Learning and Applications (ICMLA);2023-12-15

4. GUANinE v1.0: Benchmark Datasets for Genomic AI Sequence-to-Function Models;2023-10-17

5. USC-DCT: A Collection of Diverse Classification Tasks;Data;2023-10-12