Distribution-preserving data augmentation

Author:

Saran Nurdan Ayse¹, Saran Murat¹, Nar Fatih²

Affiliation:

1. Department of Computer Engineering, Cankaya University, Ankara, Turkey

2. Department of Computer Engineering, Ankara Yildirim Beyazit University, Ankara, Turkey

Abstract

In the last decade, deep learning has been applied to a wide range of problems with tremendous success. This success mainly comes from large data availability, increased computational power, and theoretical improvements in the training phase. As a dataset grows, it represents the real world better, making it possible to develop a model that can generalize. However, creating a labeled dataset is expensive and time-consuming, and in some domains it is challenging or even infeasible. Therefore, researchers have proposed data augmentation methods that increase dataset size and variety by creating variations of the existing data. For image data, variations can be obtained by applying color transformations, spatial transformations, or a combination of both. Color transformations apply linear or nonlinear operations to the entire image or to patches of it to create variations of the original image. Current color-based augmentation methods are usually built on image processing operations such as equalizing, solarizing, and posterizing. Nevertheless, these color-based methods are not guaranteed to produce plausible variations of the image. This paper proposes a novel distribution-preserving data augmentation method that creates plausible image variations by shifting pixel colors to another point in the image color distribution. We achieve this by defining a regularized density-decreasing direction to create paths from the original pixels’ colors to the distribution tails. The proposed method outperforms existing data augmentation methods, as shown in a transfer learning scenario on the UC Merced Land-use, Intel Image Classification, and Oxford-IIIT Pet datasets for classification and segmentation tasks.
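For readers unfamiliar with the color-based baselines the abstract names, the following is a minimal NumPy sketch of solarizing, posterizing, and equalizing on 8-bit images. It illustrates only these conventional operations, not the distribution-preserving method proposed in the paper. The function names, the default threshold, and the default bit depth are assumptions chosen for illustration and do not come from the paper.

```python
import numpy as np

def solarize(img, threshold=128):
    """Invert every pixel value at or above `threshold` (assumed default)."""
    return np.where(img >= threshold, 255 - img, img).astype(np.uint8)

def posterize(img, bits=4):
    """Keep only the `bits` most significant bits of each 8-bit value."""
    shift = 8 - bits
    return ((img >> shift) << shift).astype(np.uint8)

def equalize(channel):
    """Histogram-equalize a single uint8 channel using its cumulative histogram."""
    hist = np.bincount(channel.ravel(), minlength=256)
    cdf = hist.cumsum()
    cdf_min = cdf[cdf > 0][0]   # cumulative count at the darkest value present
    total = cdf[-1]
    if total == cdf_min:        # constant image: nothing to stretch
        return channel.copy()
    # Map each intensity to its rescaled cumulative frequency in [0, 255].
    lut = np.round((cdf - cdf_min) / (total - cdf_min) * 255)
    lut = np.clip(lut, 0, 255).astype(np.uint8)
    return lut[channel]

# Tiny illustrative inputs (hypothetical values, not from any dataset).
row = np.array([[0, 100, 200, 255]], dtype=np.uint8)
solarized = solarize(row)
posterized = posterize(row, bits=2)
equalized = equalize(np.array([[10, 20, 30, 40]], dtype=np.uint8))
```

Each of these operations remaps intensities with a fixed rule that ignores where the resulting colors fall in the image's own color distribution, which is the limitation the abstract points to.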

Publisher

PeerJ

Subject

General Computer Science


Cited by 4 articles.

1. INCEPTION SH: A NEW CNN MODEL BASED ON INCEPTION MODULE FOR CLASSIFYING SCENE IMAGES;Mühendislik Bilimleri ve Tasarım Dergisi;2024-06-30

2. Application of a Voting-Based Ensemble Method for Recognizing Seven Basic Emotions in Real-Time Webcam Video Images;2024 IEEE 9th International Conference for Convergence in Technology (I2CT);2024-04-05

3. Modulation Recognition Method of Underwater Acoustic Signal Based on Parallel Network;2023 IEEE 6th International Conference on Electronic Information and Communication Technology (ICEICT);2023-07-21

4. Integrated animal monitoring system with animal detection and classification capabilities: a review on image modality, techniques, applications, and challenges;Artificial Intelligence Review;2023-06-20
