Albumentations: Fast and Flexible Image Augmentations

Published: 2020-02-24 Issue: 2 Volume: 11 Page: 125
ISSN: 2078-2489
Container-title: Information
Language: en
Short-container-title: Information

Author:

Buslaev Alexander, Iglovikov Vladimir I., Khvedchenya Eugene, Parinov Alex, Druzhinin Mikhail, Kalinin Alexandr A.

Abstract

Data augmentation is a commonly used technique for increasing both the size and the diversity of labeled training sets by leveraging input transformations that preserve corresponding output labels. In computer vision, image augmentations have become a common implicit regularization technique to combat overfitting in deep learning models and are ubiquitously used to improve performance. While most deep learning frameworks implement basic image transformations, the list is typically limited to some variations of flipping, rotating, scaling, and cropping. Moreover, image processing speed varies in existing image augmentation libraries. We present Albumentations, a fast and flexible open-source library for image augmentation that offers a wide variety of image transform operations and also serves as an easy-to-use wrapper around other augmentation libraries. We discuss the design principles that drove the implementation of Albumentations and give an overview of the key features and distinct capabilities. Finally, we provide examples of image augmentations for different computer vision tasks and demonstrate that Albumentations is faster than other commonly used image augmentation tools on most image transform operations.

Publisher

MDPI AG

Subject

Information Systems

Link

https://www.mdpi.com/2078-2489/11/2/125/pdf

References: 82 articles.

1. Deep learning

2. Simplifying Neural Networks by Soft Weight-Sharing

3. The Problem of Overfitting

4. Regularization for deep learning: A taxonomy;Kukačka;arXiv,2017

Cited by 1309 articles.

1. Adapting the segment anything model for multi-modal retinal anomaly detection and localization;Information Fusion;2025-01

2. Advancing histopathology in Health 4.0: Enhanced cell nuclei detection using deep learning and analytic classifiers;Computer Standards & Interfaces;2025-01

3. Real-time placental vessel segmentation in fetoscopic laser surgery for Twin-to-Twin Transfusion Syndrome;Medical Image Analysis;2025-01

4. Automated computer vision based individual salmon (Salmo salar) breathing rate estimation (SaBRE) for improved state observability;Aquaculture;2025-01

5. Deep learning for tubes and lines detection in critical illness: Generalizability and comparison with residents;European Journal of Radiology Open;2024-12