Albumentations: Fast and Flexible Image Augmentations

被引:1202
作者
Buslaev, Alexander [1 ]
Iglovikov, Vladimir I. [2 ]
Khvedchenya, Eugene [3 ]
Parinov, Alex [4 ]
Druzhinin, Mikhail [5 ]
Kalinin, Alexandr A. [6 ,7 ]
机构
[1] Mapbox, Minsk 220030, BELARUS
[2] Lyft Level 5, Palo Alto, CA 94304 USA
[3] ODSai, UA-65000 Odessa, Ukraine
[4] X5 Retail Grp, Moscow 119049, Russia
[5] Simicon, St Petersburg 195009, Russia
[6] Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA
[7] Shenzhen Res Inst Big Data, Shenzhen 518172, Guangdong, Peoples R China
关键词
data augmentation; computer vision; deep learning;
D O I
10.3390/info11020125
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data augmentation is a commonly used technique for increasing both the size and the diversity of labeled training sets by leveraging input transformations that preserve corresponding output labels. In computer vision, image augmentations have become a common implicit regularization technique to combat overfitting in deep learning models and are ubiquitously used to improve performance. While most deep learning frameworks implement basic image transformations, the list is typically limited to some variations of flipping, rotating, scaling, and cropping. Moreover, image processing speed varies in existing image augmentation libraries. We present Albumentations, a fast and flexible open source library for image augmentation with many various image transform operations available that is also an easy-to-use wrapper around other augmentation libraries. We discuss the design principles that drove the implementation of Albumentations and give an overview of the key features and distinct capabilities. Finally, we provide examples of image augmentations for different computer vision tasks and demonstrate that Albumentations is faster than other commonly used image augmentation tools on most image transform operations.
引用
收藏
页数:20
相关论文
共 74 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] [Anonymous], 2017, P NIPS W LONG BEACH
  • [3] [Anonymous], ARXIV190106345
  • [4] [Anonymous], P IEEE INT C COMP VI
  • [5] [Anonymous], 2018, P INT C LEARN REPR I
  • [6] [Anonymous], P INT C COMP VIS ICC
  • [7] [Anonymous], 2015, Nature, DOI [DOI 10.1038/NATURE14539, 10.1038/nature14539]
  • [8] [Anonymous], 2018, ACCELERATED DL RL
  • [9] [Anonymous], ARXIV200105127
  • [10] [Anonymous], 2019, P IEEE INT C COMP VI