Alleviating Long-Tailed Image Classification via Dynamical Classwise Splitting
-
Published:2023-07-05
Issue:13
Volume:11
Page:2996
-
ISSN:2227-7390
-
Container-title:Mathematics
-
language:en
-
Short-container-title:Mathematics
Author:
Yuan Ye1, Wang Jiaqi1, Xu Xin1, Li Ruoshi1, Zhu Yongtong1, Wan Lihong1, Li Qingdu1, Liu Na1
Affiliation:
1. Institute of Machine Intelligence, University of Shanghai for Science and Technology, Shanghai 200093, China
Abstract
With the rapid increase in data scale, real-world datasets tend to exhibit long-tailed class distributions (i.e., a few classes account for most of the data, while most classes contain only a few data points). General solutions typically exploit class rebalancing strategies involving resampling and reweighting based on the sample number for each class. In this work, we explore an orthogonal direction, category splitting, which is motivated by the empirical observation that naive splitting of majority samples could alleviate the heavy imbalance between majority and minority classes. To this end, we propose a novel classwise splitting (CWS) method built upon a dynamic cluster, where classwise prototypes are updated using a moving average technique. CWS generates intra-class pseudo labels for splitting intra-class samples based on the point-to-point distance. Moreover, a group mapping module was developed to recover the ground truth of the training samples. CWS can be plugged into any existing method as a complement. Comprehensive experiments were conducted on artificially induced long-tailed image classification datasets, such as CIFAR-10-LT, CIFAR-100-LT, and OCTMNIST. Our results show that when trained with the proposed class-balanced loss, the network is able to achieve significant performance gains on long-tailed datasets.
Funder
Young Scientists Fund of the National Natural Science Foundation of China Pujiang Talents Plan of Shanghai Artificial Intelligence Innovation and Development Special Fund of Shanghai
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference36 articles.
1. Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., and Li, F.F. (2009, January 20–25). Imagenet: A large-scale hierarchical image database. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA. 2. Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollár, P., and Zitnick, C.L. (2014, January 6–12). Microsoft coco: Common objects in context. Proceedings of the Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland. Proceedings, Part V 13. 3. Liu, Z., Miao, Z., Zhan, X., Wang, J., Gong, B., and Yu, S.X. (2019, January 15–20). Large-scale long-tailed recognition in an open world. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA. 4. A survey on long-tailed visual recognition;Yang;Int. J. Comput. Vis.,2022 5. A systematic study of the class imbalance problem in convolutional neural networks;Buda;Neural Netw.,2018
|
|