Affiliation:
1. School of Software Technology, Zhejiang University, Ningbo 315048, China
2. School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
Abstract
Current age estimation datasets often have a skewed long-tail distribution with significant data imbalance, rather than an ideal uniform distribution for each category. The existing age estimation algorithms that rely on label distribution do not leverage data density information to address the issue of data imbalance. To solve the aforementioned problem, this paper proposes a novel method based on cost-sensitive learning, namely Data-Imbalance Adaptive Age Regression (DIAAR), for age estimation. DIAAR consists of two main modules: the adaptive soft label (ASL) module and the Data Density Smoothing (DDS) module. The ASL module embeds soft labels in the form of probability in the age regression. It assigns different degrees of soft labels adaptively to head and tail data based on their density, which helps balance the dataset. The DDS module further addresses data imbalance by revising data density through kernel smoothing and reweighting the loss function accordingly. Experiments on two benchmark datasets show that DIAAR can effectively deal with data imbalance and improve the accuracy of age estimation, achieving an average improvement of 8% over the baseline models. Moreover, this approach can be applied to various methods based on convolutional neural network models.
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science