Abstract
Semantic segmentation is a fundamental problem
for computer vision. On the other hand, for studies in the field
of biometrics, semantic segmentation is gaining more importance.
Many successful biometric recognition systems require a high-
performance semantic segmentation algorithm. In this study, we
present an effective ear segmentation technique in natural images.
A convolutional neural network is trained for pixel-based ear
segmentation. DeepLab v3+ network structure, with ResNet-18 as
the backbone and Tversky lost function layer as the last layer, has
been trained with natural and uncontrolled images. We perform
the proposed network training using only the 750 images in the
Annotated Web Ears (AWE) training set. The corresponding tests
are performed on the AWE Test Set, University of Ljubljana
Test Set, and the Collection A of In-The-Wild dataset. For the
Annotated Web Ears (AWE) dataset, intersection over union
(IoU) is measured as 86.3% for the AWE database. To the best of
our knowledge, this is the highest performance achieved among
the algorithms tested on the AWE test set.
Publisher
Balkan Journal of Electrical & Computer Engineering (BAJECE)