Semantic segmentation network stacking with genetic programming-Reference-Cited by-同舟云学术

Semantic segmentation network stacking with genetic programming

Published:2023-10-26 Issue:2 Volume:24 Page:
ISSN:1389-2576
Container-title:Genetic Programming and Evolvable Machines
language:en
Short-container-title:Genet Program Evolvable Mach

Author:

Bakurov Illya,Buzzelli Marco,Schettini Raimondo,Castelli Mauro,Vanneschi Leonardo

Abstract

AbstractSemantic segmentation consists of classifying each pixel of an image and constitutes an essential step towards scene recognition and understanding. Deep convolutional encoder–decoder neural networks now constitute state-of-the-art methods in the field of semantic segmentation. The problem of street scenes’ segmentation for automotive applications constitutes an important application field of such networks and introduces a set of imperative exigencies. Since the models need to be executed on self-driving vehicles to make fast decisions in response to a constantly changing environment, they are not only expected to operate reliably but also to process the input images rapidly. In this paper, we explore genetic programming (GP) as a meta-model that combines four different efficiency-oriented networks for the analysis of urban scenes. Notably, we present and examine two approaches. In the first approach, we represent solutions as GP trees that combine networks’ outputs such that each output class’s prediction is obtained through the same meta-model. In the second approach, we propose representing solutions as lists of GP trees, each designed to provide a unique meta-model for a given target class. The main objective is to develop efficient and accurate combination models that could be easily interpreted, therefore allowing gathering some hints on how to improve the existing networks. The experiments performed on the Cityscapes dataset of urban scene images with semantic pixel-wise annotations confirm the effectiveness of the proposed approach. Specifically, our best-performing models improve systems’ generalization ability by approximately 5% compared to traditional ensembles, 30% for the less performing state-of-the-art CNN and show competitive results with respect to state-of-the-art ensembles. Additionally, they are small in size, allow interpretability, and use fewer features due to GP’s automatic feature selection.

Funder

Fundação para a Ciência e a Tecnologia

Universidade Nova de Lisboa

Publisher

Springer Science and Business Media LLC

Subject

Computer Science Applications,Hardware and Architecture,Theoretical Computer Science,Software

Link

https://link.springer.com/content/pdf/10.1007/s10710-023-09464-0.pdf

Reference67 articles.

1. D. Agnelli, A. Bollini, L. Lombardi, Image classification: an evolutionary approach. Pattern Recognit. Lett. 23(1), 303–309 (2002). https://doi.org/10.1016/S0167-8655(01)00128-3

2. H. Al-Sahaf, A. Song, K. Neshatian, M. Zhang, Two-tier genetic programming: towards raw pixel-based image classification. Expert Syst. Appl. 39(16), 12291–12301 (2012). https://doi.org/10.1016/j.eswa.2012.02.123

3. V. Badrinarayanan, A. Kendall, R. Cipolla, Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(12), 2481–2495 (2017). https://doi.org/10.1109/TPAMI.2016.2644615

4. Bakurov, I., Buzzelli, M., Castelli, M., Schettini, R., Vanneschi, L.: Genetic programming for structural similarity design at multiple spatial scales. in Proceedings of the Genetic and Evolutionary Computation Conference, GECCO ’22, p. 911-919. Association for Computing Machinery, New York, NY, USA (2022). https://doi.org/10.1145/3512290.3528783

5. I. Bakurov, M. Buzzelli, R. Schettini, M. Castelli, L. Vanneschi, Full-reference image quality expression via genetic programming. IEEE Trans. Image Process. 32, 1458–1473 (2023). https://doi.org/10.1109/TIP.2023.3244662