Abstract
Abstract
In the contemporary realm of human portrait image segmentation tasks, achieving precise separation of foreground and background within intricate background and foreground edge scenarios poses a formidable challenge. In this study, we introduce a robust, efficient, and remarkably accurate image segmentation model named "MolexNet". In the domain of human portrait image segmentation, our model outperforms the majority of network architecture models in the field. What sets our model apart from conventional lightweight segmentation models is the clever fusion of features at different scales from the input image within the decoder. This enhancement significantly enhances processing speed while maintaining the quality of image segmentation, resulting in exceptional results for human portrait image segmentation tasks. We rigorously evaluate this model using various test datasets, and it consistently exhibits outstanding performance in both image and video segmentation tasks. It adeptly handles complex scenes and dynamic foreground elements, yielding satisfying segmentation results. Moreover, due to its ingenious design concepts, precise optimization strategies, and efficient model structure, our model maintains a high level of accuracy. We also conduct comparative analyses with other open-source architectures to underscore the superior performance and computational efficiency of our model. The model presented in this paper serves as a valuable reference and source of inspiration for research and practical applications in the field of image segmentation.
Publisher
Research Square Platform LLC
Reference34 articles.
1. Le, and Hartwig Adam;Howard Andrew
2. Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, and Liang-Chieh Chen. Mobilenetv2: Inverted residuals and linear bottlenecks, 2019. https://doi.org/10.48550/arXiv.1801.04381
3. Zhanghan Ke, Jiayu Sun, Kaican Li, Qiong Yan, Rynson W.H. Lau. MODNet: Real-Time Trimap-Free Portrait Matting via Objective Decomposition, 2022. https://doi.org/10.48550/arXiv.2011.1196
4. Imagenet classification with deep convolutional neural networks;Krizhevsky Alex,2012
5. Shanchuan Lin, Linjie Yang, et al. Robust High-Resolution Video Matting with Temporal Guidance, 2021. https://doi.org/10.48550/arXiv.2108.11515