Abstract
Sensing transparent objects has many applications in daily life, including robot navigation and grasping. However, the task is challenging because the appearance of a transparent object depends on the unpredictable scene that lies behind it. This paper addresses transparent object segmentation with a Transformer-based approach. We design a Query Parsing Module (QPM) that formulates the transparent object segmentation task as a dictionary look-up problem and uses a set of learnable class prototypes as query inputs. Based on QPM, we propose a high-performance, transformer-based, end-to-end segmentation model, Transparent Object Segmentation through Query (TOSQ). TOSQ's encoder is based on the SegFormer backbone, and its decoder consists of a series of QPMs. On the Trans10K-V2 dataset, TOSQ significantly outperforms almost all CNN-based and transformer-based methods, demonstrating its advantages and potential for the semantic segmentation of transparent objects in everyday scenes. The code is publicly available at https://github.com/ldepn/tosq.
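As a rough illustration of the dictionary look-up idea, the sketch below treats each class prototype as a query and scores it against every pixel feature with a scaled dot product, then assigns each pixel the best-matching class. It is only one plausible reading of the abstract and not the authors' QPM: the function name `query_lookup`, the fixed two-dimensional prototypes, and the softmax over classes are all assumptions made for illustration. In a trained model the prototypes would be learnable parameters and the pixel features would come from the encoder.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def query_lookup(prototypes, pixel_features):
    """Label each pixel by comparing its feature with every class prototype.

    prototypes:     K class-prototype vectors (the queries), each of length D.
                    Hypothetical stand-ins for learnable parameters.
    pixel_features: N pixel feature vectors (the dictionary entries), length D.
    Returns (labels, probs): the argmax class per pixel and the per-pixel
    class probabilities.
    """
    scale = 1.0 / math.sqrt(len(prototypes[0]))  # scaled dot-product similarity
    labels, probs = [], []
    for feat in pixel_features:
        scores = [scale * sum(q * f for q, f in zip(proto, feat))
                  for proto in prototypes]
        p = softmax(scores)
        probs.append(p)
        labels.append(max(range(len(p)), key=p.__getitem__))
    return labels, probs


# Toy example: two hypothetical classes and three pixels.
prototypes = [[1.0, 0.0], [0.0, 1.0]]
pixels = [[0.9, 0.1], [0.2, 0.8], [0.5, 0.4]]
labels, probs = query_lookup(prototypes, pixels)
print(labels)
```

Each pixel ends up with the class whose prototype it resembles most, which is the sense in which segmentation becomes a look-up keyed by class queries.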
Publisher
Research Square Platform LLC