1. Bucher, M., et al.: Zero-shot semantic segmentation. In: NeurIPS (2019)
2. Cen, J., et al.: Segment anything in 3D with NeRFs (2023)
3. Cha, J., et al.: Learning to generate text-grounded mask for open-world semantic segmentation from only image-text pairs. In: CVPR (2023)
4. Cho, S., et al.: CAT-Seg: cost aggregation for open-vocabulary semantic segmentation. CoRR (2023)
5. Devlin, J., et al.: BERT: pre-training of deep bidirectional transformers for language understanding. In: NAACL (2019)