1. Segmentation from natural language expressions;Hu,2016
2. Recurrent multimodal interaction for referring image segmentation;Liu,2017
3. OCID-ref: A 3D robotic dataset with embodied language for clutter scene grounding;Wang,2021
4. Unambiguous scene text segmentation with referring expression comprehension;Rong;IEEE Trans. Image Process.,2020
5. Language-based image editing with recurrent attentive models;Chen,2018