1. Mask2Former for video instance segmentation;cheng;CVPR,0
2. Zero-shot text-to-image generation;ramesh;ICML,0
3. Mask2Former for video instance segmentation;cheng;ArXiv,2021
4. Learning transferable visual models from natural language supervision;radford;ICML,0
5. XMem: Long-term video object segmentation with an Atkinson-Shiffrin memory model;cheng;ECCV,0