1. Do as i can, not as i say: Grounding language in robotic affordances;Ahn,2022
2. Language models are few-shot learners;Brown;Advances in neural information processing systems,2020
3. End-to-End Object Detection with Transformers
4. Learning by cheating;Chen
5. Per-pixel classification is not all you need for semantic segmentation;Cheng;Advances in Neural Information Processing Systems,2021