1. Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
2. Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020).
3. CARA system Architecture - A Click and Assemble Robotic Assembly System
4. IKEA. 2023. Lack Side Table. http://surl.li/jersp [Accessed: 15 Jul 2023].
5. Fine-Grained Activity Recognition for Assembly Videos