1. Michael Bleyer, Christoph Rhemann, and Carsten Rother. 2011. PatchMatch stereo-stereo matching with slanted support windows. In British Machine Vision Conference (BMVC’11), Vol. 11. 1–11.
2. Samuel Rota Bulo, Lorenzo Porzi, and Peter Kontschieder. 2018. In-place activated batchnorm for memory-optimized training of DNNs. In Conference on Computer Vision and Pattern Recognition (CVPR’18). 5639–5647.
3. Neill D. F. Campbell, George Vogiatzis, Carlos Hernández, and Roberto Cipolla. 2008. Using multiple hypotheses to improve depth-maps for multi-view stereo. In European Conference on Computer Vision (ECCV’08). 766–779.
4. Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, and Sergey Zagoruyko. 2020. End-to-end object detection with transformers. In European Conference on Computer Vision (ECCV’20). 213–229.
5. Hanting Chen, Yunhe Wang, Tianyu Guo, Chang Xu, Yiping Deng, Zhenhua Liu, Siwei Ma, Chunjing Xu, Chao Xu, and Wen Gao. 2021. Pre-trained image processing transformer. In Conference on Computer Vision and Pattern Recognition (CVPR’21). 12299–12310.