1. Alhashim, I., Wonka, P., 2018. High quality monocular depth estimation via transfer learning. arXiv preprint arXiv:1812.11941.
2. Ali, A., Touvron, H., Caron, M., Bojanowski, P., Douze, M., Joulin, A., Laptev, I., Neverova, N., Synnaeve, G., Verbeek, J., et al., 2021. XCiT: Cross-covariance image transformers. Adv. Neural Inf. Process. Syst. 34, 20014–20027.
3. Bae, J., Moon, S., Im, S., 2023. Deep digging into the generalization of self-supervised monocular depth estimation, in: Proc. AAAI Conf. Artif. Intell., pp. 187–196.
4. Bhat, S.F., Alhashim, I., Wonka, P., 2021. AdaBins: Depth estimation using adaptive bins, in: Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit., pp. 4009–4018.
5. Bian, J., Li, Z., Wang, N., Zhan, H., Shen, C., Cheng, M.M., Reid, I., 2019. Unsupervised scale-consistent depth and ego-motion learning from monocular video. Adv. Neural Inf. Process. Syst. 32.