1. Baek, Y., Lee, B., Han, D., Yun, S., & Lee, H. (2019). Character region awareness for text detection. In CVPR (pp. 9365–9374).
2. Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., & Zagoruyko, S. (2020). End-to-end object detection with transformers. In ECCV (pp. 213–229).
3. Chen, T., Kornblith, S., Norouzi, M., & Hinton, G. (2020). A simple framework for contrastive learning of visual representations. In ICML (pp. 1597–1607). PMLR.
4. Cheng, Z., Lu, J., Niu, Y., Pu, S., Wu, F., & Zhou, S. (2019). You only recognize once: Towards fast video text spotting. In ACM MM (pp. 855–863).
5. Cheng, Z., Lu, J., Zou, B., Qiao, L., Xu, Y., Pu, S., Niu, Y., Wu, F., & Zhou, S. (2020). FREE: A fast and robust end-to-end video text spotter. TIP, 30, 822–837.