1. Baek, J., Matsui, Y., Aizawa, K.: COO: comic onomatopoeia dataset for recognizing arbitrary or truncated texts. arXiv (2022). https://doi.org/10.48550/arXiv.2207.04675
2. Baek, Y., Lee, B., Han, D., Yun, S., Lee, H.: Character region awareness for text detection. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9357–9366. IEEE, Long Beach, CA, USA (2019). https://doi.org/10.1109/CVPR.2019.00959
3. Chen, J., et al.: MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning (2023)
4. Chen, T., Kornblith, S., Norouzi, M., Hinton, G.: A simple framework for contrastive learning of visual representations (2020)
5. Chiang, W.L., et al.: Vicuna: an open-source chatbot impressing GPT-4 with 90%* ChatGPT quality (2023). https://lmsys.org/blog/2023-03-30-vicuna