1. Word spotting and recognition with embedded attributes;Almazán Jon;IEEE Trans. Pattern Anal. Mach. Intell.,2014
2. Ali Furkan Biten, Ron Litman, Yusheng Xie, Srikar Appalaraju, and R. Manmatha. 2022. Latr: Layout-aware transformer for scene-text vqa. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’22). 16548–16558.
3. Ali Furkan Biten, Ruben Tito, Andres Mafla, Lluis Gomez, Marcal Rusinol, C. V. Jawahar, Ernest Valveny, and Dimosthenis Karatzas. 2019. Scene text visual question answering. In Proceedings of the International Conference on Computer Vision (ICCV’19). 4290–4300.
4. Enriching word vectors with subword information;Bojanowski Piotr;Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL’17),2017
5. Fedor Borisyuk, Albert Gordo, and Viswanath Sivakumar. 2018. Rosetta: Large scale system for text detection and recognition in images. In Proceedings of the ACM Knowledge Discovery and Data Mining (SIGKDD’18). 71–79.