Improving text recognition by combining visual and linguistic features of text-Reference-Cited by-同舟云学术

Improving text recognition by combining visual and linguistic features of text

Published:2022-12 Issue: Volume: Page:
ISSN:
Container-title:The 11th International Symposium on Information and Communication Technology
language:
Short-container-title:

Author:

Tran Cong¹^ORCID,Nguyen-Trong Khanh¹^ORCID,Pham Cuong¹^ORCID,Tran-Anh Dat¹^ORCID,Nguyen-Thi-Tan Tien²^ORCID

Affiliation:

1. Posts and Telecommunications Institute of Technology, Viet Nam

2. Thai Nguyen University of Medicine and Pharmacy, Viet Nam

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3568562.3568624

Reference29 articles.

1. Srikar Appalaraju , Bhavan Jasani , Bhargava Urala Kota , Yusheng Xie , and R. Manmatha . 2021 . DocFormer: End-to-End Transformer for Document Understanding. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV). 973–983 . https://doi.org/10.1109/ICCV48922. 2021 .00103 10.1109/ICCV48922.2021.00103 Srikar Appalaraju, Bhavan Jasani, Bhargava Urala Kota, Yusheng Xie, and R. Manmatha. 2021. DocFormer: End-to-End Transformer for Document Understanding. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV). 973–983. https://doi.org/10.1109/ICCV48922.2021.00103

2. Dzmitry Bahdanau , Kyung Hyun Cho , and Yoshua Bengio . 2015 . Neural machine translation by jointly learning to align and translate . 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings(2015) , 1–15. arxiv:1409.0473 Dzmitry Bahdanau, Kyung Hyun Cho, and Yoshua Bengio. 2015. Neural machine translation by jointly learning to align and translate. 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings(2015), 1–15. arxiv:1409.0473

3. Xiaoxue Chen Lianwen Jin Yuanzhi Zhu Canjie Luo and Tianwei Wang. 2020. Text Recognition in the Wild: A Survey. arxiv:2005.03492 [cs.CV] Xiaoxue Chen Lianwen Jin Yuanzhi Zhu Canjie Luo and Tianwei Wang. 2020. Text Recognition in the Wild: A Survey. arxiv:2005.03492 [cs.CV]

4. Mengmeng Cui Wei Wang Jinjin Zhang and Liang Wang. 2021. Representation and Correlation Enhanced Encoder-Decoder Framework for Scene Text Recognition. arxiv:2106.06960 [cs.CV] Mengmeng Cui Wei Wang Jinjin Zhang and Liang Wang. 2021. Representation and Correlation Enhanced Encoder-Decoder Framework for Scene Text Recognition. arxiv:2106.06960 [cs.CV]

5. Indoor navigation assistance system for visually impaired people using multimodal technologies

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Ensemble Learning for Vietnamese Scene Text Spotting in Urban Environments;2023 RIVF International Conference on Computing and Communication Technologies (RIVF);2023-12-23

2. Information extraction from Visually Rich Documents using graph convolutional network;Journal of Intelligent & Fuzzy Systems;2023-06-01