Improving text recognition by combining visual and linguistic features of text

Authors:

Tran Cong (1), Nguyen-Trong Khanh (1), Pham Cuong (1), Tran-Anh Dat (1), Nguyen-Thi-Tan Tien (2)

Affiliation:

1. Posts and Telecommunications Institute of Technology, Viet Nam

2. Thai Nguyen University of Medicine and Pharmacy, Viet Nam

Publisher

ACM


Cited by 2 articles.

1. Ensemble Learning for Vietnamese Scene Text Spotting in Urban Environments;2023 RIVF International Conference on Computing and Communication Technologies (RIVF);2023-12-23

2. Information extraction from Visually Rich Documents using graph convolutional network;Journal of Intelligent & Fuzzy Systems;2023-06-01
