1. Semi-structured document image matching and recognition
2. Hangbo Bao , Li Dong , Furu Wei , Wenhui Wang , Nan Yang , Xiaodong Liu , Yu Wang , Jianfeng Gao , Songhao Piao , Ming Zhou , and Hsiao-Wuen Hon . 2020 . UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training . In Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13--18 July 2020, Virtual Event (Proceedings of Machine Learning Research , Vol. 119). PMLR, 642-- 652 . http://proceedings.mlr.press/v119/bao20a.html Hangbo Bao, Li Dong, Furu Wei, Wenhui Wang, Nan Yang, Xiaodong Liu, Yu Wang, Jianfeng Gao, Songhao Piao, Ming Zhou, and Hsiao-Wuen Hon. 2020. UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training. In Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13--18 July 2020, Virtual Event (Proceedings of Machine Learning Research, Vol. 119). PMLR, 642--652. http://proceedings.mlr.press/v119/bao20a.html
3. GMN: Generative Multi-modal Network for Practical Document Information Extraction
4. One-shot Text Field labeling using Attention and Belief Propagation for Structure Information Extraction
5. InfoXLM: An Information-Theoretic Framework for Cross-Lingual Language Model Pre-Training