1. TRIE: end-to-end text reading and information extraction for document understanding;Zhang,2020
2. Cross-modal deep networks for document image classification;Bakkali,2020
3. Visual and textual deep feature fusion for document image classification;Bakkali,2020
4. T. Dauphinee, N. Patel, M.M. Rashidi, Modular multimodal architecture for document classification, Arxiv abs/1912.04376(2019).
5. EAML: ensemble self-attention-based mutual learning network for document image classification;Bakkali;Int. J. Doc. Anal.Recognit. (IJDAR),2021