1. Attention is all you need;Vaswani,2017
2. Table structure recognition and form parsing by end-to-end object detection and relation parsing;Li;Pattern Recognit.,2022
3. PICK: Processing key information extraction from documents using improved graph learning-convolutional networks;Yu,2021
4. T.I. Denk, C. Reisswig, BERTgrid: Contextualized Embedding for 2D Document Representation and Understanding, in: Workshop on Document Intelligence At NeurIPS 2019, 2019.
5. ViBERTgrid: A jointly trained multi-modal 2D document representation for key information extraction from documents;Lin,2021