Affiliation:
1. Faculty of Information Technology, Posts and Telecommunications Institute of Technology, Hanoi, Vietnam
Abstract
Visually rich documents, such as forms, invoices, receipts, and ID cards, are ubiquitous in daily business and life. These documents convey information through diverse channels, including text content, layout, font size, and text position, and combining these cues can improve information-extraction performance. However, previous works have not effectively exploited the interplay among these rich information sources: text detection and recognition have been performed without semantic supervision (e.g., entity name annotation), and text information extraction has been performed on serialized plain text alone, ignoring the rich visual information. This paper presents a method for extracting information from such documents that integrates textual features with non-spatial and spatial visual features. The method consists of two main steps and uses three deep neural networks. The first step, Text Reading, employs two CNN models (Lightweight DB and C-PREN) for OCR, built on the state-of-the-art models DB and PREN with two improvements: reducing noise by removing the SE block of DB, and integrating both context and position features in PREN. The second step, Text Information Extraction, uses a relational graph convolutional network (RGCN) for named entity recognition. Experiments on a self-collected dataset and two public datasets demonstrate that our method improves the performance of the original models and outperforms other state-of-the-art methods.
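To make the two-step design concrete, the following is a minimal sketch, not the authors' implementation: it assumes PyTorch, uses placeholder module names (TextReading, RGCNLayer) introduced here for illustration, and stands in for the Lightweight DB detector, the C-PREN recognizer, and the relational graph convolution used in the extraction step.

```python
# Minimal sketch (assumed interfaces, not the paper's code) of the two-step pipeline:
# Text Reading (detection + recognition) followed by graph-based extraction with an RGCN layer.
import torch
import torch.nn as nn


class TextReading(nn.Module):
    """Placeholder for the OCR stage (Lightweight DB detector + C-PREN recognizer)."""

    def __init__(self, detector: nn.Module, recognizer: nn.Module):
        super().__init__()
        self.detector = detector      # e.g., Lightweight DB: text-region detection
        self.recognizer = recognizer  # e.g., C-PREN: recognition with context + position features

    def forward(self, image):
        boxes = self.detector(image)           # text-line bounding boxes
        texts = self.recognizer(image, boxes)  # recognized strings / text embeddings
        return boxes, texts


class RGCNLayer(nn.Module):
    """Relational graph convolution: one weight matrix per edge relation type."""

    def __init__(self, in_dim: int, out_dim: int, num_relations: int):
        super().__init__()
        self.rel_weights = nn.Parameter(torch.randn(num_relations, in_dim, out_dim) * 0.01)
        self.self_weight = nn.Linear(in_dim, out_dim)

    def forward(self, node_feats, adj_per_relation):
        # node_feats: (N, in_dim) fused text/layout features of text boxes
        # adj_per_relation: (R, N, N) normalized adjacency, one matrix per relation
        out = self.self_weight(node_feats)
        for r, adj in enumerate(adj_per_relation):
            out = out + adj @ node_feats @ self.rel_weights[r]
        return torch.relu(out)


if __name__ == "__main__":
    # Toy usage: 5 text-box nodes, 32-dim fused features, 2 edge relations.
    layer = RGCNLayer(in_dim=32, out_dim=16, num_relations=2)
    feats = torch.randn(5, 32)
    adjs = torch.rand(2, 5, 5)
    print(layer(feats, adjs).shape)  # torch.Size([5, 16])
```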
Subject
Artificial Intelligence, General Engineering, Statistics and Probability