Document Image Extraction System Design

Author:

Widiastuti N I, Dewi K E

Abstract

The design of the document image extraction system aims to provide an overview of the process in a system that converts the image of a document into text, so that the content is easier to use (save, manage, or search for information) or to present in an interesting visualization. The system design is based on specific cases such as decision letters, certificates, and assignments. For each document, the data that might be needed is identified first; further processes are then carried out depending on the document. The design results show that the system flow runs from data input and scanning, through pre-processing of the document image, character classification, and normalization, to extraction. This flow is needed because the uniqueness of each document requires its own process, so documents cannot all be treated the same way. The design focuses mainly on character recognition, while the uniqueness of a document affects the extraction process. The system design therefore needs an additional step for selecting the type of document to be extracted.
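
The last two stages named in the abstract (normalization and type-dependent extraction) can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the scanning, pre-processing, and character classification stages have already produced a text string, and every function name, document-type key, and field pattern below is hypothetical.

```python
import re
from typing import Callable, Dict

# Hypothetical sketch: the abstract names the stages but gives no implementation.
# The input is assumed to be text already produced by character recognition.


def normalize(text: str) -> str:
    """Normalization stage: collapse whitespace runs left by recognition."""
    return re.sub(r"\s+", " ", text).strip()


def extract_decision_letter(text: str) -> Dict[str, str]:
    """Assumed rule: a decision letter carries a 'Number:' field."""
    match = re.search(r"Number:\s*(\S+)", text)
    return {"number": match.group(1)} if match else {}


def extract_certificate(text: str) -> Dict[str, str]:
    """Assumed rule: a certificate names its recipient after 'awarded to'."""
    match = re.search(r"awarded to\s+([A-Z][\w.]*(?: [A-Z][\w.]*)*)", text)
    return {"recipient": match.group(1)} if match else {}


# One extractor per document type, because each type needs its own process.
EXTRACTORS: Dict[str, Callable[[str], Dict[str, str]]] = {
    "decision_letter": extract_decision_letter,
    "certificate": extract_certificate,
}


def extract(doc_type: str, recognized_text: str) -> Dict[str, str]:
    """Select the extractor by document type, then normalize and extract."""
    if doc_type not in EXTRACTORS:
        raise ValueError(f"unsupported document type: {doc_type}")
    return EXTRACTORS[doc_type](normalize(recognized_text))
```

Passing the document type explicitly mirrors the abstract's conclusion that the user must select the document type before extraction, since no single rule fits every document.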

Publisher

IOP Publishing

Subject

General Medicine

