A NEURAL-BASED PAGE SEGMENTATION SYSTEM-Reference-Cited by-同舟云学术

A NEURAL-BASED PAGE SEGMENTATION SYSTEM

Published:2005-02 Issue:01 Volume:14 Page:109-122
ISSN:0218-1266
Container-title:Journal of Circuits, Systems and Computers
language:en
Short-container-title:J CIRCUIT SYST COMP

Author:

ALGINAHI Y.¹,FEKRI D.¹,SID-AHMED M. A.¹

Affiliation:

1. Department of Electrical and Computer Engineering, University of Windsor, 401 Sunset Ave, Windsor, Ontario, N9B 3P4, Canada

Abstract

Page segmentation is necessary for optical character recognition and very useful in document image manipulation. This paper describes two classification methods, a modified linear adaptive method and a proposed neural network system that classifies an image into text, halftone image (photos, dark images, etc.), and graphics (graphs, tables, flowcharts, etc.). The blocks were segmented using the Run Length Smearing Algorithm. The smearing process was done automatically by fixing the threshold values for smearing. Features are extracted from the segmented blocks for classification into text, graphics, and halftone images. The second method uses a multi-layer perceptron neural network for classification. Two parameters, a shape factor, f1, and an angle from the rectangular block segments, were fed into the neural network system giving us three classes: text, halftone images, and graphics. Experiments on 30 mixed-content document images show that the method works well on a wide variety of layouts in document images.

Publisher

World Scientific Pub Co Pte Lt

Subject

Electrical and Electronic Engineering,Hardware and Architecture,Electrical and Electronic Engineering,Hardware and Architecture

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218126605002192

Reference13 articles.

1. Block segmentation and text extraction in mixed text/image documents

2. A new distance mapping and its use for shape measurement on binary patterns

3. The document spectrum for page layout analysis

4. Automated page orientation and skew angle detection for binary document images

5. A Document Skew Detection Method Using the Hough Transform

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Applicability of OCR Engines for Text Recognition in Vehicle Number Plates, Receipts and Handwriting;Journal of Circuits, Systems and Computers;2023-11-24

2. Parameter free approach for segmenting complex manhattan layouts;Multimedia Tools and Applications;2022-08-08

3. Extending Page Segmentation Algorithms for Mixed-Layout Document Processing;2011 International Conference on Document Analysis and Recognition;2011-09