Author:
Segal Evan,Spencer-Smith Jesse,C. Schmidt Douglas
Abstract
Although the field of computer vision has grown significantly due to the advent of convolutional neural networks (CNNs), electronic analysis of historical documents has experienced scant research and development attention. Recently, however, computer vision has matured to the point where it can be applied to outperform existing, specialized tools for document analysis. This paper demonstrates empirically how state-of-the-art results can be produced by implementing, training, and evaluating generic computer vision models on historical document segmentation tasks. We show the generality of our approach to document analysis and explain how innovation in this domain can arise from combining generic building blocks for computer vision.
Reference29 articles.
1. Guo Y, Ashour AS. 11—Neutrosophic sets in dermoscopic medical image segmentation. In: Guo Y, Ashour AS, editors. Neutrosophic Set in Medical Image Analysis. Amsterdam, Netherlands: Academic Press; 2019. pp. 229-243
2. Belongie SJ, Carson C, Greenspan H, Malik J. Color- and texture-based image segmentation using EM and its application to content-based image retrieval. In: Sixth International Conference on Computer Vision (IEEE Cat. No. 98CH36271). 1998. pp. 675-682. Available form: https://api.semanticscholar.org/CorpusID:756703
3. Pham DL, Xu C, Prince JL. Current methods in medical image segmentation. Annual Review of Biomedical Engineering. 2000;2:315-337
4. Gidaris S, Komodakis N. Object detection via a multi-region and semantic segmentation-aware CNN model; 2015
5. Nilsson M, Herlin AH, Ardö H, Guzhva O, Åström K, Bergsten C. Development of automatic surveillance of animal behaviour and welfare using image analysis and machine learned segmentation technique. Animal. 2015;9(11):1859-1865