MIDV-2020: a comprehensive benchmark dataset for identity document analysis

Published: 2022-04 Issue: 2 Volume: 46
ISSN: 0134-2452
Container-title: Computer Optics

Author:

Bulatov K.B., Emelianova E.V., Tropin D.V., Skoryukina N.S., Chernyshova Y.S., Sheshkus A.V., Usilin S.A., Ming Z., Burie J.-C., Luqman M.M., Arlazarov V.V.

Abstract

Identity document recognition is an important sub-field of document analysis that deals with robust document detection, type identification, and text field recognition, as well as identity fraud prevention and document authenticity validation, given photos, scans, or video frames of a captured identity document. A significant amount of research has been published on this topic in recent years; however, a chief difficulty for such research is the scarcity of datasets, because the subject matter is protected by security requirements. The few datasets of identity documents that are available lack diversity of document types, capturing conditions, or variability of document field values. In this paper, we present the MIDV-2020 dataset, which consists of 1000 video clips, 2000 scanned images, and 1000 photos of 1000 unique mock identity documents, each with unique text field values and unique artificially generated faces, with rich annotation. The dataset contains 72409 annotated images in total, making it the largest publicly available identity document dataset as of the date of publication. We describe the structure of the dataset, its content and annotations, and present baseline experimental results to serve as a basis for future research. For the task of document location and identification, content-independent, feature-based, and semantic segmentation-based methods were evaluated. For the task of document text field recognition, the Tesseract system was evaluated on field and character levels with grouping by field alphabets and document types. For the task of face detection, the performance of a Multi-Task Cascaded Convolutional Neural Networks-based method was evaluated separately for different types of image input modes. The baseline evaluations show that existing methods of identity document analysis have substantial room for improvement given modern challenges.
We believe that the proposed dataset will prove invaluable for the advancement of the field of document analysis and recognition.

Funder

Russian Foundation for Basic Research

Publisher

Samara National Research University

Subject

Electrical and Electronic Engineering; Computer Science Applications; Atomic and Molecular Physics, and Optics

Link

https://computeroptics.ru/KO/PDF/KO46-2/460212.pdf

Reference: 67 articles.

1. Fang X, Fu X, Xu X. ID card identification system based on image recognition. 12th IEEE Conf on Industrial Electronics and Applications (ICIEA) 2017: 1488-1492. DOI: 10.1109/ICIEA.2017.8283074.

2. Attivissimo F, Giaquinto N, Scarpetta M, Spadavecchia M. An automatic reader of identity documents. IEEE Int Conf on Systems, Man and Cybernetics (SMC) 2019: 3525-3530. DOI: 10.1109/SMC.2019.8914438.

3. Kuklinski T, Monk B. The use of ID reader-authenticators in secure access control and credentialing. IEEE Conf on Technologies for Homeland Security 2008: 246-251. DOI: 10.1109/THS.2008.4534458.

4. Soares A, das Neves Junior R, Bezerra B. BID Dataset: a challenge dataset for document processing tasks. Anais Estendidos do XXXIII Conf on Graphics, Patterns and Images 2020: 143-146. DOI: 10.5753/sibgrapi.est.2020.12997.

5. Ghanmi N, Nabli C, Awal AM. CheckSim: A reference-based identity document verification by image similarity measure. In Book: Smith EHB, Pal U, eds. Document analysis and recognition – ICDAR 2021 Workshops. Springer Nature Switzerland AG; 2021: 422-436. DOI: 10.1007/978-3-030-86198-8_30.

Cited by 20 articles.

1. Verification of color characteristics of document images captured in uncontrolled conditions;COMPUT OPT;2024

2. Detection of fingers in document images captured in uncontrolled environment;Sixteenth International Conference on Machine Vision (ICMV 2023);2024-04-03

3. Multilanguage ID document images synthesis for testing recognition pipelines;Sixteenth International Conference on Machine Vision (ICMV 2023);2024-04-03

4. Fast keypoint filtering for feature-based identity documents classification on complex background;Sixteenth International Conference on Machine Vision (ICMV 2023);2024-04-03

5. Identifying fraudulent identity documents by analyzing imprinted guilloche patterns;Multimedia Tools and Applications;2024-03-04