MIDV-2020: a comprehensive benchmark dataset for identity document analysis

Author:

Bulatov K.B., ,Emelianova E.V.,Tropin D.V.,Skoryukina N.S.,Chernyshova Y.S.,Sheshkus A.V.,Usilin S.A.,Ming Z.,Burie J.-C.,Luqman M.M.,Arlazarov V.V., , , , , , , , , , , , , , , , , , ,

Abstract

Identity documents recognition is an important sub-field of document analysis, which deals with tasks of robust document detection, type identification, text fields recognition, as well as identity fraud prevention and document authenticity validation given photos, scans, or video frames of an identity document capture. Significant amount of research has been published on this topic in recent years, however a chief difficulty for such research is scarcity of datasets, due to the subject matter being protected by security requirements. A few datasets of identity documents which are available lack diversity of document types, capturing conditions, or variability of document field values. In this paper, we present a dataset MIDV-2020 which consists of 1000 video clips, 2000 scanned images, and 1000 photos of 1000 unique mock identity documents, each with unique text field values and unique artificially generated faces, with rich annotation. The dataset contains 72409 annotated images in total, making it the largest publicly available identity document dataset to the date of publication. We describe the structure of the dataset, its content and annotations, and present baseline experimental results to serve as a basis for future research. For the task of document location and identification content-independent, feature-based, and semantic segmentation-based methods were evaluated. For the task of document text field recognition, the Tesseract system was evaluated on field and character levels with grouping by field alphabets and document types. For the task of face detection, the performance of Multi Task Cascaded Convolutional Neural Networks-based method was evaluated separately for different types of image input modes. The baseline evaluations show that the existing methods of identity document analysis have a lot of room for improvement given modern challenges. We believe that the proposed dataset will prove invaluable for advancement of the field of document analysis and recognition.

Funder

Russian Foundation for Basic Research

Publisher

Samara National Research University

Subject

Electrical and Electronic Engineering,Computer Science Applications,Atomic and Molecular Physics, and Optics

Cited by 20 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Verification of color characteristics of document images captured in uncontrolled conditions;COMPUT OPT;2024

2. Detection of fingers in document images captured in uncontrolled environment;Sixteenth International Conference on Machine Vision (ICMV 2023);2024-04-03

3. Multilanguage ID document images synthesis for testing recognition pipelines;Sixteenth International Conference on Machine Vision (ICMV 2023);2024-04-03

4. Fast keypoint filtering for feature-based identity documents classification on complex background;Sixteenth International Conference on Machine Vision (ICMV 2023);2024-04-03

5. Identifying fraudulent identity documents by analyzing imprinted guilloche patterns;Multimedia Tools and Applications;2024-03-04

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3