Affiliation:
1. School of Cyberspace Security, Gansu University of Political Science and Law, Lanzhou, Gansu, China
Abstract
The writer identification task infers the writer by analyzing the texture, structure, and other representative features of the handwriting. Inspired by the attention mechanism, an end-to-end writer identification model is proposed in this paper, which combines both global features and local features. The Vision Transformer is used as the backbone network, and the Convolutional block attention module (CBAM) is introduced to enhance the ability of global feature awareness of the model. The proposed method is evaluated on two public data sets, IAM and CVL respectively. In the task of word-level writer identification, the accuracy rates in two data sets were 90.1% and 92.3% respectively. In the task of page-level writer identification, the accuracy rates were 98.6% and 99.5%, as a state-of-the-art performance.
Subject
Artificial Intelligence,General Engineering,Statistics and Probability