Abstract
Writer recognition from a small amount of handwritten text is one of the most challenging deep learning problems because the characteristics of a handwriting style are implicit. Supervised writer recognition with deep convolutional neural networks has shown great success, but these methods typically require large amounts of annotated data, which is expensive to collect. Unsupervised writer recognition methods avoid much of the annotation cost, yet they often fail to capture sufficient feature relationships and usually underperform supervised methods. Self-supervised learning may close this gap by deriving the supervisory signal from the unlabeled data itself, so that a model can be trained on an unlabeled dataset in a supervised manner. This paper introduces Self-Writer, a self-supervised writer recognition approach for unlabeled data. The proposed scheme generates clusterable embeddings from a small fixed-length image frame, such as a text block. The training strategy assumes that a small image frame of handwritten text contains the writer’s handwriting characteristics. Based on this assumption, we construct pairwise constraints and apply nongenerative augmentation to train a Siamese architecture that generates the embeddings. Self-Writer is evaluated with pairwise and triplet architectures on the two most widely used datasets, IAM and CVL. We find that Self-Writer achieves satisfactory performance with pairwise architectures.
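The pairwise training idea can be illustrated with a minimal sketch. It is not the authors' implementation: the pairing rule (two augmented views of the same block form a positive pair, views of different blocks form a negative pair), the shift-and-noise augmentation, the standard margin-based contrastive loss, and the names `augment`, `make_pairs`, and `contrastive_loss` are all assumptions made for illustration. Blocks are small nested lists of pixel intensities in [0, 1].

```python
import math
import random


def augment(block, rng):
    """Stand-in for nongenerative augmentation (assumed, not from the paper):
    a small circular horizontal shift plus bounded pixel noise."""
    shift = rng.randint(0, 2)
    shifted = [row[shift:] + row[:shift] for row in block]
    return [
        [min(1.0, max(0.0, p + rng.uniform(-0.05, 0.05))) for p in row]
        for row in shifted
    ]


def make_pairs(blocks, rng):
    """Build (view_a, view_b, label) pairs without any writer labels.

    label 1: two augmented views of the same block.
    label 0: augmented views of two different blocks (assumed to differ in
    style; in practice two blocks may share a writer, which adds label noise).
    """
    pairs = []
    for i, block in enumerate(blocks):
        pairs.append((augment(block, rng), augment(block, rng), 1))
        j = rng.choice([k for k in range(len(blocks)) if k != i])
        pairs.append((augment(block, rng), augment(blocks[j], rng), 0))
    return pairs


def contrastive_loss(emb_a, emb_b, label, margin=1.0):
    """Margin-based contrastive loss on a pair of embeddings.

    Positive pairs are pulled together (squared distance); negative pairs
    are pushed apart until they are at least `margin` away.
    """
    d = math.dist(emb_a, emb_b)
    if label == 1:
        return d ** 2
    return max(0.0, margin - d) ** 2
```

In a Siamese setup, both views of a pair would pass through the same embedding network with shared weights, and a loss of this kind would be computed on the two resulting embeddings.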
Funder
Institutional Fund Projects
Ministry of Education and King AbdulAziz University, DSR, Jeddah, Saudi Arabia
Subject
General Mathematics, Engineering (miscellaneous), Computer Science (miscellaneous)