Affiliation:
1. Department of Electronic Engineering and Information Science , University of Science and Technology of China , No. 443 Huangshan Rd, Hefei, Anhui Province, 230027 P. R. China
Abstract
Abstract
Text-based CAPTCHA is a convenient and effective safety mechanism that has been widely deployed across websites. The efficient end-to-end models of scene text recognition consisting of CNN and attention-based RNN show limited performance in solving text-based CAPTCHAs. In contrast with the street view image and document, the character sequence in CAPTCHA is non-semantic. The RNN loses its ability to learn the semantic context and only implicitly encodes the relative position of extracted features. Meanwhile, the security features, which prevent characters from segmentation and recognition, extensively increase the complexity of CAPTCHAs. The performance of this model is sensitive to different CAPTCHA schemes. In this paper, we analyze the properties of the text-based CAPTCHA and accordingly consider solving it as a highly position-relative character sequence recognition task. We propose a network named PosConv to leverage the position information in the character sequence without RNN. PosConv uses a novel padding strategy and modified convolution, explicitly encoding the relative position into the local features of characters. This mechanism of PosConv makes the extracted features from CAPTCHAs more informative and robust. We validate PosConv on six text-based CAPTCHA schemes, and it achieves state-of-the-art or competitive recognition accuracy with significantly fewer parameters and faster convergence speed.
Subject
Artificial Intelligence,Computer Vision and Pattern Recognition,Hardware and Architecture,Modeling and Simulation,Information Systems
Reference23 articles.
1. [1] Darko Brodić, Alessia Amelio, Nadeem Ahmad, and Syed Khuram Shahzad. Usability analysis of the image and interactive captcha via prediction of the response time. In International Workshop on Multi-disciplinary Trends in Artificial Intelligence, pages 252–265. Springer, 2017.10.1007/978-3-319-69456-6_21
2. [2] Elie Bursztein, Jonathan Aigrain, Angelika Moscicki, and John C Mitchell. The end is nigh: Generic solving of text-based captchas. In 8th {USENIX} Workshop on Offensive Technologies ({WOOT} 14), 2014.
3. [3] Elie Bursztein, Matthieu Martin, and John Mitchell. Text-based captcha strengths and weaknesses. In Proceedings of the 18th ACM conference on Computer and communications security, pages 125–138, 2011.10.1145/2046707.2046724
4. [4] Kumar Chellapilla, Kevin Larson, Patrice Y Simard, and Mary Czerwinski. Computers beat humans at single character recognition in reading based human interaction proofs (hips). In Conference on Email and Anti-Spam (CEAS), pages 1–8, 2005.10.1145/1054972.1055070
5. [5] Chen Duan, Rong Zhang, and Ke Qing. Feature refine network for text-based captcha recognition. In International Conference on Image and Graphics, pages 64–73. Springer, 2019.10.1007/978-3-030-34110-7_6
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献