1. Matas J, Chum O, Urban M, Pajdla T (2004) Robust wide-baseline stereo from maximally stable extremal regions. Image Vis Comput 22(10):761–767
2. Anthimopoulos M, Gatos B, Pratikakis I (2013) Detection of artificial and scene text in images and video frames. Pattern Anal Appl 16(3):431–446
3. Wang T, Wu DJ, Coates A, Ng AY (2012) End-to-end text recognition with convolutional neural networks. In: Proceedings of 21st international conference on pattern recognition. IEEE, Tsukuba Science City, Japan, pp 3304–3308
4. Nagaoka Y, Miyazaki T, Sugaya Y, Omachi S (2017) Text detection by faster R-CNN with multiple region proposal networks. In: Proceedings of international conference on document analysis and recognition. IEEE, Kyoto, Japan, pp 15–20
5. Jaderberg M, Simonyan K, Vedaldi A, Zisserman A (2016) Reading text in the wild with convolutional neural networks. Int J Comput Vis 116(1):1–20