Affiliation:
1. Taras Shevchenko National University of Kyiv
2. Ivan Kozhedub Kharkiv National University of the Air Force
Abstract
The theoretical foundations of building decision-making systems based on the results of image recognition accompanied by texts are considered. The approximate structure of the image recognition system is given. The basis of image recognition systems is the selection of text inscriptions on existing photos, their pre-processing, selection of isolated areas on the image, performance of mathematical operations on individual groups of pixels to bring them to known forms and comparison with them. The description of various methods of image preprocessing is performed. An analysis of the feasibility of using such methods of image binarization as adaptive Bradley-Roth binarization, median filtering, Gaussian filtering, methods of balanced histograms and class variances, discriminant analysis, logistic, probit regression, etc. was carried out. Different algorithms for dividing the image into separate areas for the purpose of their further recognition are considered. among them the moving average algorithm, the algorithm for estimating the probability of finding an object in a selected area based on boundary analysis, Category-independent object proposals, Constrained Parametric Min-Cuts, Multiscale combinatorical grouping, Selective Search, etc. A comparison of different implementations of image processing algorithms to ensure effective recognition, classification and identification of images is performed. Improvement of individual implementations of image processing algorithms allows to reduce their processing time, which is important for working with large data sets. The main focus of the research is on choosing the most effective methods for recognizing inscriptions on images, improving the algorithms that implement them, with the aim of building recognition systems aimed at processing large data sets.
Publisher
Borys Grinchenko Kyiv Metropolitan University
Reference24 articles.
1. Ivanytska, A., Zubyk, L., Gololobov, D., Isaienkov, Y., Grynkevych, G., & Bychkov, O. (2023). The system for recognizing useful information of the client’s ID-card based on machine learning technologies. CEUR Workshop Proceedings, Vol. 3687, 115–120.
2. Mafi, M., Martin, H., Andrian, J., Barreto, A., Cabrerizo, M., & Adjouadi, M. (2019). A Comprehensive Survey on Impulse and Gaussian Denoising Filters for Digital Images. Signal Processing, 157, 236–260.
3. Mousavirad, S. J., & Ebrahimpour-Komleh, H. (2017). Multilevel image thresholding using entropy of histogram and recently developed population-based metaheuristic algorithms. Evolutionary Intelligence, 10(1–2), 45–75. https://doi.org/10.1007/s12065-017-0152-y
4. Ma, G., & Yue, X. (2022). An improved whale optimization algorithm based on multilevel threshold image segmentation using the Otsu method. Eng. Appl. Artif. Intell. 113. https://doi.org/10.1016/j. engappai.2022.104960
5. Abd Elaziz, M., & Lu, S. (2019). Many-objectives multilevel thresholding image segmentation using knee evolutionary algorithm. Expert Syst. Appl. 125, 305–316.