Application of Graph Structures in Computer Vision Tasks-Reference-Cited by-同舟云学术

Application of Graph Structures in Computer Vision Tasks

Published:2022-10-29 Issue:21 Volume:10 Page:4021
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Andriyanov Nikita^ORCID

Abstract

On the one hand, the solution of computer vision tasks is associated with the development of various kinds of images or random fields mathematical models, i.e., algorithms, that are called traditional image processing. On the other hand, nowadays, deep learning methods play an important role in image recognition tasks. Such methods are based on convolutional neural networks that perform many matrix multiplication operations with model parameters and local convolutions and pooling operations. However, the modern artificial neural network architectures, such as transformers, came to the field of machine vision from natural language processing. Image transformers operate with embeddings, in the form of mosaic blocks of picture and the links between them. However, the use of graph methods in the design of neural networks can also increase efficiency. In this case, the search for hyperparameters will also include an architectural solution, such as the number of hidden layers and the number of neurons for each layer. The article proposes to use graph structures to develop simple recognition networks on different datasets, including small unbalanced X-ray image datasets, widely known the CIFAR-10 dataset and the Kaggle competition Dogs vs Cats dataset. Graph methods are compared with various known architectures and with networks trained from scratch. In addition, an algorithm for representing an image in the form of graph lattice segments is implemented, for which an appropriate description is created, based on graph data structures. This description provides quite good accuracy and performance of recognition. The effectiveness of this approach based, on the descriptors of the resulting segments, is shown, as well as the graph methods for the architecture search.

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/10/21/4021/pdf

Reference42 articles.

1. Cessac, B. Retinal Processing: Insights from Mathematical Modelling. J. Imaging, 2022. 8.

2. Suryanarayana, G., Varadarajan, V., Pillutla, S.R., Nagajyothi, G., and Kotapati, G. Multiple Degradation Skilled Network for Infrared and Visible Image Fusion Based on Multi-Resolution SVD Updation. Mathematics, 2022. 10.

3. Schroder, M., Seidel, K., and Datcu, M. Gibbs random field models for image content characterization. Proceedings of the IGARSS’97. 1997 IEEE International Geoscience and Remote Sensing Symposium Proceedings. Remote Sensing—A Scientific Vision for Sustainable Development, Volume 1.

4. Optimal filtering of multidimensional random fields generated by autoregressions with multiple roots of characteristic equations;Andriyanov;CEUR Workshop Proc.,2019

5. Buslaev, A., Iglovikov, V.I., Khvedchenya, E., Parinov, A., Druzhinin, M., and Kalinin, A.A. Albumentations: Fast and Flexible Image Augmentations. Information, 2020. 11.

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Adaptive Convergent Visibility Graph Network: An interpretable method for intelligent rolling bearing diagnosis;Mechanical Systems and Signal Processing;2025-01

2. Multicriteria Assessment Method for Network Structure Congestion Based on Traffic Data Using Advanced Computer Vision;Mathematics;2024-02-12

3. Skew Class-Balanced Re-Weighting for Unbiased Scene Graph Generation;Machine Learning and Knowledge Extraction;2023-03-10

4. Im2Graph: A Weakly Supervised Approach for Generating Holistic Scene Graphs from Regional Dependencies;Future Internet;2023-02-10