Author:
Yousuf Ummar,Singh Ravinder Pal,Mehra Monika
Abstract
The contents of a picture are automatically created in Artificial Intelligence (AI), which combines computer vision and natural language processing (NLP) (Natural Language Processing). It is developed a regenerative neuronal model. Computer vision and machine translation are required. This model is used to produce natural-sounding phrases that describe the picture. Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN) are used in this model (RNN). The CNN is used to extract features from images, while the RNN is used to generate sentences. The model has been trained in such a manner that when an input image is provided to it, it creates captions that almost accurately describe the image. On various datasets, the model's accuracy, smoothness, and command of language learned from picture descriptions are assessed. These tests reveal that the model typically provides correct descriptions of the input image.
Publisher
Innovative Research Publication
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Remote Sensing Image Captioning using CNN and LSTM;2024 International Conference on Advancements in Power, Communication and Intelligent Systems (APCI);2024-06-21
2. Image Captioning Using Python;2023 International Conference on Power, Instrumentation, Energy and Control (PIECON);2023-02-10