Affiliation:
1. Computer Science & Engineering Department, Wright State University, Dayton, OH 45435, USA
Abstract
The efficient processing, associating and understanding multimedia or multi-modal information is a very important research field with a great variety of applications, such as knowledge discovery, document understanding, human computer interaction, etc. An important issue is the development of a common platform for converting different modalities (such as images, text, etc.) into the same medium and associating them for efficient processing and understanding. Thus, this paper presents the development of a robust and efficient methodology capable of automatically converting images into natural language (NL) text sentences using image processing-analysis methods and graphs with attributes for object recognition and image understanding. Then it converts graph representations into NL text sentences. Simple illustrative examples are provided for proving the concept proposed here.
Publisher
World Scientific Pub Co Pte Lt
Subject
Artificial Intelligence,Computer Networks and Communications,Computer Science Applications,Linguistics and Language,Information Systems,Software
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献