Recognition of visual scene elements from a story text in Persian natural language-Reference-Cited by-同舟云学术

Recognition of visual scene elements from a story text in Persian natural language

Published:2022-08-24 Issue: Volume: Page:1-27
ISSN:1351-3249
Container-title:Natural Language Engineering
language:en
Short-container-title:Nat. Lang. Eng.

Author:

Hashemi-Namin Mojdeh,Jahed-Motlagh Mohammad Reza,Torkaman Rahmani Adel

Abstract

Abstract Text-to-scene conversion systems map natural language text to formal representations required for visual scenes. The difficulty involved in this mapping is one of the most critical challenges for developing these systems. The current study mapped Persian natural language text as the headmost system to a conceptual scene model. This conceptual scene model is an intermediate semantic representation between natural language and the visual scene and contains descriptions of visual elements of the scene. It will be used to produce meaningful animation based on an input story in this ongoing study. The mapping task was modeled as a sequential labeling problem, and a conditional random field (CRF) model was trained and tested for sequential labeling of scene model elements. To the best of the authors’ knowledge, no dataset for this task exists; thus, the required dataset was collected for this task. The lack of required off-the-shelf natural language processing modules and a significant error rate in the available corpora were important challenges to dataset collection. Some features of the dataset were manually annotated. The results were evaluated using standard text classification metrics, and an average accuracy of 85.7% was obtained, which is satisfactory.

Publisher

Cambridge University Press (CUP)

Subject

Artificial Intelligence,Linguistics and Language,Language and Linguistics,Software

Reference51 articles.

1. Learning Spatial Knowledge for Text to 3D Scene Generation

2. WordsEye

3. The CoNLL-2008 shared task on joint parsing of syntactic and semantic dependencies

4. A method for automatically creating 3D animated scenes from annotated fiction text;Glass;International Journal on Computer Science and Information System,2009

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Language transfer in L2 academic writings: a dependency grammar approach;Frontiers in Psychology;2024-05-09

2. Simulation Research on Large Language Model of Complex OCR Scene Based on Reinforcement Learning Algorithm Optimization;2023 International Conference on Internet of Things, Robotics and Distributed Computing (ICIRDC);2023-12-29