Photo2Video: Semantic-Aware Deep Learning-Based Video Generation from Still Content-Reference-Cited by-同舟云学术

Photo2Video: Semantic-Aware Deep Learning-Based Video Generation from Still Content

Published:2022-03-10 Issue:3 Volume:8 Page:68
ISSN:2313-433X
Container-title:Journal of Imaging
language:en
Short-container-title:J. Imaging

Author:

Viana Paula^ORCID,Andrade Maria Teresa,Carvalho Pedro^ORCID,Vilaça Luis^ORCID,Teixeira Inês N.,Costa Tiago^ORCID,Jonker Pieter

Abstract

Applying machine learning (ML), and especially deep learning, to understand visual content is becoming common practice in many application areas. However, little attention has been given to its use within the multimedia creative domain. It is true that ML is already popular for content creation, but the progress achieved so far addresses essentially textual content or the identification and selection of specific types of content. A wealth of possibilities are yet to be explored by bringing the use of ML into the multimedia creative process, allowing the knowledge inferred by the former to influence automatically how new multimedia content is created. The work presented in this article provides contributions in three distinct ways towards this goal: firstly, it proposes a methodology to re-train popular neural network models in identifying new thematic concepts in static visual content and attaching meaningful annotations to the detected regions of interest; secondly, it presents varied visual digital effects and corresponding tools that can be automatically called upon to apply such effects in a previously analyzed photo; thirdly, it defines a complete automated creative workflow, from the acquisition of a photograph and corresponding contextual data, through the ML region-based annotation, to the automatic application of digital effects and generation of a semantically aware multimedia story driven by the previously derived situational and visual contextual data. Additionally, it presents a variant of this automated workflow by offering to the user the possibility of manipulating the automatic annotations in an assisted manner. The final aim is to transform a static digital photo into a short video clip, taking into account the information acquired. The final result strongly contrasts with current standard approaches of creating random movements, by implementing an intelligent content- and context-aware video.

Funder

the European Commission

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Graphics and Computer-Aided Design,Computer Vision and Pattern Recognition,Radiology, Nuclear Medicine and imaging

Link

https://www.mdpi.com/2313-433X/8/3/68/pdf

Reference32 articles.

1. Online Video Editor: Smart Video Maker by Magistohttps://www.magisto.com/

2. Free Video Maker: Create Your Own Video Easilyhttps://www.animoto.com/

3. Create Imagery That Gets Noticedhttps://www.flixel.com/

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. DEEP WRAP-UP- Automatic Document Summarization with Animations;2022 6th International Conference on Electronics, Communication and Aerospace Technology;2022-12-01