Topic-Oriented Text Features Can Match Visual Deep Models of Video Memorability-Reference-Cited by-同舟云学术

Topic-Oriented Text Features Can Match Visual Deep Models of Video Memorability

Published:2021-08-12 Issue:16 Volume:11 Page:7406
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Kleinlein Ricardo^ORCID,Luna-Jiménez Cristina^ORCID,Arias-Cuadrado David,Ferreiros Javier^ORCID,Fernández-Martínez Fernando^ORCID

Abstract

Not every visual media production is equally retained in memory. Recent studies have shown that the elements of an image, as well as their mutual semantic dependencies, provide a strong clue as to whether a video clip will be recalled on a second viewing or not. We believe that short textual descriptions encapsulate most of these relationships among the elements of a video, and thus they represent a rich yet concise source of information to tackle the problem of media memorability prediction. In this paper, we deepen the study of short captions as a means to convey in natural language the visual semantics of a video. We propose to use vector embeddings from a pretrained SBERT topic detection model with no adaptation as input features to a linear regression model, showing that, from such a representation, simpler algorithms can outperform deep visual models. Our results suggest that text descriptions expressed in natural language might be effective in embodying the visual semantics required to model video memorability.

Funder

Ministerio de Ciencia, Innovación y Universidades

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/11/16/7406/pdf

Reference31 articles.

1. What Makes a Photograph Memorable?

2. Recognition memory for words, sentences, and pictures

3. Learning 10000 pictures

4. Understanding the Intrinsic Memorability of Images;Isola,2011

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Enhancing Video Memorability Prediction Through Contrastive Learning and Fine Tuning of Pre-Trained Video and Text Encoders;2024

2. Video Memorability Prediction From Jointly-learnt Semantic and Visual Features;20th International Conference on Content-based Multimedia Indexing;2023-09-20

3. Value Assessment of UGC Short Videos through Element Mining and Data Analysis;Applied Sciences;2023-08-19

4. Adaptive Multi-Modal Ensemble Network for Video Memorability Prediction;Applied Sciences;2022-08-27