ResMem-Net: memory based deep CNN for image memorability estimation-Reference-Cited by-同舟云学术

ResMem-Net: memory based deep CNN for image memorability estimation

Published:2021-11-05 Issue: Volume:7 Page:e767
ISSN:2376-5992
Container-title:PeerJ Computer Science
language:en
Short-container-title:

Author:

Praveen Arockia¹,Noorwali Abdulfattah²,Samiayya Duraimurugan³,Zubair Khan Mohammad⁴,Vincent P M Durai Raj⁵,Bashir Ali Kashif⁶,Alagupandi Vinoth³

Affiliation:

1. Phosphene AI, Madurai, India

2. Umm Al-Qura University, Makkah, Saudi Arabia

3. Optisol Business Solutions, Chennai, India

4. Department of Computer Science, Taibah University, Medina, Saudi Arabia

5. School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, Tamilnadu, India

6. The Manchester Metropolitan University, Manchester, United Kingdom

Abstract

Image memorability is a very hard problem in image processing due to its subjective nature. But due to the introduction of Deep Learning and the large availability of data and GPUs, great strides have been made in predicting the memorability of an image. In this paper, we propose a novel deep learning architecture called ResMem-Net that is a hybrid of LSTM and CNN that uses information from the hidden layers of the CNN to compute the memorability score of an image. The intermediate layers are important for predicting the output because they contain information about the intrinsic properties of the image. The proposed architecture automatically learns visual emotions and saliency, shown by the heatmaps generated using the GradRAM technique. We have also used the heatmaps and results to analyze and answer one of the most important questions in image memorability: “What makes an image memorable?”. The model is trained and evaluated using the publicly available Large-scale Image Memorability dataset (LaMem) from MIT. The results show that the model achieves a rank correlation of 0.679 and a mean squared error of 0.011, which is better than the current state-of-the-art models and is close to human consistency (p = 0.68). The proposed architecture also has a significantly low number of parameters compared to the state-of-the-art architecture, making it memory efficient and suitable for production.

Funder

Umm Al-Qura University

Publisher

PeerJ

Subject

General Computer Science

Link

https://peerj.com/articles/cs-767.pdf

Reference38 articles.

1. Attitudes from mere co-occurrences are guided by differentiation;Alves;Journal of Personality and Social Psychology,2020

2. The resiliency of image memorability: a predictor of memory separate from attention and priming;Bainbridge;Neuropsychologia,2020

3. Memorability: a stimulus-driven perceptual neural signature distinctive from memory;Bainbridge;NeuroImage,2017

4. Multiple instance learning based deep CNN for image memorability prediction;Basavaraju;Multimedia Tools and Applications,2019

5. Deep learning for image memorability prediction: the emotional bias;Baveye,2016

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Memorability shapes perceived time (and vice versa);Nature Human Behaviour;2024-04-22

2. Comprehensive Literature Survey on Deep Learning Used in Image Memorability Prediction and Modification;International Conference on Innovative Computing and Communications;2023-10-26

3. EDFA: Ensemble deep CNN for assessing student's cognitive state in adaptive online learning environments;International Journal of Cognitive Computing in Engineering;2023-06

4. VMemNet: A Deep Collaborative Spatial-Temporal Network With Attention Representation for Video Memorability Prediction;IEEE Transactions on Multimedia;2023