What and where: A context-based recommendation system for object insertion-Reference-Cited by-同舟云学术

What and where: A context-based recommendation system for object insertion

Published:2020-03 Issue:1 Volume:6 Page:79-93
ISSN:2096-0433
Container-title:Computational Visual Media
language:en
Short-container-title:Comp. Visual Media

Author:

Zhang Song-Hai,Zhou Zheng-Ping,Liu Bin,Dong Xi,Hall Peter

Abstract

AbstractWe propose a novel problem revolving around two tasks: (i) given a scene, recommend objects to insert, and (ii) given an object category, retrieve suitable background scenes. A bounding box for the inserted object is predicted in both tasks, which helps downstream applications such as semiautomated advertising and video composition. The major challenge lies in the fact that the target object is neither present nor localized in the input, and furthermore, available datasets only provide scenes with existing objects. To tackle this problem, we build an unsupervised algorithm based on object-level contexts, which explicitly models the joint probability distribution of object categories and bounding boxes using a Gaussian mixture model. Experiments on our own annotated test set demonstrate that our system outperforms existing baselines on all sub-tasks, and does so using a unified framework. Future extensions and applications are suggested.

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Computer Graphics and Computer-Aided Design,Computer Vision and Pattern Recognition

Link

http://link.springer.com/content/pdf/10.1007/s41095-020-0158-8.pdf

Reference36 articles.

1. Ricci, F.; Rokach, L.; Shapira, B. Recommender Systems Handbook. Boston: Springer, 2011.

2. Recommender system. Available at https://en.wikipedia.org/wiki/Recommender_system.

3. Johnson, J.; Krishna, R.; Stark, M.; Li, L. J.; Shamma, D. A.; Bernstein, M. S.; Fei-Fei, L. Image retrieval using scene graphs. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 3668–3678, 2015.

4. Wang, J.; Liu, W.; Kumar, S.; Chang, S. F. Learning to hash for indexing big data: A survey. Proceedings of the IEEE Vol. 104, No. 1, 34–57, 2016.

5. Zheng, L.; Yang, Y.; Tian, Q. SIFT meets CNN: A decade survey of instance retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 40, No. 5, 1224–1244, 2018.

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. SynFAGnet: A Fully Automated Generative Network for Realistic Fire Image Generation;Fire Technology;2024-02-03

2. Scene-aware Human Pose Generation using Transformer;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26

3. Deep Image Harmonization with Learnable Augmentation;2023 IEEE/CVF International Conference on Computer Vision (ICCV);2023-10-01

4. Automatic Shadow Generation via Exposure Fusion;IEEE Transactions on Multimedia;2023

5. SceneDirector: Interactive Scene Synthesis by Simultaneously Editing Multiple Objects in Real-Time;IEEE Transactions on Visualization and Computer Graphics;2023