Affiliation:
1. The University of Queensland, St. Lucia, QLD
2. Carnegie Mellon University, Pittsburgh, PA
Abstract
Recent years have witnessed an explosion of user-generated videos on the Web. To enable effective and efficient video search, modern video search engines must associate videos with semantic keywords automatically. Most existing video tagging methods can hardly achieve reliable performance due to insufficient training data. Meanwhile, abundant well-tagged data are available in other relevant types of media (e.g., images). In this article, we propose a novel video tagging framework, termed Cross-Media Tag Transfer (CMTT), which leverages abundant well-tagged images to facilitate video tagging. Specifically, we build a "cross-media tunnel" to transfer knowledge from images to videos. To this end, we learn an optimal kernel space, in which the distribution distance between images and videos is minimized, to tackle the domain-shift problem. We further propose a novel cross-media video tagging model that infers tags by exploring the intrinsic local structures of both labeled and unlabeled data, and learns reliable video classifiers. An efficient algorithm is designed to optimize the proposed model in an iterative, alternating fashion. Extensive experiments demonstrate the superiority of our proposal over state-of-the-art algorithms.
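The abstract does not spell out how the distribution distance between the two media is measured. As a hedged illustration of the distribution-matching idea (not the paper's actual formulation), the sketch below computes a biased estimate of the squared maximum mean discrepancy (MMD) between image and video features under an RBF kernel; the bandwidth `sigma`, feature dimensionality, and sample counts are all assumptions for the toy example.

```python
import numpy as np

def rbf_kernel(A, B, sigma=1.0):
    """RBF kernel matrix between rows of A and rows of B."""
    sq = (A**2).sum(1)[:, None] + (B**2).sum(1)[None, :] - 2 * A @ B.T
    return np.exp(-sq / (2 * sigma**2))

def mmd2(X_img, X_vid, sigma=1.0):
    """Biased estimate of squared MMD between image and video feature sets."""
    Kxx = rbf_kernel(X_img, X_img, sigma)
    Kyy = rbf_kernel(X_vid, X_vid, sigma)
    Kxy = rbf_kernel(X_img, X_vid, sigma)
    return Kxx.mean() + Kyy.mean() - 2 * Kxy.mean()

# Toy usage: 200 image features and 150 video features in a shared 64-d space.
rng = np.random.default_rng(0)
X_img = rng.normal(0.0, 1.0, (200, 64))  # source domain (images)
X_vid = rng.normal(0.5, 1.0, (150, 64))  # target domain (videos), shifted mean
print(mmd2(X_img, X_vid, sigma=2.0))     # larger value = larger domain gap
```

Choosing kernel parameters so that this discrepancy is small is one standard way to align a source and a target domain before transferring labels.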
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications, Hardware and Architecture
Cited by
54 articles.