Multimodal Data Mining in a Multimedia Database Based on Structured Max Margin Learning-Reference-Cited by-同舟云学术

Multimodal Data Mining in a Multimedia Database Based on Structured Max Margin Learning

Published:2016-02-24 Issue:3 Volume:10 Page:1-30
ISSN:1556-4681
Container-title:ACM Transactions on Knowledge Discovery from Data
language:en
Short-container-title:ACM Trans. Knowl. Discov. Data

Author:

Guo Zhen¹,Zhang Zhongfei (Mark)¹,Xing Eric P.²,Faloutsos Christos²

Affiliation:

1. SUNY Binghamton, NY

2. Carnegie Mellon University, Pittsburgh, PA

Abstract

Mining knowledge from a multimedia database has received increasing attentions recently since huge repositories are made available by the development of the Internet. In this article, we exploit the relations among different modalities in a multimedia database and present a framework for general multimodal data mining problem where image annotation and image retrieval are considered as the special cases. Specifically, the multimodal data mining problem can be formulated as a structured prediction problem where we learn the mapping from an input to the structured and interdependent output variables. In addition, in order to reduce the demanding computation, we propose a new max margin structure learning approach called Enhanced Max Margin Learning (EMML) framework, which is much more efficient with a much faster convergence rate than the existing max margin learning methods, as verified through empirical evaluations. Furthermore, we apply EMML framework to develop an effective and efficient solution to the multimodal data mining problem that is highly scalable in the sense that the query response time is independent of the database scale. The EMML framework allows an efficient multimodal data mining query in a very large scale multimedia database, and excels many existing multimodal data mining methods in the literature that do not scale up at all. The performance comparison with a state-of-the-art multimodal data mining method is reported for the real-world image databases.

Funder

National Basic Research Program of China

US NSF

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/2742549

Reference26 articles.

1. Modeling annotated data

2. Stephen Boyd and Lieven Vandenberghe. 2004. Convex Optimization. Cambridge University Press. Stephen Boyd and Lieven Vandenberghe. 2004. Convex Optimization. Cambridge University Press.

3. Semi-supervised learning for structured output variables

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. DESIGN OF EARLY WARNING SYSTEM FOR MENTAL HEALTH PROBLEMS BASED ON DATA MINING AND DATABASE;Revista Brasileira de Medicina do Esporte;2023

2. A Vertical Fragmentation Method for Multimedia Databases Considering Content-Based Queries;Handbook on Decision Making;2022-09-27

3. Data mining in college student education management information system;International Journal of Embedded Systems;2022

4. Design of Rock Climbing Data Acquisition System Based on LoRa;Cyber Security Intelligence and Analytics;2022

5. Sports Policy and Training Decision Support Method Based on Wireless Sensor Network;Wireless Communications and Mobile Computing;2021-10-14