Affiliation:
1. Jadavpur University, India
2. Victoria University, Australia
Abstract
Semantic characterization is necessary for developing intelligent multimedia databases, because humans tend to search for media content based on their inherent semantics. However, automated inference of semantic concepts derived from media components stored in a database is still a challenge. The aim of this chapter is to demonstrate how layered architectures and “visual keywords” can be used to develop intelligent search systems for multimedia databases. The layered architecture is used to extract meta-data from multimedia components at various layers of abstractions. While the lower layers handle physical file attributes and low-level features, the upper layers handle high-level features and attempts to remove ambiguities inherent in them. To access the various abstracted features, a query schema is presented, which provides a single point of access while establishing hierarchical pathways between feature-classes. Minimization of the semantic gap is addressed using the concept of “visual keyword” (VK). “Visual keywords” are segmented portions of images with associated low- and high-level features, implemented within a semantic layer on top of the standard low-level features layer, for characterizing semantic content in media components. Semantic information is however predominantly expressed in textual form, and hence is susceptible to the limitations of textual descriptors – viz. ambiguities related to synonyms, homonyms, hypernyms, and hyponyms. To handle such ambiguities, this chapter proposes a domain specific ontology-based layer on top of the semantic layer, to increase the effectiveness of the search process.
Reference26 articles.
1. Belongie, S., Carson, C., Greenspan, H., & Malik, J. (1997). Recognition of images in large databases using a learning framework (Tech. Rep. No. CSD-97-939), University of California, Berkeley, CS Dept.
2. Bradshaw, B. (2000). Semantic based image retrieval: A probabilistic approach. Proceedings of ACM Conference on Multimedia, (pp. 167-176).
3. Mean shift: a robust approach toward feature space analysis
4. Adaptive pyramid and semantic graph – Knowledge driven segmentation;A.Deruyver;Lecture Notes in Computer Science, 3434,2005