Enhanced ontology-based indexing and searching
Author:
Thenmalar S.,Geetha T.V.
Abstract
Purpose
– The purpose of this paper is to improve the conceptual-based search by incorporating structural ontological information such as concepts and relations. Generally, Semantic-based information retrieval aims to identify relevant information based on the meanings of the query terms or on the context of the terms and the performance of semantic information retrieval is carried out through standard measures-precision and recall. Higher precision leads to the (meaningful) relevant documents obtained and lower recall leads to the less coverage of the concepts.
Design/methodology/approach
– In this paper, the authors enhance the existing ontology-based indexing proposed by Kohler et al., by incorporating sibling information to the index. The index designed by Kohler et al., contains only super and sub-concepts from the ontology. In addition, in our approach, we focus on two tasks; query expansion and ranking of the expanded queries, to improve the efficiency of the ontology-based search. The aforementioned tasks make use of ontological concepts, and relations existing between those concepts so as to obtain semantically more relevant search results for a given query.
Findings
– The proposed ontology-based indexing technique is investigated by analysing the coverage of concepts that are being populated in the index. Here, we introduce a new measure called index enhancement measure, to estimate the coverage of ontological concepts being indexed. We have evaluated the ontology-based search for the tourism domain with the tourism documents and tourism-specific ontology. The comparison of search results based on the use of ontology “with and without query expansion” is examined to estimate the efficiency of the proposed query expansion task. The ranking is compared with the ORank system to evaluate the performance of our ontology-based search. From these analyses, the ontology-based search results shows better recall when compared to the other concept-based search systems. The mean average precision of the ontology-based search is found to be 0.79 and the recall is found to be 0.65, the ORank system has the mean average precision of 0.62 and the recall is found to be 0.51, while the concept-based search has the mean average precision of 0.56 and the recall is found to be 0.42.
Practical implications
– When the concept is not present in the domain-specific ontology, the concept cannot be indexed. When the given query term is not available in the ontology then the term-based results are retrieved.
Originality/value
– In addition to super and sub-concepts, we incorporate the concepts present in same level (siblings) to the ontological index. The structural information from the ontology is determined for the query expansion. The ranking of the documents depends on the type of the query (single concept query, multiple concept queries and concept with relation queries) and the ontological relations that exists in the query and the documents. With this ontological structural information, the search results showed us better coverage of concepts with respect to the query.
Subject
Library and Information Sciences,Information Systems
Reference38 articles.
1. Balaji, J.
,
Geetha, T.V.
,
Parthasarathi, R.
and
Karky, M.
(2011), “Morpho-semantic features for rule-based Tamil enconversion”, International Journal of Computer Applications, Vol. 26 No. 6, pp. 11-18. 2. Baziz, M.
,
Boughanem, M.
,
Pasi, G.
and
Prade, H.
(2007), “An information retrieval driven by ontology from query to document expansion”, in
Evans, D.
,
Furui, S.
and
Soulé-Dupuy, C.
(Eds),
Large Scale Semantic Access to Content (Text, Image, Video, and Sound), RIAO ‘07
, le centre de hautes etudes internationales d’informatique documentaire, Paris, pp. 301-313. 3. Christophides, V.
,
Karvounarakis, G.
,
Plexousakis, D.
,
Scholl, M.
and
Tourtounis, S.
(2003), “Optimizing taxonomic semantic web queries using labeling schemes”, Journal of Web Semantics, Vol. 1 No. 2, pp. 207-228. 4. Desmontils, E.
,
Jacquin, C.
and
Simon, L.
(2003), “Ontology enrichment and indexing process”, Ingenierie des Connaissances, Research Report, No. 03.05, Institut de Recherche en Informatique de Nantes, Nantes. 5. Dragoni, M.
,
Pereira, C.D.C.
and
Tettamanzi, A.G.B.
(2010), “An ontological representation of documents and queries for information retrieval systems”, IEA/AIE’10, Proceedings of the 23rd International Conference on Industrial Engineering and other Applications of Applied Intelligent Systems – Volume Part II, pp. 555-564.
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|