Enhanced ontology-based indexing and searching

Author:

Thenmalar S.,Geetha T.V.

Abstract

Purpose – The purpose of this paper is to improve the conceptual-based search by incorporating structural ontological information such as concepts and relations. Generally, Semantic-based information retrieval aims to identify relevant information based on the meanings of the query terms or on the context of the terms and the performance of semantic information retrieval is carried out through standard measures-precision and recall. Higher precision leads to the (meaningful) relevant documents obtained and lower recall leads to the less coverage of the concepts. Design/methodology/approach – In this paper, the authors enhance the existing ontology-based indexing proposed by Kohler et al., by incorporating sibling information to the index. The index designed by Kohler et al., contains only super and sub-concepts from the ontology. In addition, in our approach, we focus on two tasks; query expansion and ranking of the expanded queries, to improve the efficiency of the ontology-based search. The aforementioned tasks make use of ontological concepts, and relations existing between those concepts so as to obtain semantically more relevant search results for a given query. Findings – The proposed ontology-based indexing technique is investigated by analysing the coverage of concepts that are being populated in the index. Here, we introduce a new measure called index enhancement measure, to estimate the coverage of ontological concepts being indexed. We have evaluated the ontology-based search for the tourism domain with the tourism documents and tourism-specific ontology. The comparison of search results based on the use of ontology “with and without query expansion” is examined to estimate the efficiency of the proposed query expansion task. The ranking is compared with the ORank system to evaluate the performance of our ontology-based search. From these analyses, the ontology-based search results shows better recall when compared to the other concept-based search systems. The mean average precision of the ontology-based search is found to be 0.79 and the recall is found to be 0.65, the ORank system has the mean average precision of 0.62 and the recall is found to be 0.51, while the concept-based search has the mean average precision of 0.56 and the recall is found to be 0.42. Practical implications – When the concept is not present in the domain-specific ontology, the concept cannot be indexed. When the given query term is not available in the ontology then the term-based results are retrieved. Originality/value – In addition to super and sub-concepts, we incorporate the concepts present in same level (siblings) to the ontological index. The structural information from the ontology is determined for the query expansion. The ranking of the documents depends on the type of the query (single concept query, multiple concept queries and concept with relation queries) and the ontological relations that exists in the query and the documents. With this ontological structural information, the search results showed us better coverage of concepts with respect to the query.

Publisher

Emerald

Subject

Library and Information Sciences,Information Systems

Reference38 articles.

1. Balaji, J. , Geetha, T.V. , Parthasarathi, R. and Karky, M. (2011), “Morpho-semantic features for rule-based Tamil enconversion”, International Journal of Computer Applications, Vol. 26 No. 6, pp. 11-18.

2. Baziz, M. , Boughanem, M. , Pasi, G. and Prade, H. (2007), “An information retrieval driven by ontology from query to document expansion”, in Evans, D. , Furui, S. and Soulé-Dupuy, C. (Eds), Large Scale Semantic Access to Content (Text, Image, Video, and Sound), RIAO ‘07 , le centre de hautes etudes internationales d’informatique documentaire, Paris, pp. 301-313.

3. Christophides, V. , Karvounarakis, G. , Plexousakis, D. , Scholl, M. and Tourtounis, S. (2003), “Optimizing taxonomic semantic web queries using labeling schemes”, Journal of Web Semantics, Vol. 1 No. 2, pp. 207-228.

4. Desmontils, E. , Jacquin, C. and Simon, L. (2003), “Ontology enrichment and indexing process”, Ingenierie des Connaissances, Research Report, No. 03.05, Institut de Recherche en Informatique de Nantes, Nantes.

5. Dragoni, M. , Pereira, C.D.C. and Tettamanzi, A.G.B. (2010), “An ontological representation of documents and queries for information retrieval systems”, IEA/AIE’10, Proceedings of the 23rd International Conference on Industrial Engineering and other Applications of Applied Intelligent Systems – Volume Part II, pp. 555-564.

Cited by 6 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3