Author:
Tang Mingwei,Chen Jiangping,Chen Haihua,Xu Zhenyuan,Wang Yueyao,Xie Mengting,Lin Jiangwei
Abstract
Purpose
The purpose of this paper is to provide an integrated semantic information retrieval (IR) solution based on an ontology-improved vector space model for situations where a digital collection is established or curated. It aims to create a retrieval approach which could return the results by meanings rather than by keywords.
Design/methodology/approach
In this paper, the authors propose a semantic term frequency algorithm to create a semantic vector space model (SeVSM) based on ontology. To support the calculation, a multi-branches tree model is created to represent the ontology and a set of algorithms is developed to operate it. Then, a semantic ontology-based IR system based on the SeVSM model is designed and developed to verify the effectiveness of the proposed model.
Findings
The experimental study using 30 queries from 15 different domains confirms the effectiveness of the SeVSM and the usability of the proposed system. The results demonstrate that the proposed model and system can be a significant exploration to enhance IR in specific domains, such as a digital library and e-commerce.
Originality/value
This research not only creates a semantic retrieval model, but also provides the application approach via designing and developing a semantic retrieval system based on the model. Comparing with most of the current related research, the proposed research studies the whole process of realizing a semantic retrieval.
Subject
Library and Information Sciences,Computer Science Applications
Reference49 articles.
1. LitVar: a semantic search engine for linking genomic variant data in PubMed and PMC;Nucleic Acids Research,2018
2. An information retrieval model based on latent semantic indexing with intelligent preprocessing;Journal of Information and Knowledge Management,2005
3. A new approach for query expansion using Wikipedia and WordNet;Information Sciences,2019
4. Latent Dirichlet allocation;Journal of Machine Learning Research,2003
5. Latent Dirichlet allocation complement in the vector space model for multi-label text classification;International Journal of Combinatorial Optimization Problems and Informatics,2015
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献