Performance and accuracy analysis of semantic kernel functions
Authors: Manuja Manoj, Garg Deepak
Abstract
Purpose
– Syntax-based text classification (TC) mechanisms have largely been replaced by semantic-based systems in recent years. Semantic-based TC systems are particularly useful where similarity among documents is computed from the semantic relationships among their terms. Kernel functions have received major attention because of the popularity of support vector machines (SVMs) in the field of TC. Most kernel functions exploit syntactic structures of the text, but quite a few also use a priori semantic information for knowledge extraction. The purpose of this paper is to investigate semantic kernel functions in the context of TC.
Design/methodology/approach
– This work presents a performance and accuracy analysis of seven semantic kernel functions (Semantic Smoothing Kernel, Latent Semantic Kernel, Semantic WordNet-based Kernel, Semantic Smoothing Kernel having Implicit Superconcept Expansions, Compactness-based Disambiguation Kernel Function, Omiotis-based S-VSM semantic kernel function and Top-k S-VSM semantic kernel), each used as the kernel of an SVM. All seven semantic kernels are implemented in the SVM-Light tool.
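To make the idea of a semantic kernel concrete, the following is a minimal sketch of a semantic-smoothing-style kernel, k(d1, d2) = (d1 S) · (d2 S), where S is a term-proximity matrix. It is not the authors' implementation: the vocabulary, the proximity values in `S`, and the helper names (`bag_of_words`, `smooth`, `semantic_kernel`) are all assumptions chosen for illustration.

```python
# Toy sketch of a semantic smoothing kernel: k(d1, d2) = (d1 S) . (d2 S).
# The vocabulary, proximity values, and documents are invented for illustration.

VOCAB = ["car", "automobile", "banana"]

# Hypothetical term-proximity matrix S (rows and columns follow VOCAB).
# S[i][j] is an assumed semantic relatedness between term i and term j.
S = [
    [1.0, 0.9, 0.0],
    [0.9, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]


def bag_of_words(text):
    """Map a text to a term-frequency vector over VOCAB."""
    tokens = text.lower().split()
    return [float(tokens.count(term)) for term in VOCAB]


def smooth(vec):
    """Multiply the row vector `vec` by S, spreading weight to related terms."""
    n = len(VOCAB)
    return [sum(vec[i] * S[i][j] for i in range(n)) for j in range(n)]


def linear_kernel(d1, d2):
    """Plain (syntactic) bag-of-words inner product."""
    return sum(a * b for a, b in zip(d1, d2))


def semantic_kernel(d1, d2):
    """Inner product of the smoothed vectors, i.e. d1 S S^T d2^T."""
    return linear_kernel(smooth(d1), smooth(d2))


doc_a = bag_of_words("car")
doc_b = bag_of_words("automobile")
doc_c = bag_of_words("banana")

print(linear_kernel(doc_a, doc_b))    # no shared terms, so no syntactic overlap
print(semantic_kernel(doc_a, doc_b))  # related terms now contribute similarity
print(semantic_kernel(doc_a, doc_c))  # unrelated terms stay dissimilar
```

Because the kernel is an ordinary inner product taken after a linear map, it is positive semi-definite, so a Gram matrix built from it can in principle be supplied to any SVM implementation that accepts a user-defined or precomputed kernel.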
Findings
– Performance and accuracy parameters of the seven semantic kernel functions have been evaluated and compared. The experimental results show that the Top-k S-VSM semantic kernel has the highest performance and accuracy among all the evaluated kernel functions, which makes it a preferred building block for kernel methods for TC and retrieval.
Research limitations/implications
– Combining a semantic kernel function with a syntactic kernel function needs to be investigated, as there is scope for further improvement in accuracy and performance in all seven semantic kernel functions.
Practical implications
– This research provides insight into TC using a priori semantic knowledge. Three commonly used data sets are employed. It would be interesting to explore these kernel functions on live web data, which may test their actual utility in real business scenarios.
Originality/value
– Comparison of performance and accuracy parameters is the novel point of this research paper. To the best of the authors’ knowledge, this type of comparison has not been done previously.
Subject
Library and Information Sciences, Information Systems
Cited by 1 article.
1. The Research of Semantic Kernel in SVM for Chinese Text Classification;Proceedings of the 2nd International Conference on Intelligent Information Processing - IIP'17;2017