A new similarity measure for subject hierarchical structures

Author:

Zhang Jin,Lei Zeng Marcia

Abstract

Purpose – The purpose of this paper is to introduce a new similarity method to gauge the differences between two subject hierarchical structures. Design/methodology/approach – In the proposed similarity measure, nodes on two hierarchical structures are projected onto a two-dimensional space, respectively, and both structural similarity and subject similarity of nodes are considered in the similarity between the two hierarchical structures. The extent to which the structural similarity impacts on the similarity can be controlled by adjusting a parameter. An experiment was conducted to evaluate soundness of the measure. Eight experts whose research interests were information retrieval and information organization participated in the study. Results from the new measure were compared with results from the experts. Findings – The evaluation shows strong correlations between the results from the new method and the results from the experts. It suggests that the similarity method achieved satisfactory results. Practical implications – Hierarchical structures that are found in subject directories, taxonomies, classification systems, and other classificatory structures play an extremely important role in information organization and information representation. Measuring the similarity between two subject hierarchical structures allows an accurate overarching understanding of the degree to which the two hierarchical structures are similar. Originality/value – Both structural similarity and subject similarity of nodes were considered in the proposed similarity method, and the extent to which the structural similarity impacts on the similarity can be adjusted. In addition, a new evaluation method for a hierarchical structure similarity was presented.

Publisher

Emerald

Subject

Library and Information Sciences,Information Systems

Reference39 articles.

1. Bille, P. (2005), “A survey on tree edit distance and related problems”, Theoretical Computer Science, Vol. 337 Nos 1-3, pp. 217-239.

2. Bunke, H. and Shearer, K. (1998), “A graph distance metric based on the maximal common subgraph”, Pattern Recognition Letters, Vol. 19 Nos 3-4, pp. 255-259.

3. Chen, S. , Zeng, M.L. and Chen, H. (2012), “Alignment of conceptual structures in controlled vocabularies in the domain of Chinese art – a discussion of issues and patterns”, in Neelameghan, A. and Raghavan, K.S. (Eds), Categories, Contexts and Relations in Knowledge Organization, Proceedings of the Twelfth International ISKO Conference, Mysore, August 6-9, Ergon-Verlag, Würzburg, pp. 249-255.

4. Dahchour, M. , Pirotte, A. and Zimányi, E. (2005), “Generic relationships in information modeling”, in Spaccapietra, S. (Ed.), Lecture Notes in Computer Science #3730: Journal on Data Semantics, Vol. 4, Springer-Verlag, Berlin, pp. 1-34.

5. Department Health and Human Services (2009), “HIPAA administrative simplification: modifications to medical data code set standards to adopt ICD-10-CM and ICD-10-PCS (RIN 0958-AN25)”, available at: www.gpo.gov/fdsys/pkg/FR-2009-01-16/pdf/E9-743.pdf (accessed December 4, 2012).

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3