Using contextual and lexical features to restructure and validate the classification of biomedical concepts-Reference-Cited by-同舟云学术

Using contextual and lexical features to restructure and validate the classification of biomedical concepts

Published:2007-07-24 Issue:1 Volume:8 Page:
ISSN:1471-2105
Container-title:BMC Bioinformatics
language:en
Short-container-title:BMC Bioinformatics

Author:

Fan Jung-Wei,Xu Hua,Friedman Carol

Abstract

Abstract Background Biomedical ontologies are critical for integration of data from diverse sources and for use by knowledge-based biomedical applications, especially natural language processing as well as associated mining and reasoning systems. The effectiveness of these systems is heavily dependent on the quality of the ontological terms and their classifications. To assist in developing and maintaining the ontologies objectively, we propose automatic approaches to classify and/or validate their semantic categories. In previous work, we developed an approach using contextual syntactic features obtained from a large domain corpus to reclassify and validate concepts of the Unified Medical Language System (UMLS), a comprehensive resource of biomedical terminology. In this paper, we introduce another classification approach based on words of the concept strings and compare it to the contextual syntactic approach. Results The string-based approach achieved an error rate of 0.143, with a mean reciprocal rank of 0.907. The context-based and string-based approaches were found to be complementary, and the error rate was reduced further by applying a linear combination of the two classifiers. The advantage of combining the two approaches was especially manifested on test data with sufficient contextual features, achieving the lowest error rate of 0.055 and a mean reciprocal rank of 0.969. Conclusion The lexical features provide another semantic dimension in addition to syntactic contextual features that support the classification of ontological concepts. The classification errors of each dimension can be further reduced through appropriate combination of the complementary classifiers.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Structural Biology

Link

https://link.springer.com/content/pdf/10.1186/1471-2105-8-264.pdf

Reference46 articles.

1. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000, 25 (1): 25-29. 10.1038/75556.

2. Rosse C, Mejino JL: A reference ontology for biomedical informatics: the Foundational Model of Anatomy. J Biomed Inform. 2003, 36 (6): 478-500. 10.1016/j.jbi.2003.11.007.

3. Lindberg DA, Humphreys BL, McCray AT: The Unified Medical Language System. Methods Inf Med. 1993, 32 (4): 281-291.

4. Campbell KE, Oliver DE, Shortliffe EH: The Unified Medical Language System: toward a collaborative approach for solving terminologic problems. J Am Med Inform Assoc. 1998, 5 (1): 12-16.

5. Yu AC: Methods in biomedical ontology. J Biomed Inform. 2006, 39 (3): 252-266. 10.1016/j.jbi.2005.11.006.

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Two complementary AI approaches for predicting UMLS semantic group assignment: heuristic reasoning and deep learning;Journal of the American Medical Informatics Association;2023-08-01

2. A review of auditing techniques for the Unified Medical Language System;Journal of the American Medical Informatics Association;2020-08-07

3. The Unified Medical Language System at 30 Years and How It Is Used and Published: Systematic Review and Content Analysis (Preprint);2020-05-25

4. Using data-driven sublanguage pattern mining to induce knowledge models: application in medical image reports knowledge representation;BMC Medical Informatics and Decision Making;2018-07-06

5. Constraints from protein structure and intra-molecular coevolution influence the fitness of HIV-1 recombinants;Virology;2014-04