Exploring relation types for literature-based discovery

Published:2015-05-12 Issue:5 Volume:22 Page:987-992
ISSN:1527-974X
Container-title:Journal of the American Medical Informatics Association
language:en

Author:

Preiss Judita¹, Stevenson Mark¹, Gaizauskas Robert¹

Affiliation:

1. Department of Computer Science, The University of Sheffield, 211 Portobello, Sheffield S1 4DP, UK

Abstract

Objective: Literature-based discovery (LBD) aims to identify “hidden knowledge” in the medical literature by: (1) analyzing documents to identify pairs of explicitly related concepts (terms), then (2) hypothesizing novel relations between pairs of unrelated concepts that are implicitly related via a shared concept to which both are explicitly related. Many LBD approaches use simple techniques to identify semantically weak relations between concepts, for example, document co-occurrence. These generate huge numbers of hypotheses, difficult for humans to assess. More complex techniques rely on linguistic analysis, for example, shallow parsing, to identify semantically stronger relations. Such approaches generate fewer hypotheses, but may miss hidden knowledge. The authors investigate this trade-off in detail, comparing techniques for identifying related concepts to discover which are most suitable for LBD.

Materials and methods: A generic LBD system that can utilize a range of relation types was developed. Experiments were carried out comparing a number of techniques for identifying relations. Two approaches were used for evaluation: replication of existing discoveries and the “time slicing” approach [1].

Results: Previous LBD discoveries could be replicated using relations based either on document co-occurrence or linguistic analysis. Using relations based on linguistic analysis generated many fewer hypotheses, but a significantly greater proportion of them were candidates for hidden knowledge.

Discussion and Conclusion: The use of linguistic analysis-based relations improves accuracy of LBD without overly damaging coverage. LBD systems often generate huge numbers of hypotheses, which are infeasible to manually review. Improving their accuracy has the potential to make these systems significantly more usable.
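The two-step procedure described in the Objective can be illustrated with a minimal Python sketch. This is not the authors' system: it assumes the weakest relation type mentioned in the abstract (document co-occurrence), the function names `build_relations` and `hypothesize` are hypothetical, and the five toy "documents" are invented for illustration only.

```python
from collections import defaultdict
from itertools import combinations


def build_relations(documents):
    """Step 1 (assumed relation type: document co-occurrence).

    Two concepts count as explicitly related if they appear
    together in at least one document.
    """
    related = defaultdict(set)
    for concepts in documents:
        for a, b in combinations(sorted(set(concepts)), 2):
            related[a].add(b)
            related[b].add(a)
    return related


def hypothesize(related, source):
    """Step 2: propose concepts C that are not explicitly related to
    the source A but share at least one linking concept B with it.

    Returns a dict mapping each candidate C to its set of linking Bs.
    """
    direct = related[source]
    hypotheses = defaultdict(set)
    for b in direct:
        for c in related[b]:
            if c != source and c not in direct:
                hypotheses[c].add(b)
    return dict(hypotheses)


# Toy data, invented for illustration; each set stands for the
# concepts found in one document.
documents = [
    {"fish oil", "blood viscosity"},
    {"blood viscosity", "Raynaud's syndrome"},
    {"fish oil", "platelet aggregation"},
    {"platelet aggregation", "Raynaud's syndrome"},
    {"migraine", "magnesium"},
]

related = build_relations(documents)
result = hypothesize(related, "fish oil")
print(result)
```

On this toy input the only hypothesis for "fish oil" is "Raynaud's syndrome", reached through both linking concepts. With real co-occurrence data the candidate set grows very quickly, which is the trade-off the abstract describes; a stricter relation type would replace only `build_relations`.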

Publisher

Oxford University Press (OUP)

Subject

Health Informatics

Link

http://academic.oup.com/jamia/article-pdf/22/5/987/34145915/ocv002.pdf

References: 32 articles.

1. A new evaluation methodology for literature-based discovery;Yetisgen-Yildiz;J Biomed Inform.,2009

2. Using concepts in literature-based discovery: simulating Swanson's Raynaud - fish oil and migraine - magnesium discoveries;Weeber;J Am Soc Inform Sci Technol.,2001

3. Using literature-based discovery to identify novel therapeutic approaches;Hristovski;Cardiovasc Hematol Agents Med Chem.,2013

4. Fish oil, Raynaud's syndrome, and undiscovered public knowledge;Swanson;Perspect Biol Med.,1986

5. Exploiting semantic relations for literature-based discovery;Hristovski

Cited by 26 articles.

1. Online Unstructured Data Analysis Models with KoBERT and Word2vec: A Study on Sentiment Analysis of Public Opinion in Korean;International Journal of Fuzzy Logic and Intelligent Systems;2023-09-30

2. Review of Natural Language Processing in Pharmacology;Pharmacological Reviews;2023-03-17

3. Avoiding background knowledge: literature based discovery from important information;BMC Bioinformatics;2023-03-14

4. Predicting the impact of online news articles – is information necessary?;Multimedia Tools and Applications;2022-01-08

5. DD-RDL: Drug-Disease Relation Discovery and Labeling;Communications in Computer and Information Science;2022