Author:
Barbosa-Silva Adriano,Soldatos Theodoros G,Magalhães Ivan LF,Pavlopoulos Georgios A,Fontaine Jean-Fred,Andrade-Navarro Miguel A,Schneider Reinhard,Ortega J Miguel
Abstract
Abstract
Background
Biological knowledge is represented in scientific literature that often describes the function of genes/proteins (bioentities) in terms of their interactions (biointeractions). Such bioentities are often related to biological concepts of interest that are specific of a determined research field. Therefore, the study of the current literature about a selected topic deposited in public databases, facilitates the generation of novel hypotheses associating a set of bioentities to a common context.
Results
We created a text mining system (LAITOR:
L
iterature
A
ssistant for
I
dentification of
T
erms co-
O
ccurrences and
R
elationships) that analyses co-occurrences of bioentities, biointeractions, and other biological terms in MEDLINE abstracts. The method accounts for the position of the co-occurring terms within sentences or abstracts. The system detected abstracts mentioning protein-protein interactions in a standard test (BioCreative II IAS test data) with a precision of 0.82-0.89 and a recall of 0.48-0.70. We illustrate the application of LAITOR to the detection of plant response genes in a dataset of 1000 abstracts relevant to the topic.
Conclusions
Text mining tools combining the extraction of interacting bioentities and biological concepts with network displays can be helpful in developing reasonable hypotheses in different scientific backgrounds.
Publisher
Springer Science and Business Media LLC
Subject
Applied Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Structural Biology
Reference40 articles.
1. Andrade MA, Bork P: Automated extraction of information in molecular biology. FEBS Lett 2000, 476: 12–17. 10.1016/S0014-5793(00)01661-6
2. Krallinger M, Valencia A: Text-mining and information-retrieval services for molecular biology. Genome Biol 2005, 6: 224. 10.1186/gb-2005-6-7-224
3. Kostoff RN, DeMarco RA: Extracting information from the literature by text mining. Anal Chem 2001, 73: 370A-378A. 10.1021/ac012472h
4. Blaschke C, Andrade MA, Ouzounis C, Valencia A: Automatic extraction of biological information from scientific text: protein-protein interactions. Proc Int Conf Intell Syst Mol Biol 1999: 60–67.
5. Natarajan J, Berrar D, Dubitzky W, Hack C, Zhang Y, DeSesa C, Van Brocklyn JR, Bremer EG: Text mining of full-text journal articles combined with gene expression analysis reveals a relationship between sphingosine-1-phosphate and invasiveness of a glioblastoma cell line. BMC Bioinformatics 2006, 7: 373. 10.1186/1471-2105-7-373
Cited by
22 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献