Author:
Hur Junguk,Xiang Zuoshuang,Feldman Eva L,He Yongqun
Abstract
Abstract
Background
Vaccine literature indexing is poorly performed in PubMed due to limited hierarchy of Medical Subject Headings (MeSH) annotation in the vaccine field. Vaccine Ontology (VO) is a community-based biomedical ontology that represents various vaccines and their relations. SciMiner is an in-house literature mining system that supports literature indexing and gene name tagging. We hypothesize that application of VO in SciMiner will aid vaccine literature indexing and mining of vaccine-gene interaction networks. As a test case, we have examined vaccines for Brucella, the causative agent of brucellosis in humans and animals.
Results
The VO-based SciMiner (VO-SciMiner) was developed to incorporate a total of 67 Brucella vaccine terms. A set of rules for term expansion of VO terms were learned from training data, consisting of 90 biomedical articles related to Brucella vaccine terms. VO-SciMiner demonstrated high recall (91%) and precision (99%) from testing a separate set of 100 manually selected biomedical articles. VO-SciMiner indexing exhibited superior performance in retrieving Brucella vaccine-related papers over that obtained with MeSH-based PubMed literature search. For example, a VO-SciMiner search of "live attenuated Brucella vaccine" returned 922 hits as of April 20, 2011, while a PubMed search of the same query resulted in only 74 hits. Using the abstracts of 14,947 Brucella-related papers, VO-SciMiner identified 140 Brucella genes associated with Brucella vaccines. These genes included known protective antigens, virulence factors, and genes closely related to Brucella vaccines. These VO-interacting Brucella genes were significantly over-represented in biological functional categories, including metabolite transport and metabolism, replication and repair, cell wall biogenesis, intracellular trafficking and secretion, posttranslational modification, and chaperones. Furthermore, a comprehensive interaction network of Brucella vaccines and genes were identified. The asserted and inferred VO hierarchies provide semantic support for inferring novel knowledge of association of vaccines and genes from the retrieved data. New hypotheses were generated based on this analysis approach.
Conclusion
VO-SciMiner can be used to improve the efficiency for PubMed searching in the vaccine domain.
Publisher
Springer Science and Business Media LLC
Reference38 articles.
1. Almond JW: Vaccine renaissance. Nat Rev Microbiol. 2007, 5 (7): 478-481. 10.1038/nrmicro1702.
2. American-Diabetes-Association: Economic costs of diabetes in the U.S. In 2007. Diabetes Care. 2008, 31 (3): 596-615.
3. Bradac J, Dieffenbach CW: HIV vaccine development: Lessons from the past, informing the future. IDrugs. 2009, 12 (7): 435-439.
4. Perkins SD, Smither SJ, Atkins HS: Towards a Brucella vaccine for humans. FEMS Microbiol Rev. 2010
5. Disease NIoAaI: The Jordan Report, Accelerated Development of Vaccines 2007. 2007
Cited by
33 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献