OmixLitMiner: A Bioinformatics Tool for Prioritizing Biological Leads from ‘Omics Data Using Literature Retrieval and Data Mining-Reference-Cited by-同舟云学术

OmixLitMiner: A Bioinformatics Tool for Prioritizing Biological Leads from ‘Omics Data Using Literature Retrieval and Data Mining

Published:2020-02-19 Issue:4 Volume:21 Page:1374
ISSN:1422-0067
Container-title:International Journal of Molecular Sciences
language:en
Short-container-title:IJMS

Author:

Steffen Pascal^ORCID,Wu Jemma,Hariharan Shubhang,Voss Hannah,Raghunath Vijay,Molloy Mark P.,Schlüter Hartmut^ORCID

Abstract

Proteomics and genomics discovery experiments generate increasingly large result tables, necessitating more researcher time to convert the biological data into new knowledge. Literature review is an important step in this process and can be tedious for large scale experiments. An informed and strategic decision about which biomolecule targets should be pursued for follow-up experiments thus remains a considerable challenge. To streamline and formalise this process of literature retrieval and analysis of discovery based ‘omics data and as a decision-facilitating support tool for follow-up experiments we present OmixLitMiner, a package written in the computational language R. The tool automates the retrieval of literature from PubMed based on UniProt protein identifiers, gene names and their synonyms, combined with user defined contextual keyword search (i.e., gene ontology based). The search strategy is programmed to allow either strict or more lenient literature retrieval and the outputs are assigned to three categories describing how well characterized a regulated gene or protein is. The category helps to meet a decision, regarding which gene/protein follow-up experiments may be performed for gaining new knowledge and to exclude following already known biomarkers. We demonstrate the tool’s usefulness in this retrospective study assessing three cancer proteomics and one cancer genomics publication. Using the tool, we were able to corroborate most of the decisions in these papers as well as detect additional biomolecule leads that may be valuable for future research.

Publisher

MDPI AG

Subject

Inorganic Chemistry,Organic Chemistry,Physical and Theoretical Chemistry,Computer Science Applications,Spectroscopy,Molecular Biology,General Medicine,Catalysis

Link

https://www.mdpi.com/1422-0067/21/4/1374/pdf

Reference18 articles.

1. Text Mining in Genomics and Proteomics

2. A Review of Recent Advancement in Integrating Omics Data with Literature Mining towards Biomedical Discoveries

3. MineBlast: a literature presentation service supporting protein annotation by data mining of BLAST results

4. PPInterFinder—a mining tool for extracting causal relations on human proteins from literature

5. A genome-wide MeSH-based literature mining system predicts implicit gene-to-gene relationships and networks

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Co-activation of LIN28A and CTNNB1 disturbs cortical neuronal migration and pia mater integrity;2024-08-02

2. Fast retrieval method of biomedical literature based on feature mining;International Journal of Data Mining and Bioinformatics;2023

3. Research on cloud storage biological data deduplication method based on Simhash algorithm;International Journal of Data Mining and Bioinformatics;2023

4. An Evaluation Model for the Influence Factors of Interest in Literature Courses Based on Data Analysis and Association Rules in a Small-Sample Environment;Journal of Environmental and Public Health;2022-09-09

5. An Analysis of the Interaction between Ancient Literature Informatization Project and Classical Literature Research Based on Intelligent Computing;Mathematical Problems in Engineering;2022-07-30