Abstract
AbstractTens of thousands of gene expression data sets describing a variety of model organisms in a many different pathophysiological conditions are currently stored in publicly available databases such as Gene Expression Omnibus (GEO) and ArrayExpress. As microarray technology is giving way to RNA-Seq, it becomes strategic to develop high-level tools of analysis to preserve access to this huge amount of information through the most sophisticated methods of data preparation and processing developed over the years, while ensuring at the same time the reproducibility of the results. To meet this need, here we present bioTEA, a novel software tool that combines the ease of use with the versatility and power of an R/Bioconductor-based differential expression analysis, starting from raw data retrieval and preparation to gene annotation. BioTEA is an R-coded pipeline, wrapped in a Python-based command line interface and containerized with Docker technology. The user can choose among multiple options—including gene filtering, batch effect handling, sample pairing, statistical test type—to adapt the algorithm flow to the structure of the particular data set. All these options are saved in single text file which can be easily shared between different laboratories to deterministically reproduce the results. In addition, a detailed log file provides accurate information about each step of analysis. Overall, these features make bioTEA an invaluable tool for both bioiformaticians and wet-lab biologists interested in transcriptomics.
Publisher
Cold Spring Harbor Laboratory
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献