Abstract
AbstractOver the last decade, the field of systems vaccinology has emerged, in which high throughput transcriptomics and other omics assays are used to probe changes of the innate and adaptive immune system in response to vaccination. RNA-Seq technology has matured in recent years and is now widely deployed for transcriptional analysis of clinical specimens. The goal of this study was to benchmark technical parameters of RNA-Seq in the context of a multi-site, double-blind randomized clinical trial using primary patient samples. We collected longitudinal peripheral blood mononuclear cell (PBMC) samples from 10 subjects after vaccination with a live attenuated Francisella tularensis vaccine and performed RNA-Seq at two different sites using aliquots from the same sample to generate two large-scale replicate datasets. We evaluated the impact of: (i) filtering lowly-expressed genes, (ii) using external RNA controls, (iii) fold change and false discovery rate (FDR) filtering, (iv) read length, and (v) sequencing depth on differential expressed genes (DEGs) concordance between replicate datasets. Using synthetic mRNA spike-ins, we developed a method for empirically establishing minimal read-count thresholds for maintaining fold change accuracy on a per-experiment basis. We defined a reference PBMC transcriptome by pooling sequence data, and established the impact of read depth and gene filtering on transcript representation. Lastly, we modeled statistical power to detect DE genes for a range of sample sizes, effect sizes, and coverage depths. The results from this study provide RNA sequencing benchmarks and guidelines for planning future similar vaccine studies.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献