Pipeline for RNA sequencing data analysis by combination of Nextflow and R

Author:

Qu Jia-HuaORCID

Abstract

AbstractWith the development of high-throughput technologies, RNA sequencing (RNA-seq) has become a widely used technology in biological studies and thus a large number of RNA-seq data are emerging and remain to be analyzed. Although there are many different options for analysis methods and tools, a unified pipeline for RNA-seq data analysis is always necessary for a laboratory. Given the update of new methods and tools, I summarized my customized analysis codes to generate an updated pipeline for RNA-seq data analysis. During aging, gene mutations accumulate, and hormone regulation is disrupted, which may exacerbate age-related diseases. Therefore, we generated a dataset from mice with a gene mutation or not and under different hormone treatments to study the effects of two factors, i.e., hormone and gene mutation, on the transcriptome. Based on the Nextflow nf-core rnaseq pipeline, this project established this pipeline consisting of three stages: (1) upstream analysis containing quality control of fastq files before and after trimming, trimming, alignment, and quantification; (2) midstream analysis containing count normalization, differentially expressed genes analysis, and visualization via boxplot, PCA, t-SNE, sample distance heatmap, MA plot, volcano plot, and gene expression heatmap; and (3) downstream analysis containing functional enrichments of KEGG pathways and GO terms. Results showed distinct effects of the single factor as well as interactive effects of the two factors. Codes are also provided for readers who want to customize their analysis pipeline adapted from this pipeline easily.

Publisher

Cold Spring Harbor Laboratory

Reference19 articles.

1. Characterization of diverse populations of sinoatrial node cells and their proliferation potential at single nucleus resolution;Heliyon,2023

2. Transcriptome of left ventricle and sinoatrial node in young and old C57 mice;Fortune Journal of Health Sciences,2023

3. Qu, J.H. , et al., Proteomic Landscape and Deduced Functions of the Cardiac 14-3-3 Protein Interactome. Cells, 2022. 11(21).

4. Agrimi, J. , et al., Cardiac AC8 Over-Expression Increases Locomotion by Altering Heart-Brain Communication. JACC Clin Electrophysiol, 2023.

5. Nextflow enables reproducible computational workflows

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3