AUSPP: A universal short-read pre-processing package-Reference-Cited by-同舟云学术

AUSPP: A universal short-read pre-processing package

Published:2019-12 Issue:06 Volume:17 Page:1950037
ISSN:0219-7200
Container-title:Journal of Bioinformatics and Computational Biology
language:en
Short-container-title:J. Bioinform. Comput. Biol.

Author:

Gao Lei¹^ORCID,Wu Cong¹,Liu Lin¹

Affiliation:

1. The Key Laboratory of Plant Epigenetics of Guangdong Province, College of Life Sciences and Oceanography, Shenzhen University, Shenzhen, P. R. China

Abstract

There are many short-read aligners that can map short reads to a reference genome/sequence, and most of them can directly accept a FASTQ file as the input query file. However, the raw data usually need to be pre-processed. Few software programs specialize in pre-processing raw data generated by a variety of next-generation sequencing (NGS) technologies. Here, we present AUSPP, a Perl script-based pipeline for pre-processing and automatic mapping of NGS short reads. This pipeline encompasses quality control, adaptor trimming, collapsing of reads, structural RNA removal, length selection, read mapping, and normalized wiggle file creation. It facilitates the processing from raw data to genome mapping and is therefore a powerful tool for the steps before meta-analysis. Most importantly, since AUSPP has default processing pipeline settings for many types of NGS data, most of the time, users will simply need to provide the raw data and genome. AUSPP is portable and easy to install, and the source codes are freely available at https://github.com/highlei/AUSPP .

Funder

Natural Science Foundation of SZU

Guangdong Innovation Research Team Fund

Publisher

World Scientific Pub Co Pte Lt

Subject

Computer Science Applications,Molecular Biology,Biochemistry

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0219720019500379

Reference29 articles.

1. Profiling the HeLa S3 transcriptome using randomly primed cDNA and massively parallel short-read sequencing

2. High-Resolution Profiling of Histone Methylations in the Human Genome

3. Genome-Wide Mapping of in Vivo Protein-DNA Interactions

4. Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing

5. Genome-wide maps of chromatin state in pluripotent and lineage-committed cells

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Comprehensive bioinformatics analysis and molecular validation of lncRNAs-mediated ceRNAs network in schizophrenia;Life Sciences;2023-01

2. Integrated Analysis of Transcriptome and Small RNAome Reveals the Regulatory Network for Rapid Growth in Mikania micrantha;International Journal of Molecular Sciences;2022-09-13

3. TRANS-ACTING SIRNA3-derived short interfering RNAs confer cleavage of mRNAs in rice;Plant Physiology;2021-09-21