pysradb: A Python package to query next-generation sequencing metadata and data from NCBI Sequence Read Archive-Reference-Cited by-同舟云学术

pysradb: A Python package to query next-generation sequencing metadata and data from NCBI Sequence Read Archive

Published:2019-04-23 Issue: Volume:8 Page:532
ISSN:2046-1402
Container-title:F1000Research
language:en
Short-container-title:F1000Res

Author:

Choudhary Saket^ORCID

Abstract

The NCBI Sequence Read Archive (SRA) is the primary archive of next-generation sequencing datasets. SRA makes metadata and raw sequencing data available to the research community to encourage reproducibility and to provide avenues for testing novel hypotheses on publicly available data. However, methods to programmatically access this data are limited. We introduce the Python package, pysradb, which provides a collection of command line methods to query and download metadata and data from SRA, utilizing the curated metadata database available through the SRAdb project. We demonstrate the utility of pysradb on multiple use cases for searching and downloading SRA datasets. It is available freely at https://github.com/saketkc/pysradb.

Publisher

F1000 Research Ltd

Subject

General Pharmacology, Toxicology and Pharmaceutics,General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology,General Medicine

Link

https://f1000research.com/articles/8-532/v1/pdf

Reference22 articles.

1. A systematic survey of loss-of-function variants in human protein-coding genes.;D MacArthur;Science.,2012

2. Massive mining of publicly available RNA-seq data from human and mouse.;A Lachmann;Nat Commun.,2018

3. Reproducible RNA-seq analysis using recount2.;L Collado-Torres;Nat Biotechnol.,2017

4. The sequence read archive.;R Leinonen;Nucleic Acids Res.,2011

5. Sra toolkit,2018

Cited by 37 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Diversity and selection analyses identify transmission-blocking antigens as the optimal vaccine candidates in Plasmodium falciparum;eBioMedicine;2024-08

2. CytoCellDB: a comprehensive resource for exploring extrachromosomal DNA in cancer cell lines;NAR Cancer;2024-07-09

3. Global insights into endophytic bacterial communities of terrestrial plants: Exploring the potential applications of endophytic microbiota in sustainable agriculture;Science of The Total Environment;2024-06

4. iSeq: An integrated tool to fetch public sequencing data;2024-05-20

5. Diversity and selection analyses identify transmission-blocking antigens as the optimal vaccine candidates inPlasmodium falciparum;2024-05-12