Abstract
Advances in technology have enabled sequencing machines to produce vast amounts of genetic data, increasing storage demands. Most genomic software uses read alignments for purposes including transcriptome assembly and gene count estimation. Here we present ABRIDGE, a state-of-the-art compressor for SAM alignment files that offers users both lossless and lossy compression options. This reference-based file compressor achieves the best compression ratio among all compression software, reducing storage requirements and speeding up file transmission. Central to the software is a novel algorithm that retains only non-redundant information. This approach has allowed ABRIDGE to achieve compression 16% higher than the second-best compressor for RNA-Seq reads and over 35% higher for DNA-Seq reads. ABRIDGE also lets users randomly access locations without decompressing the entire file. ABRIDGE is distributed under the MIT license and can be obtained from GitHub (https://github.com/sagnikbanerjee15/Abridge) and Docker Hub. We anticipate that the user community will adopt ABRIDGE into their existing pipelines, encouraging further research in this domain.
Publisher
Cold Spring Harbor Laboratory