Affiliation:
1. Technion Israel Institute of Technology
2. ETH Zurich Zurich
3. UPMEM
4. Bar-Ilan University
Abstract
Abstract
Variant calling is a fundamental stage in genome analysis that identifies mutations (variations) in a sequenced genome relative to a known reference genome. Pair-HMM is a key part of the variant calling algorithm and its most compute-intensive part. In recent years, Processing-in-Memory (PiM) solutions, which consist of placing compute capabilities near/inside memory, have been proposed to speed up the genome analysis pipeline. We implement the Pair-HMM algorithm on a commercial PiM platform developed by UPMEM. We modify the Pair-HMM algorithm to make it more suitable for PiM execution with acceptable loss of accuracy. We evaluate our implementation on single chromosomes and whole genome sequencing datasets, demonstrating up to 2x speedup compared to existing CPU accelerations and up to 3x speedup compared to FPGA accelerations.
Publisher
Research Square Platform LLC
Reference60 articles.
1. Generations of sequencing technologies;Pettersson E;Genomics,2009
2. Next-generation sequencing;Reis-Filho JS;Breast Cancer Res,2009
3. DNA sequencing technologies: 2006–2016;Mardis ER;Nat Protoc,2017
4. Roodi M, Moshovos A. Gene sequencing: where time goes. In2018 IEEE International Symposium on Workload Characterization (IISWC) 2018 Sep 30 (pp. 84–85). IEEE.
5. Big data: astronomical or genomical?;Stephens ZD;PLoS Biol,2015