An FPGA-based hardware accelerator supporting sensitive sequence homology filtering with profile hidden Markov models

Author:

Anderson Tim,Wheeler Travis J.ORCID

Abstract

AbstractBackgroundSequence alignment lies at the heart of genome sequence annotation. While the BLAST suite of alignment tools has long held an important role in alignment-based sequence database search, greater sensitivity is achieved through the use of profile hidden Markov models (pHMMs). The Forward algorithm that provides much of pHMMs’ sensitivity is relatively slow, motivating extensive efforts to increase speed. Numerous researchers have devised methods to improve pHMM alignment speed using hardware accelerators such as graphics processing units (GPUs) and field programmable gate arrays (FPGAs). Here, we describe an FPGA hardware accelerator for a key bottleneck step in the analysis pipeline employed by the popular pHMM aligment tool, HMMER.HMMER accelerates pHMM Forward alignment by screening most sequence with a series of filters that rapidly approximate the result of computing full Forward alignment. The first of these filters, the Single Segment ungapped Viterbi (SSV) algorithm, is designed to filter out 98% of non-related inputs and accounts for 70% of the overall runtime of the DNA search toolnhmmerin common use cases. SSV is an ideal target for hardware acceleration due to its limited data dependency structure.ResultsWe present Hardware Accelerated single segment Viterbi Additional Coprocessor (HAVAC), an FPGA-based hardware accelerator for the SSV algorithm. The core HAVAC kernel calculates the SSV matrix at 1739 GCUPS on a Xilinx Alveo U50 FPGA accelerator card, ∼ 227x faster than the optimized SSV implementation innhmmer. Accounting for PCI-e data transfer data processing, HAVAC is 65x faster than nhmmer’s SSV with one thread and 35x faster than nhmmer with four threads, and uses ∼ 31% the energy of a traditional high end Intel CPU. Because these computations are performed on a co-processor, the host CPU remain free to simultaneously compute downstream pHMM alignment and later post-processing.Author summarySequence alignment lies at the heart of genome sequence annotation, and must be both fast and accurate. Signals of relationships between sequences are obscured over time by mutational forces, so that alignment and annotation of the full diversity of life demands highly sensitive tools. Profile hidden Markov models (pHMMs) provide the greatest sensitivity in the face of diversity, but are relatively slow. Here, we describe an approach to improving the speed of pHMM search that leverages field programmable gate arrays - hardware devices that can be configured to implement arbitrary digital circuits to achieve impressive parallelism and energy efficiency. Our tool, HAVAC, accelerates one key bottleneck step in the analysis pipeline employed by the popular pHMM aligment tool, HMMER. HAVAC produces a ∼ 60x speedup over the analogous stage in HMMER. HAVAC can be implemented as a part of a larger sequence homology search tool for faster search times and reduced energy usage. Interested users can download HAVAC on github athttps://github.com/TravisWheelerLab/HAVAC.

Publisher

Cold Spring Harbor Laboratory

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3