Author:
Gálvez Sergio,Agostini Federico,Caselli Javier,Hernandez Pilar,Dorado Gabriel
Abstract
New High-Performance Computing architectures have been recently developed for commercial central processing unit (CPU). Yet, that has not improved the execution time of widely used bioinformatics applications, like BLAST+. This is due to a lack of optimization between the bases of the existing algorithms and the internals of the hardware that allows taking full advantage of the available CPU cores. To optimize the new architectures, algorithms must be revised and redesigned; usually rewritten from scratch. BLVector adapts the high-level concepts of BLAST+ to the x86 architectures with AVX-512, to harness their capabilities. A deep comprehensive study has been carried out to optimize the approach, with a significant reduction in time execution. BLVector reduces the execution time of BLAST+ when aligning up to mid-size protein sequences (∼750 amino acids). The gain in real scenario cases is 3.2-fold. When applied to longer proteins, BLVector consumes more time than BLAST+, but retrieves a much larger set of results. BLVector and BLAST+ are fine-tuned heuristics. Therefore, the relevant results returned by both are the same, although they behave differently specially when performing alignments with low scores. Hence, they can be considered complementary bioinformatics tools.
Subject
Genetics(clinical),Genetics,Molecular Medicine
Reference31 articles.
1. Basic local alignment search tool.;Altschul;J. Mol. Biol.,1990
2. UniProt: a worldwide hub of protein knowledge.;Bateman;Nucleic Acids Res.,2019
3. HPC-BLAST scalable sequence analysis for the intel® many integrated core future;Brook;Supercomputing 2014.,2014
4. A model for evolutionary change in proteins;Dayhoff;Atlas of Protein Sequence and Structure,1978
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献