A study of pClust settings-Reference-Cited by-同舟云学术

A study of pClust settings

Published:2020-07-22 Issue:1 Volume:11 Page:1-5
ISSN:2331-9291
Container-title:ACM SIGBioinformatics Record
language:en
Short-container-title:ACM SIGBioinformatics Rec.

Author:

Khaledian Ehdieh¹,Broschat Shira L.¹

Affiliation:

1. Washington State University

Abstract

Recently, high-throughput approaches to DNA sequencing such as massive parallel sequencing have resulted in the availability of a vast number of whole genome sequences. This availability has presented scientists with an unprecedented opportunity to gain knowledge by means of datamining and data analysis. A number of our datamining and data analysis strategies are based on the use of a fast and accurate software tool, pClust , to group protein sequences into homologous clusters. However, pClust has a number of parameters with values that must be chosen, and the choice of these values affects the accuracy of the clustering results. In this paper, we present a study of the most significant parameters: alignment length, match similarity, and optimal score. In addition, we study both local and semi-global alignments.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3411750.3411751

Reference20 articles.

1. A work stealing based approach for enabling scalable optimal sequence homology detection

2. M. Hauser C. E. Mayer and J. Söding "kclust: fast and sensitive clustering of large protein sequence databases " BMC Bioinformatics vol. 14 no. 1 p. 248 2013. M. Hauser C. E. Mayer and J. Söding "kclust: fast and sensitive clustering of large protein sequence databases " BMC Bioinformatics vol. 14 no. 1 p. 248 2013.

3. Biological Databases- Integration of Life Science Data

4. The Importance of Biological Databases in Biological Discovery

5. E. Khaledian A. H. Gebremedhin K. A. Brayton and S. L. Broschat "A network science approach for determining the ancestral phylum of bacteria " in Proceedings of the 2018 ACM International Conference on Bioinformatics Computational Biology and Health Informatics 2018 pp. 398--403. E. Khaledian A. H. Gebremedhin K. A. Brayton and S. L. Broschat "A network science approach for determining the ancestral phylum of bacteria " in Proceedings of the 2018 ACM International Conference on Bioinformatics Computational Biology and Health Informatics 2018 pp. 398--403.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. PASS: Protein Annotation Surveillance Site for Protein Annotation Using Homologous Clusters, NLP, and Sequence Similarity Networks;Frontiers in Bioinformatics;2021-09-29