1. SMART 6: recent updates and new developments;Letunic;Nucleic Acids Res.,2009
2. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences;Li;Bioinformatics,2006
3. Clustering of highly homologous sequences to reduce the size of large protein databases;Li;Bioinformatics,2001
4. Tolerating some redundancy significantly speeds up clustering of large protein databases;Li;Bioinformatics,2002
5. Probing metagenomics by rapid cluster analysis of very large datasets;Li;PLoS ONE,2008