On the transformation of MinHash-based uncorrected distances into proper evolutionary distances for phylogenetic inference-Reference-Cited by-同舟云学术

On the transformation of MinHash-based uncorrected distances into proper evolutionary distances for phylogenetic inference

Published:2020-11-10 Issue: Volume:9 Page:1309
ISSN:2046-1402
Container-title:F1000Research
language:en
Short-container-title:F1000Res

Author:

Criscuolo Alexis^ORCID

Abstract

Recently developed MinHash-based techniques were proven successful in quickly estimating the level of similarity between large nucleotide sequences. This article discusses their usage and limitations in practice to approximating uncorrected distances between genomes, and transforming these pairwise dissimilarities into proper evolutionary distances. It is notably shown that complex distance measures can be easily approximated using simple transformation formulae based on few parameters. MinHash-based techniques can therefore be very useful for implementing fast yet accurate alignment-free phylogenetic reconstruction procedures from large sets of genomes. This last point of view is assessed with a simulation study using a dedicated bioinformatics tool.

Publisher

F1000 Research Ltd

Subject

General Pharmacology, Toxicology and Pharmaceutics,General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology,General Medicine

Link

https://f1000research.com/articles/9-1309/v1/pdf

Reference62 articles.

1. An assembly and alignment-free method of phylogeny reconstruction from next-generation sequencing data.;H Fan;BMC Genomics.,2015

2. Mash: fast genome and metagenome distance estimation using MinHash.;B Ondov;Genome Biol.,2016

3. sourmash: a library for MinHash sketching of DNA.;C Titus Brown;Journal of Open Source Software.,2016

4. Dashing: Fast and accurate genomic distances with HyperLogLog.;D Baker;bioRxiv.,2019

5. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries.;C Jain;Nat Commun.,2018

Cited by 19 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Lysobacter gummosus 10.1.1, a Producer of Antimicrobial Agents;Microorganisms;2023-11-24

2. Corynebacterium ramonii sp. nov., a novel toxigenic member of the Corynebacterium diphtheriae species complex;Research in Microbiology;2023-09

3. A global Corynebacterium diphtheriae genomic framework sheds light on current diphtheria reemergence;Peer Community Journal;2023-08-31

4. Paenibacillus plantiphilus sp. nov. from the plant environment of Zea mays;Antonie van Leeuwenhoek;2023-06-20

5. RabbitTClust: enabling fast clustering analysis of millions of bacteria genomes with MinHash sketches;Genome Biology;2023-05-17