PanTA: An ultra-fast method for constructing large and growing microbial pangenomes-Reference-Cited by-同舟云学术

PanTA: An ultra-fast method for constructing large and growing microbial pangenomes

Published:2023-07-03 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Le Duc Quang,Nguyen Tien Anh,Nguyen Tam Thi,Nguyen Son Hoang,Do Van Hoan,Nguyen Canh Hao,Phung Huong Thanh,Ho Tho Huu,Nam Vo Sy,Nguyen Trang,Nguyen Hoang Anh,Cao Minh Duc

Abstract

AbstractPangenome analysis has become indispensable in bacterial genomics due to the high variability of gene content between isolates within a clade. While many computational methods exist for constructing the pangenome from a bacterial genome collection, speed and scalability still remain an issue for the fast-growing genomic collections. Here, we present PanTA, a efficient method to build and analyze pangenomes of bacteria strains. We show that PanTA exhibits an unprecedented 10 times speed up and 2 times more memory efficient over the current state of the art methods. More importantly, PanTA enables the progressive pangenome construction where new samples are added into an existing pangenome without the need of rebuilding the accumulated collection from the scratch. The progressive building of pangenomes can further reduce the memory requirements by half. We demonstrate that PanTA can build the pangenome of theEscherichia colispecies from the entire collection of over 28000 high quality genomes collected from the RefSeq database. Crucially, the whole analysis is performed on a modest laptop computer within two days, highlighting the scalability and practicality of PanTA.

Publisher

Cold Spring Harbor Laboratory

Reference35 articles.

1. Why prokaryotes have pangenomes

2. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: Implications for the microbial "pan-genome"

3. Distinct evolutionary trajectories in the Escherichia coli pangenome occur within sequence types

4. Current status of pan-genome analysis for pathogenic bacteria

5. Insights into the population structure and pan-genome of Haemophilus influenzae

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Pasa: leveraging population pangenome graph to scaffold prokaryote genome assemblies;Nucleic Acids Research;2023-12-12

2. Pasa: Leverage population pangenome graph to scaffold prokaryote genome assemblies;2023-07-10