Abstract
AbstractContrarily to other lineages such as L2 and L4, there are still scarce whole-genome-sequence data on L5-L6 MTBC clinical isolates in public genomes repositories. Recent results suggest a high complexity of L5 history in Africa. It is of importance for an adequate assessment of TB infection in Africa, that is still related to the presence of L5-L6 MTBC strains. This study reports a significant improvement of our knowledge of L5 diversity, phylogeographical history, and global population structure ofMycobacterium africanumL5. To achieve this aim, we sequenced new clinical isolates from Northern Nigeria and from proprietary collections, and used a new powerful bioinformatical pipeline,TB-Annotatorthat explores not only the shared SNPs but also shared missing genes, identical IS6110insertion sites and shared regions of deletion. This study using both newly sequenced genomes and available public genomes allows to describe new L5 sublineages. We report that the MTBC L5 tree is made-up of at least 12 sublineages from which 6 are new descriptions. We confront our new classification to the most recent published one and suggest new naming for the discovered sublineages. Finally, we discuss the phylogeographical specificity of sublineages 5.1 and sublineage 5.2 and suggest a new hypothesis of L5-L6 emergence in Africa.Impact statementRecent studies onMycobacterium africanum(L5-L6-L9 of MTBC) genomic diversity and its evolution in Africa discovered three new lineages of theMycobacterium tuberculosiscomplex (MTBC) in the last ten years (L7, L8, L9). These discoveries are symptomatic of the delay in characterizing the diversity of the MTBC on the African continent. Another understudied part of MTBC diversity is the intra-lineage diversity of L5 and L6. This study unravels an hidden diversity of L5 in Africa and provides a more exhaustive description of specific genetic features of each sublineage by using a proprietary “TB-Annotator” pipeline. Furthermore, we identify different phylogeographical localization trends between L5.1 and L5.2, suggesting different histories. Our results suggest that a better understanding of the spatiotemporal dynamics of MTBC in Africa absolutely requires a large sampling effort and powerful tools to dig into the retrieved diversity.Data summary[A section describing all supporting external data, software or code, including the DOI(s) and/or accession numbers(s), and the associated URL. If no data was generated or reused in the research, please state this.]The search was done in the TB-Annotator 15901 genomes version which is available at: http://(to be added). The new sequenced genomes are available via NCBI Bioproject accession number: (to be added). The authors confirm all supporting data, code and protocols have been provided within the article or through supplementary data files.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献