Performance Analysis of Cross-Assembly of Metatranscriptomic Datasets in Viral Community Studies-Reference-Cited by-同舟云学术

Performance Analysis of Cross-Assembly of Metatranscriptomic Datasets in Viral Community Studies

Published:2023-11-20 Issue:2 Volume:18 Page:418-433
ISSN:1994-6538
Container-title:Mathematical Biology and Bioinformatics
language:
Short-container-title:Math.Biol.Bioinf.

Author:

Bukin Yu.S.,Bondaryuk A.N.,Butina T.V.

Abstract

We conducted a comparative analysis of individual and cross-assemblies of several metatranscriptomic data sets to study viral communities using several metatranscriptomes of endemic Baikal mollusks. We have shown that, compared to individual dataset assemblies, a Hidden Markov Model-based cross-assembly procedure increases the number of viral contigs (or scaffolds) per sample, the number of virotypes identified, and the average length of scaffolds per sample. The proportion of assembled viral reads from the total number of reads in samples is higher in cross-assembly. De novo cross-genomic assemblies combined with a virus identification algorithm using Hidden Markov Model present the data in a table with the number of reads from different samples for each scaffold. The table allows comparison of samples based on the representation of all viral scaffolds, including those not taxonomically identified, i.e. those that have no analogues in the NCBI RefSeq database. Thus, cross-genomic assemblies allow for comparative analyzes taking into account the latent diversity of viruses. We propose a pipeline for metatranscriptomic data analysis using de novo cross-genomic assembly to study viral diversity.

Publisher

Institute of Mathematical Problems of Biology of RAS (IMPB RAS)

Subject

Applied Mathematics,Biomedical Engineering

Reference45 articles.

1. Advances in Metagenomics and Its Application in Environmental Microorganisms

2. Metagenomics in Virology

3. Integrating Viral Metagenomics into an Ecological Framework

4. Potential Applications of Human Viral Metagenomics and Reference Materials: Considerations for Current and Future Viruses

5. Unraveling the viral dark matter through viral metagenomics