Affiliation:
1. Department of Plant and Microbial Biology, North Carolina State University, 4550A Thomas Hall, Box 7615, Raleigh,
27695, NC, United States of America
Abstract
Background:
Investigators using metagenomic sequencing to study microbiomes often
trim and decontaminate reads without knowing their effect on downstream analyses.
Objective:
This study was designed to evaluate the impacts JGI trimming and decontamination procedures
have on assembly and binning metrics, placement of MAGs into species trees, and functional
profiles of MAGs extracted from complex rhizosphere metagenomes, as well as how more aggressive
trimming impacts these binning metrics.
Methods:
Twenty-three Miscanthus x giganteus rhizosphere metagenomes were subjected to different
combinations and thresholds of force, kmer, and quality trimming and decontamination using BBDuk.
Reads were assembled and binned in KBase. Phylogenomic and statistical analyses were applied to
evaluate the effects of trimming and decontamination on downstream analyses.
Results:
We found that JGI trimmed and decontaminated reads had significant impacts on assembly
and binning metrics compared to raw reads, including significantly higher total contig counts, more
contigs greater than 10k bp in length, and larger total lengths of raw assemblies compared to QC assemblies,
and 2.0% lower average contamination of QC MAGs compared to raw MAGs. We also
found that differences in the placement of MAGs in species trees increased with decreasing completeness
and contamination thresholds. Furthermore, aggressive trimming (Q20) was found to significantly
reduce MAG counts.
Conclusion:
Trimming and decontamination of metagenomics reads prior to assembly can change an
investigator’s answer to the questions, “Who is there and what are they doing?” However, mild trimming
and decontamination of metagenomic reads with high-quality scores are recommended for removing
sample processing and sequencing artifacts.
Funder
United States Department of Energy
Publisher
Bentham Science Publishers Ltd.
Subject
Computational Mathematics,Genetics,Molecular Biology,Biochemistry