Abstract
ABSTRACTTremendous advances in mass spectrometric and bioinformatic approaches have expanded proteomics into the field of microbial ecology. The commonly used spectral annotation method for metaproteomics data relies on database searching, which requires sample-specific databases obtained from whole metagenome sequencing experiments. However, creating these databases is complex, time-consuming, and prone to errors, potentially biasing experimental outcomes and conclusions. This asks for alternative approaches that can provide rapid and orthogonal insights into metaproteomics data. Here we present NovoLign, ade novometaproteomics pipeline that performs sequence alignment ofde novosequences from complete metaproteomics experiments. The pipeline enables rapid taxonomic profiling of complex communities and evaluates the taxonomic coverage of metaproteomics outcomes obtained from database searches. Furthermore, the NovoLign pipeline supports the creation of reference sequence databases for database searching to ensure comprehensive coverage. The NovoLign pipeline is publicly available via:https://github.com/hbckleikamp/NovoLign.
Publisher
Cold Spring Harbor Laboratory
Reference73 articles.
1. Microorganisms and their roles in fundamental biogeochemical cycles
2. Human microbiome in health and disease;Annual Review of Pathology: Mechanisms of Disease,2012
3. Rousk, J. & Bengtson, P. , Vol. 5 103 (Frontiers Media SA, 2014).
4. Wierzchos, J. , Ríos, A.d.l. & Ascaso, C. Microorganisms in desert rocks: the edge of life on Earth. (2012).
5. A framework based on fundamental biochemical principles to engineer microbial community dynamics;Current Opinion in Biotechnology,2021