Data mining antibody sequences for database searching in bottom-up proteomics

Author:

Trinh Xuan-TungORCID,Freitag Rebecca,Krawczyk Konrad,Schwämmle VeitORCID

Abstract

AbstractMass spectrometry (MS)-based proteomics allows identifying and quantifying thousands of proteins but suffers from challenges when measuring human antibodies due to their vast variety. The mainly used bottom-up proteomics approaches rely on database searches that compare experimental values of peptides and their fragments to theoretical values derived from protein sequences in a database. While the human body can produce millions of distinct antibodies, the current databases for human antibodies such as UniProtKB/Swiss-Prot are limited to only 1095 sequences (as of 2024 Jan). This limitation may hinder the identification of new antibodies using mass spectrometry. Therefore, extending the database for mass spectrometry is an important task for discovering new antibodies. Recent genomic studies have compiled millions of human antibody sequences publicly accessible through the Observed Antibody Space (OAS) database. However, this data has yet to be exploited to confirm the presence of these antibodies. In this study, we adopted this extensive collection of antibody sequences for conducting efficient database searches in publicly available proteomics data with a focus on the SARS-CoV-2 disease. Thirty million heavy antibody sequences from 146 SARS-CoV-2 patients in the OAS database were digestedin silicoto obtain 18 million unique peptides. These peptides were then used to create new databases for bottom-up proteomics. We used those databases for searching new antibody peptides in publicly available SARS-CoV-2 human plasma samples in the Proteomics Identification Database (PRIDE). This approach avoids false positives in antibody peptide identification as confirmed by searching against negative controls (brain samples) and employing different database sizes. We show that the found sequences provide valuable information to distinguish diseased from healthy and expect that the newly discovered antibody peptides can be further employed to develop therapeutic antibodies. The method will be broadly applicable to find characteristic antibodies for other diseases.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3