Abstract
AbstractChemical cross-linking of proteins coupled with mass spectrometry (CXMS) has enjoyed growing popularity in biomedical research. Most CXMS experiments utilize cross-linkers based on N-hydroxysuccinimide (NHS) ester, which react selectively with the amine groups found on the free N-termini of proteins and on the side chain of lysine (K) residues. It is also reported that under certain conditions they can react with the hydroxyl groups of serine (S), threonine (T), and tyrosine (Y). Some of the popular cross-link search engines including MeroX and xiSearch set STY, in addition to K, as cross-linkable sites by default. However, to what extent NHS ester cross-linkers react with STY under the typical CXMS experimental conditions remains unclear, nor has the reliability of STY-cross-link identifications. Here, by setting amino acids with chemically inert side chains such as glycine (G), valine (V), and leucine (L) as cross-linkable sites, which serves as a negative control, we show that software-identified STY-cross-links are only as reliable as GVL-cross-links. This is true across different NHS ester cross-linkers including DSS, DSSO, and DSBU, and across different search engines including MeroX, xiSearch, and pLink. Using a published dataset originated from synthetic peptides, we demonstrate that STY-cross-links indeed have a high false discovery rate. Further analysis revealed that depending on the data and the CXMS search engine used to analyze the data, up to 65% of the STY-cross-links identified are actually K-K cross-links of the same peptide pairs, up to 61% are actually K-mono-links, and the rest tend to contain short peptides at high risk of false identification.
Publisher
Cold Spring Harbor Laboratory