Identification of errors introduced during high throughput sequencing of the T cell receptor repertoire

Published:2011-02-11 Issue:1 Volume:12 Page:
ISSN:1471-2164
Container-title:BMC Genomics
language:en
Short-container-title:BMC Genomics

Author:

Nguyen Phuong,Ma Jing,Pei Deqing,Obert Caroline,Cheng Cheng,Geiger Terrence L

Abstract

Background: Recent advances in massively parallel sequencing have increased the depth at which T cell receptor (TCR) repertoires can be probed by >3 log10, allowing for saturation sequencing of immune repertoires. The resolution of this sequencing is dependent on its accuracy, and direct assessments of the errors formed during high throughput repertoire analyses are limited.

Results: We analyzed 3 monoclonal TCRs from TCR transgenic, Rag-/- mice using Illumina® sequencing. A total of 27 sequencing reactions were performed for each TCR using a trifurcating design in which samples were divided into 3 at significant processing junctures. More than 20 million complementarity determining region (CDR) 3 sequences were analyzed. Filtering for lower quality sequences diminished but did not eliminate sequence errors, which occurred within 1-6% of sequences. Erroneous sequences were predominantly of correct length and contained single nucleotide substitutions. Rates of specific substitutions varied dramatically in a position-dependent manner. Four substitutions, all purine-pyrimidine transversions, predominated. Solid phase amplification and sequencing, rather than liquid sample amplification and preparation, appeared to be the primary sources of error. Analysis of polyclonal repertoires demonstrated the impact of error accumulation on data parameters.

Conclusions: Caution is needed in interpreting repertoire data due to potential contamination with mis-sequence reads. However, a high association of errors with phred score, high relatedness of erroneous sequences with the parental sequence, dominance of specific nt substitutions, and skewed ratio of forward to reverse reads among erroneous sequences indicate approaches to filter erroneous sequences from repertoire data sets.
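The abstract's observation that erroneous reads are mostly of correct length and differ from the parental sequence by a single nucleotide can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, the read categories, and the toy sequences are all assumptions made for illustration. It compares each read to a known monoclonal reference, tallies single-nucleotide substitutions by position, and labels a change as a transition or a purine-pyrimidine transversion.

```python
from collections import Counter

PURINES = {"A", "G"}  # C and T are pyrimidines


def substitution_class(ref_base, read_base):
    """Label a single-base change as 'transition' or 'transversion'."""
    same_family = (ref_base in PURINES) == (read_base in PURINES)
    return "transition" if same_family else "transversion"


def tally_single_substitutions(reference, reads):
    """Sort reads into categories relative to a monoclonal reference.

    Returns (categories, substitutions). `substitutions` counts
    single-nucleotide changes keyed by (position, ref_base, read_base).
    """
    categories = Counter()
    substitutions = Counter()
    for read in reads:
        if read == reference:
            categories["exact"] += 1
            continue
        if len(read) != len(reference):
            categories["length_mismatch"] += 1
            continue
        # Same length: list every mismatching position.
        diffs = [
            (i, r, b)
            for i, (r, b) in enumerate(zip(reference, read))
            if r != b
        ]
        if len(diffs) == 1:
            categories["single_substitution"] += 1
            substitutions[diffs[0]] += 1
        else:
            categories["multiple_substitutions"] += 1
    return categories, substitutions


# Hypothetical toy data, not a real CDR3 sequence.
reference = "TGTGCCAGC"
reads = [
    "TGTGCCAGC",  # exact match
    "TGTGCCAGC",  # exact match
    "TGTTCCAGC",  # G->T at position 3
    "TGTGCCAGA",  # C->A at position 8
    "TGTGCCAG",   # wrong length
    "TATTCCAGC",  # two substitutions
]
categories, substitutions = tally_single_substitutions(reference, reads)
for (pos, ref_base, read_base), count in sorted(substitutions.items()):
    print(pos, ref_base, read_base, substitution_class(ref_base, read_base), count)
```

In a real analysis the reads would typically also be filtered by phred score first. A monoclonal reference such as the one assumed here is what makes it possible to call every deviation an error.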

Publisher

Springer Science and Business Media LLC

Subject

Genetics,Biotechnology

Link

https://link.springer.com/content/pdf/10.1186/1471-2164-12-106.pdf

References: 27 articles.

1. Casrouge A, Beaudoing E, Dalle S, Pannetier C, Kanellopoulos J, Kourilsky P: Size estimate of the alpha beta TCR repertoire of naive mouse splenocytes. J Immunol. 2000, 164: 5782-5787.

2. Arstila TP, Casrouge A, Baron V, Even J, Kanellopoulos J, Kourilsky P: A direct estimate of the human alphabeta T cell receptor diversity. Science. 1999, 286: 958-961. 10.1126/science.286.5441.958.

3. Rudolph MG, Stanfield RL, Wilson IA: How TCRs bind MHCs, peptides, and coreceptors. Annu Rev Immunol. 2006, 24: 419-466. 10.1146/annurev.immunol.23.021704.115658.

4. Moon JJ, Chu HH, Pepper M, McSorley SJ, Jameson SC, Kedl RM, Jenkins MK: Naive CD4(+) T cell frequency varies for different epitopes and predicts repertoire diversity and response magnitude. Immunity. 2007, 27: 203-213. 10.1016/j.immuni.2007.07.007.

5. Wynn KK, Crough T, Campbell S, McNeil K, Galbraith A, Moss DJ, Silins SL, Bell S, Khanna R: Narrowing of T-cell receptor beta variable repertoire during symptomatic herpesvirus infection in transplant patients. Immunol Cell Biol. 2010, 88: 125-135. 10.1038/icb.2009.74.

Cited by 62 articles.

1. Machine Learning Approaches to TCR Repertoire Analysis;Frontiers in Immunology;2022-07-15

2. Analysis of T-Cell Receptor Repertoire in Transplantation: Fingerprint of T Cell-mediated Alloresponse;Frontiers in Immunology;2022-01-12

3. High-throughput and single-cell T cell receptor sequencing technologies;Nature Methods;2021-07-19

4. Dynamics of thymus function and T cell receptor repertoire breadth in health and disease;Seminars in Immunopathology;2021-02

5. Sequencing barcode construction and identification methods based on block error-correction codes;Science China Life Sciences;2020-04-14