Population matched (PM) germline allelic variants of immunoglobulin (IG) loci: New pmIG database to better understandIGrepertoire and selection processes in disease and vaccination

Author:

Khatri InduORCID,Berkowska Magdalena A.,van den Akker Erik B.,Teodosio Cristina,Reinders Marcel J.T.,van Dongen Jacques J.M.

Abstract

AbstractAt the population level, immunoglobulin (IG) loci harbor inter-individual allelic variants in the many different germlineIGvariable (V), Diversity (D) and Joining (J) genes of theIGheavy (IGH),IGkappa (IGK) andIGlambda (IGL) loci, which together form the genetic basis of the highly diverse antigen-specific B-cell receptors. These inter-individual allelic variants can be shared between or be specific to human populations. The currentIGdatabases IMGT, VBASE2 and IgPdb hold information about germline alleles, most of which are partial sequences, obtained from a mixture of human (B-cell) samples, many with sequence errors and/or acquired (non-germline)IGvariations, induced by somatic hypermutation (SHM) during antigen-specific B-cell responses. We systematically identified true germline alleles (without SHM) from 26 different human populations around the world, profiled by the “1000 Genomes data”. Our resource is uniquely enriched with completeIGallele sequences and their frequencies across human populations. We identified 409IGHV, 179IGKV, and 199IGLVgermline allelessupported by at least seven haplotypes(= minimum of four individuals), after removal of potential false-positives, based on using other genomic databases, i.e. ENSEMBL, TopMed, ExAC, ProjectMine. Remarkably, the positions of the identified variant nucleotides of the different alleles are not at random (as observed in case of SHM), but show striking patterns, restricted to limited nucleotide positions, the same as found in otherIGdata bases, suggesting over-time evolutionary selection processes. The identification of these specific patterns provides extra evidence that the identified variant nucleotides are not sequencing errors, but genuine allelic variants. The diversity of germline allelic variants inIGHandIGLloci is the highest in Africans, while theIGKlocus is most diverse in Europeans. We also report on the presence of recombination signal sequences (RSS) inVpseudogenes, explaining their usage inV(D)Jrearrangements. We propose that this new set of genuine germlineIGsequences can serve as a new population-matchedIG(pmIG) database for better understanding B-cell repertoire and B-cell receptor selection processes in disease and vaccination within and between different human populations. The database in format of fasta is available via GitHub (https://github.com/InduKhatri/pmIG).Contribution to the Field StatementWe present a catalogue of immunoglobulin (IG) germline-alleles of unprecedented completeness and accuracy from 26 different human populations belonging to five different large ethnicities (Source: 1000 Genomes). We identified the population distribution of several known germline alleles and identified multiple new alleles, especially in African populations, indicative of high allelic diversity ofIGgenes in Africa. Strikingly, the identified variant nucleotides of the different alleles are not at random, but show striking patterns, restricted to limited nucleotide positions, the same as found in otherIGdatabases, suggesting over-time evolutionary selection processes. Furthermore, we identified recombination signal sequences in pseudogenes (previously not known). We provide an overview ofIGgermline alleles shared with and between known databases and also point to potential sources of non-germline variation and incompleteness of the existingIGdatabases. More importantly, we believe that this information can serve as a novel population-matchedIG(pmIG) database, highly valuable for the research community in supporting the dissection and understanding of differences in effectiveness of antibody-based immune responses in infectious diseases, other (immune) diseases and vaccination within and between human populations. Such knowledge might be used in developing population-specific vaccination strategies e.g. for currently ongoing SARS-CoV2 pandemic.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3