Abstract
AbstractNew high-quality human genome assemblies derived from lymphoblastoid cell lines (LCLs) provide reference genomes and pangenomes for genomics studies. However, the characteristics of LCLs pose technical challenges to profiling immunoglobulin (IG) genes. IG loci in LCLs contain a mixture of germline and somatically recombined haplotypes, making them difficult to genotype or assemble accurately. To address these challenges, we introduceIGLoo, a software tool that implements novel methods for analyzing sequence data and genome assemblies derived from LCLs.IGLoocharacterizes somatic V(D)J recombination events in the sequence data and identifies the breakpoints and missing IG genes in the LCL-based assemblies. Furthermore,IGLooimplements a novel reassembly framework to improve germline assembly quality by integrating information about somatic events and population structural variantions in the IG loci. We appliedIGLooto study the assemblies from the Human Pangenome Reference Consortium, providing new insights into the mechanisms, gene usage, and patterns of V(D)J recombination, causes of assembly fragmentation in the IG heavy chain (IGH) locus, and improved representation of the IGH assemblies.
Publisher
Cold Spring Harbor Laboratory
Reference30 articles.
1. Somatic generation of antibody diversity
2. Pax5 induces V-to-DJ rearrangements and locus contraction of the immunoglobulin heavy-chain gene
3. Genetic variation in the immunoglobulin heavy chain locus shapes the human antibody repertoire;Nature communications,4419
4. Ighv1-69 polymorphism modulates anti-influenza antibody repertoires, correlates with ighv utilization shifts and varies by ethnicity;Scientific reports,2016
5. Jeong Hyun Lee , Laura Toy , Justin T Kos , Yana Safonova , William R Schief , Colin Havenar-Daughton , Corey T Watson , and Shane Crotty . Vaccine genetics of ighv1-2 vrc01-class broadly neutralizing antibody precursor näıve human b cells. NPJ vaccines, 6(1):113, 2021.