IGLoo enables comprehensive analysis and assembly of immunoglobulin heavy-chain loci in lymphoblastoid cell lines using PacBio high-fidelity reads

Cell Rep Methods. 2025 May 19;5(5):101033. doi: 10.1016/j.crmeth.2025.101033. Epub 2025 May 1.

Abstract

High-quality human genome assemblies derived from lymphoblastoid cell lines (LCLs) provide reference genomes and pangenomes for genomics studies. However, LCLs pose technical challenges for profiling immunoglobulin (IG) genes, as their IG loci contain a mixture of germline and somatically recombined haplotypes, making genotyping and assembly difficult with widely used frameworks. To address this, we introduce IGLoo, a software tool that analyzes sequence data and assemblies derived from LCLs, characterizing somatic V(D)J recombination events and identifying breakpoints and missing IG genes in the assemblies. Furthermore, IGLoo implements a reassembly framework to improve germline assembly quality by integrating information on somatic events and population structural variations in IG loci. Applying IGLoo to the assemblies from the Human Pangenome Reference Consortium, we gained valuable insights into the mechanisms, gene usage, and patterns of V(D)J recombination and the causes of assembly artifacts in the IG heavy-chain (IGH) locus, and we improved the representation of IGH assemblies.

Keywords: CP: Genetics; CP: Systems biology; IG; IGH; LCL; assembly; immunoglobulin gene loci; immunoglobulin heavy chain; lymphoblastoid cell line.

MeSH terms

  • Cell Line
  • Genetic Loci*
  • Genome, Human
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Immunoglobulin Heavy Chains* / genetics
  • Sequence Analysis, DNA / methods
  • Software*
  • V(D)J Recombination / genetics

Substances

  • Immunoglobulin Heavy Chains