RESUMO
In adaptive immune receptor repertoire analysis, determining the germline variable (V) allele associated with each T- and B-cell receptor sequence is a crucial step. This process is highly impacted by allele annotations. Aligning sequences, assigning them to specific germline alleles, and inferring individual genotypes are challenging when the repertoire is highly mutated, or sequence reads do not cover the whole V region. Here, we propose an alternative naming scheme for the V alleles, as well as a novel method to infer individual genotypes. We demonstrate the strengths of the two by comparing their outcomes to other genotype inference methods. We validate the genotype approach with independent genomic long-read data. The naming scheme is compatible with current annotation tools and pipelines. Analysis results can be converted from the proposed naming scheme to the nomenclature determined by the International Union of Immunological Societies (IUIS). Both the naming scheme and the genotype procedure are implemented in a freely available R package (PIgLET https://bitbucket.org/yaarilab/piglet). To allow researchers to further explore the approach on real data and to adapt it for their uses, we also created an interactive website (https://yaarilab.github.io/IGHV_reference_book).
Assuntos
Genômica , Cadeias Pesadas de Imunoglobulinas , Receptores de Antígenos de Linfócitos B , Alelos , Genótipo , Receptores de Antígenos de Linfócitos B/genética , Cadeias Pesadas de Imunoglobulinas/genéticaRESUMO
Reproductive isolation drives the formation of new species, and many genes contribute to this through Dobzhansky-Muller incompatibilities (DMIs). These incompatibilities occur when gene divergence affects loci encoding interacting products such as receptors and their ligands. We suggest here that the nature of vertebrate immunoglobulin (IG) genes must make them prone to DMIs. The genes of these complex loci form functional genes through the process of recombination, giving rise to a repertoire of heterodimeric receptors of incredible diversity. This repertoire, within individuals and within species, must defend against pathogens but must also avoid pathogenic self-reactivity. We suggest that this avoidance of autoimmunity is only achieved through a coordination of evolution between heavy- and light-chain genes, and between these genes and the rest of the genome. Without coordinated evolution, the hybrid offspring of two diverging populations will carry a heavy burden of DMIs, resulting in a loss of fitness. Critical incompatibilities could manifest as incompatibilities between a mother and her divergent offspring. During fetal development, biochemical differences between the parents of hybrid offspring could make their offspring a target of the maternal immune system. This hypothesis was conceived in the light of recent insights into the population genetics of IG genes. This has suggested that antibody genes are probably as susceptible to evolutionary forces as other parts of the genome. Further repertoire studies in human and nonhuman species should now help determine whether antibody genes have been part of the evolutionary forces that drive the development of species.
Assuntos
Especiação Genética , Isolamento Reprodutivo , Animais , Feminino , Genes de Imunoglobulinas , Humanos , Modelos Genéticos , Vertebrados/genéticaRESUMO
VDJbase is a publicly available database that offers easy searching of data describing the complete sets of gene sequences (genotypes and haplotypes) inferred from adaptive immune receptor repertoire sequencing datasets. VDJbase is designed to act as a resource that will allow the scientific community to explore the genetic variability of the immunoglobulin (Ig) and T cell receptor (TR) gene loci. It can also assist in the investigation of Ig- and TR-related genetic predispositions to diseases. Our database includes web-based query and online tools to assist in visualization and analysis of the genotype and haplotype data. It enables users to detect those alleles and genes that are significantly over-represented in a particular population, in terms of genotype, haplotype and gene expression. The database website can be freely accessed at https://www.vdjbase.org/, and no login is required. The data and code use creative common licenses and are freely downloadable from https://bitbucket.org/account/user/yaarilab/projects/GPHP.
Assuntos
Biologia Computacional/métodos , Bases de Dados Genéticas , Genótipo , Haplótipos , Receptores Imunológicos/genética , Recombinação V(D)J , Humanos , Anotação de Sequência Molecular , Receptores de Antígenos de Linfócitos B/genética , Receptores de Antígenos de Linfócitos T/genética , Software , Design de Software , Navegador , Fluxo de TrabalhoRESUMO
The genomes of classical inbred mouse strains include genes derived from all three major subspecies of the house mouse, Mus musculus. We recently posited that genetic diversity in the immunoglobulin heavy chain (IGH) gene loci of C57BL/6 and BALB/c mice reflects differences in subspecies origin. To investigate this hypothesis, we conducted high-throughput sequencing of IGH gene rearrangements to document IGH variable (IGHV), joining (IGHJ) and diversity (IGHD) genes in four inbred wild-derived mouse strains (CAST/EiJ, LEWES/EiJ, MSM/MsJ and PWD/PhJ) and a single disease model strain (NOD/ShiLtJ), collectively representing genetic backgrounds of several major mouse subspecies. A total of 341 germline IGHV sequences were inferred in the wild-derived strains, including 247 not curated in the international ImMunoGeneTics information system. By contrast, 83/84 inferred NOD IGHV genes had previously been observed in C57BL/6 mice. Variability among the strains examined was observed for only a single IGHJ gene, involving a description of a novel allele. By contrast, unexpected variation was found in the IGHD gene loci, with four previously unreported IGHD gene sequences being documented. Very few IGHV sequences of C57BL/6 and BALB/c mice were shared with strains representing major subspecies, suggesting that their IGH loci may be complex mosaics of genes of disparate origins. This suggests a similar level of diversity is likely present in the IGH loci of other classical inbred strains. This must now be documented if we are to properly understand interstrain variation in models of antibody-mediated disease.
Assuntos
Cadeias Pesadas de Imunoglobulinas/genética , Região Variável de Imunoglobulina/genética , Animais , Sequência de Bases , Bases de Dados Genéticas , Células Germinativas/metabolismo , Camundongos Endogâmicos C57BL , Camundongos Endogâmicos NODRESUMO
The immune systems of all mammals include populations of B cells producing antibodies with incredibly diverse specificities. Repertoire diversity has been described as the "miracle of immunology," and it was long thought to be the result of essentially stochastic processes. Recently, however, analysis of high throughput gene sequencing data has shown that hard-wired biases in these processes result in antibody repertoires that are broadly predictable. The repertoires of mice and humans are both predictable, but they are strikingly different. In this review, features of the naïve antibody repertoires of the two species are contrasted. We show that the mouse repertoire includes a conspicuous population of public clonotypes that are shared by different individuals of an inbred strain. These clonotypes are the result of gene rearrangements that involve little gene processing. By skewing repertoire formation toward such sequences, which probably target commonly encountered pathogens, it may be that the relatively small mouse repertoire is appropriate and effective despite its size. Species like the mouse face challenges that are a direct consequence of their small body sizes and the limitations this places on the antibody arsenal-particularly early in ontogeny. We propose that it is the differences in the naïve repertoires of mice and humans, and the differences in the ways these repertoires are used, which ensure that the very different biological needs of the two species are met. The processes that contribute to repertoire formation may appear to be stochastic, but in both species, evolution has left little to chance.
Assuntos
Anticorpos/imunologia , Linfócitos B/imunologia , Evolução Molecular , Sistema Imunitário , Animais , Anticorpos/genética , Humanos , CamundongosRESUMO
Human IgH diversity is influenced by biases in the pairing of IGHD and IGHJ genes, but these biases have not been described in detail. We used high-throughput sequencing of VDJ rearrangements to explore DJ pairing biases in 29 individuals. It was possible to infer three contrasting IGHD-IGHJ haplotypes in nine of these individuals, and two of these haplotypes include deletion polymorphisms involving multiple contiguous IGHD genes. Therefore, we were able to explore how the underlying genetic makeup of the H chain locus influences the formation of particular DJ pairs. Analysis of nonproductive rearrangements demonstrates that 3' IGHD genes tend to pair preferentially with 5' IGHJ genes, whereas 5' IGHD genes pair preferentially with 3' IGHJ genes; the relationship between IGHD gene pairing frequencies and IGHD gene position is a near linear one for each IGHJ gene. However, striking differences are seen in individuals who carry deletion polymorphisms in the D locus. The absence of different blocks of IGHD genes leads to increases in the utilization frequencies of just a handful of genes, and these genes have no clear positional relationships to the deleted genes. This suggests that pairing frequencies may be influenced by additional complex positional relationships that perhaps arise from chromatin structure. In contrast to IGHD gene usage, IGHJ gene usage is unaffected by the IGHD gene-deletion polymorphisms. Such an outcome would be expected if the recombinase complex associates with an IGHJ gene before associating with an IGHD gene partner.
Assuntos
Rearranjo Gênico de Cadeia Pesada de Linfócito B/genética , Genes de Cadeia Pesada de Imunoglobulina/genética , Cadeias Pesadas de Imunoglobulinas/genética , Recombinação V(D)J/genética , Sequência de Bases , Genótipo , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Dados de Sequência Molecular , Polimorfismo GenéticoRESUMO
The chemotherapeutic Parthenolide is an exciting new candidate for the treatment of acute lymphoblastic leukemia, but like many other small-molecule drugs, it has low aqueous solubility. As a consequence, Parthenolide can only be administered clinically in the presence of harmful cosolvents. Accordingly, we describe the synthesis, characterization, and testing of a range of biocompatible triblock copolymer micelles as particle-based delivery vectors for the hydrophobic drug Parthenolide. The drug-loaded particles are produced via an emulsion-to-micelle transition method, and the effects of introducing anionic and cationic surface charges on stability, drug sequestration, biocompatibility, and efficacy are investigated. Significantly, we demonstrate high levels of efficacy in the organic solvent-free systems against human mesenchymal stem cells and primary T-acute lymphoblastic leukemia patient cells, highlighting the effectiveness of the delivery vectors for the treatment of acute lymphoblastic leukemia.
Assuntos
Portadores de Fármacos/química , Polímeros/química , Leucemia-Linfoma Linfoblástico de Células Precursoras/tratamento farmacológico , Sesquiterpenos/química , Sesquiterpenos/farmacologia , Materiais Biocompatíveis/química , Células Cultivadas , Estabilidade de Medicamentos , Emulsões/química , Humanos , Células-Tronco Mesenquimais/efeitos dos fármacos , Micelas , Solubilidade , Solventes/químicaRESUMO
A model of murine IgG function is presented in which the co-expression of the IgG subclasses is a central feature, class switching occurs before the commencement of somatic hypermutation, and there is little switching between subclasses. It is named the quartet model to emphasize the harmony that comes from the simultaneous presence of the four subclasses. In this model, IgG3 and IgG2b antibodies are particularly important early in the response, when T-cell help may be limiting. IgG3 initiates inflammation through complement fixation, whereas IgG2b provides early FcγR-mediated effector functions. As T-cell help strengthens, IgG2a antibodies increase the power of the response, whereas IgG1 production helps limit the inflammatory drive and limits immunopathology. The model highlights the fact that murine IgG subclasses function quite differently to human IgG subclasses. This allows them to serve the special immunological needs of a species that is vulnerable because of its small size.
Assuntos
Imunoglobulina G/metabolismo , Modelos Imunológicos , Animais , Formação de Anticorpos/imunologia , Humanos , Imunoglobulina G/classificação , Camundongos , Hipermutação Somática de Imunoglobulina/genéticaRESUMO
Somatic point mutations provide glimpses into B-cell histories, and mutation numbers generally correlate with antibody affinity. We recently proposed a model of human isotype function, based in part on mutation analysis, in which the dominant pathway of isotype switching involves B cells moving sequentially through the four immunoglobulin (Ig) G subclasses. This should result in predictable differences in affinity between isotypes, and this helps explain how different isotypes work together. The model built on analysis of rearranged immunoglobulin heavy chain sequences amplified from Papua New Guinean villagers, which showed highly significant differences in the mean number of V-REGION mutations in sequences, associated with the different IgG subclasses. To determine whether this relationship between mutation levels and isotypes is a more general phenomenon, the present study was conducted in healthy, urban residents of Sydney, Australia. VDJ sequences were generated from eight individuals, using 454 pyrosequencing, from cells expressing all isotypes except IgD and IgE. This resulted in 35 118 unique, productive VDJ sequences for the study. The data confirm that VDJ genes associated with progressively more 3' Ig heavy chain gamma (IGHG) constant region genes show increasing levels of point mutation. Mean V-REGION mutations in IgA1 and IgA2 sequences were similar. Patterns of mutations also differed between isotypes. Despite their association with T-independent responses, IgG2 sequences showed significantly more mutational evidence of antigen selection than other IgG isotypes. Antigen selection was also significantly higher in IgA2 than in IgA1 sequences, raising the possibility of a preferential switch pathway from IGHG2 to IGHA2.
Assuntos
Isotipos de Imunoglobulinas/genética , Mutação , Éxons VDJ , Regiões Determinantes de Complementaridade/genética , Rearranjo Gênico do Linfócito B , Variação Genética , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Switching de Imunoglobulina , Imunoglobulina G/genética , Cadeias Pesadas de Imunoglobulinas/genética , Região Variável de Imunoglobulina/genéticaRESUMO
The existence of many highly similar genes in the lymphocyte receptor gene loci makes them difficult to investigate, and the determination of phased "haplotypes" has been particularly problematic. However, V(D)J gene rearrangements provide an opportunity to infer the association of Ig genes along the chromosomes. The chromosomal distribution of H chain genes in an Ig genotype can be inferred through analysis of VDJ rearrangements in individuals who are heterozygous at points within the IGH locus. We analyzed VDJ rearrangements from 44 individuals for whom sufficient unique rearrangements were available to allow comprehensive genotyping. Nine individuals were identified who were heterozygous at the IGHJ6 locus and for whom sufficient suitable VDJ rearrangements were available to allow comprehensive haplotyping. Each of the 18 resulting IGHVâIGHDâIGHJ haplotypes was unique. Apparent deletion polymorphisms were seen that involved as many as four contiguous, functional IGHV genes. Two deletion polymorphisms involving multiple contiguous IGHD genes were also inferred. Three previously unidentified gene duplications were detected, where two sequences recognized as allelic variants of a single gene were both inferred to be on a single chromosome. Phased genomic data brings clarity to the study of the contribution of each gene to the available repertoire of rearranged VDJ genes. Analysis of rearrangement frequencies suggests that particular genes may have substantially different yet predictable propensities for rearrangement within different haplotypes. Together with data highlighting the extent of haplotypic variation within the population, this suggests that there may be substantial variability in the available Ab repertoires of different individuals.
Assuntos
Haplótipos , Cadeias Pesadas de Imunoglobulinas/genética , Região Variável de Imunoglobulina/genética , Recombinação V(D)J/genética , Rearranjo Gênico , Genes de Imunoglobulinas , Loci Gênicos , Genótipo , Heterozigoto , Humanos , Polimorfismo GenéticoRESUMO
Introduction: Analysis of an individual's immunoglobulin (IG) gene repertoire requires the use of high-quality germline gene reference sets. When sets only contain alleles supported by strong evidence, AIRR sequencing (AIRR-seq) data analysis is more accurate and studies of the evolution of IG genes, their allelic variants and the expressed immune repertoire is therefore facilitated. Methods: The Adaptive Immune Receptor Repertoire Community (AIRR-C) IG Reference Sets have been developed by including only human IG heavy and light chain alleles that have been confirmed by evidence from multiple high-quality sources. To further improve AIRR-seq analysis, some alleles have been extended to deal with short 3' or 5' truncations that can lead them to be overlooked by alignment utilities. To avoid other challenges for analysis programs, exact paralogs (e.g. IGHV1-69*01 and IGHV1-69D*01) are only represented once in each set, though alternative sequence names are noted in accompanying metadata. Results and discussion: The Reference Sets include less than half the previously recognised IG alleles (e.g. just 198 IGHV sequences), and also include a number of novel alleles: 8 IGHV alleles, 2 IGKV alleles and 5 IGLV alleles. Despite their smaller sizes, erroneous calls were eliminated, and excellent coverage was achieved when a set of repertoires comprising over 4 million V(D)J rearrangements from 99 individuals were analyzed using the Sets. The version-tracked AIRR-C IG Reference Sets are freely available at the OGRDB website (https://ogrdb.airr-community.org/germline_sets/Human) and will be regularly updated to include newly observed and previously reported sequences that can be confirmed by new high-quality data.
Assuntos
Genes de Imunoglobulinas , Imunoglobulinas , Humanos , Imunoglobulinas/genética , Alelos , Recombinação V(D)J/genética , Células GerminativasRESUMO
Analysis of an individual's immunoglobulin or T cell receptor gene repertoire can provide important insights into immune function. High-quality analysis of adaptive immune receptor repertoire sequencing data depends upon accurate and relatively complete germline sets, but current sets are known to be incomplete. Established processes for the review and systematic naming of receptor germline genes and alleles require specific evidence and data types, but the discovery landscape is rapidly changing. To exploit the potential of emerging data, and to provide the field with improved state-of-the-art germline sets, an intermediate approach is needed that will allow the rapid publication of consolidated sets derived from these emerging sources. These sets must use a consistent naming scheme and allow refinement and consolidation into genes as new information emerges. Name changes should be minimised, but, where changes occur, the naming history of a sequence must be traceable. Here we outline the current issues and opportunities for the curation of germline IG/TR genes and present a forward-looking data model for building out more robust germline sets that can dovetail with current established processes. We describe interoperability standards for germline sets, and an approach to transparency based on principles of findability, accessibility, interoperability, and reusability.
RESUMO
We have analysed the transcribed immunoglobulin kappa (IGK) repertoire of peripheral blood B cells from four individuals from two genetically distinct populations, Papua New Guinean and Australian, using high-throughput DNA sequencing. The depth of sequencing data for each individual averaged 5,548 high-quality IGK reads, and permitted genotyping of the inferred IGKV and IGKJ germline gene segments for each individual. All individuals were homozygous at each IGKJ locus and had highly similar inferred IGKV genotypes. Preferential gene usage was seen at both the IGKV and IGKJ loci, but only IGKV segment usage varied significantly between individuals. Despite the differences in IGKV gene utilisation, the rearranged IGK repertoires showed extensive identity at the amino acid level. Public rearrangements (those shared by two or more individuals) made up 60.2% of the total sequenced IGK rearrangements. The total diversity of IGK rearrangements of each individual was estimated to range from just 340 to 549 unique amino acid sequences. Thus, the repertoire of unique expressed IGK rearrangements is dramatically less than previous theoretical estimates of IGK diversity, and the majority of expressed IGK rearrangements are likely to be extensively shared in individual human beings.
Assuntos
Linfócitos B/imunologia , Rearranjo Gênico do Linfócito B , Cadeias kappa de Imunoglobulina/genética , Alelos , Austrália , Genética Populacional , Genótipo , Humanos , Cadeias kappa de Imunoglobulina/imunologia , Papua Nova GuinéRESUMO
Individual variation in the Ig germline gene repertoire leads to individual differences in the combinatorial diversity of the Ab repertoire, but the study of such variation has been problematic. The application of high-throughput DNA sequencing to the study of rearranged Ig genes now makes this possible. The sequencing of thousands of VDJ rearrangements from an individual, either from genomic DNA or expressed mRNA, should allow their germline IGHV, IGHD, and IGHJ repertoires to be inferred. In addition, where previously mere glimpses of diversity could be gained from sequencing studies, new large data sets should allow the rearrangement frequency of different genes and alleles to be seen with clarity. We analyzed the DNA of 108,210 human IgH chain rearrangements from 12 individuals and determined their individual IGH genotypes. The number of reportedly functional IGHV genes and allelic variants ranged from 45 to 60, principally because of variable levels of gene heterozygosity, and included 14 previously unreported IGHV polymorphisms. New polymorphisms of the IGHD3-16 and IGHJ6 genes were also seen. At heterozygous loci, remarkably different rearrangement frequencies were seen for the various IGHV alleles, and these frequencies were consistent between individuals. The specific alleles that make up an individual's Ig genotype may therefore be critical in shaping the combinatorial repertoire. The extent of genotypic variation between individuals is highlighted by an individual with aplastic anemia who appears to lack six contiguous IGHD genes on both chromosomes. These deletions significantly alter the potential expressed IGH repertoire, and possibly immune function, in this individual.
Assuntos
Genes de Cadeia Pesada de Imunoglobulina , Região Variável de Imunoglobulina/genética , Sequência de Bases , Rearranjo Gênico do Linfócito B , Genótipo , Humanos , Dados de Sequência Molecular , Reação em Cadeia da Polimerase , Polimorfismo GenéticoRESUMO
Background and aims: The majority of Australians are regular users of social media, especially young adults. Of concern, is that a minority of people appear to use social media in an addictive or problematic way which is associated with negative psychological outcomes such as depression. Social comparisons, where users compare themselves to others on social media, have also been linked with depression. Therefore, the key aim of the study was to determine whether social comparisons mediate the relationship between Problematic Social Media Use (PSMU) and depression. Method: In a two-part study 144 participants (65 females) answered a series of self-report questions assessing factors relating to PSMU and then came into the lab to view a series of social media images, (pre-tested to be upward or downward comparisons). Results: Females used social media more problematically, liked more upward than downward comparison images and compared themselves more negatively to others on social media than did males. Higher PSMU scores were associated with depression and low self-esteem and comparing oneself more negatively to others on social media. Finally, focusing on upward comparisons and a tendency to make negative comparisons to others on social media partially mediated the association between PSMU and depression. Discussion and conclusions: Social comparisons may function as a mechanism linking PSMU with negative psychological outcomes. Clinical interventions for individuals with PSMU which reduce the focus on upward social comparisons may also reduce negative psychological outcomes such as depression.
Assuntos
Mídias Sociais , Austrália , Depressão/psicologia , Feminino , Humanos , Masculino , Autoimagem , Comparação Social , Adulto JovemRESUMO
BACKGROUND: T and B cell receptor (TCR, BCR) repertoires constitute the foundation of adaptive immunity. Adaptive immune receptor repertoire sequencing (AIRR-seq) is a common approach to study immune system dynamics. Understanding the genetic factors influencing the composition and dynamics of these repertoires is of major scientific and clinical importance. The chromosomal loci encoding for the variable regions of TCRs and BCRs are challenging to decipher due to repetitive elements and undocumented structural variants. METHODS: To confront this challenge, AIRR-seq-based methods have recently been developed for B cells, enabling genotype and haplotype inference and discovery of undocumented alleles. However, this approach relies on complete coverage of the receptors' variable regions, whereas most T cell studies sequence a small fraction of that region. Here, we adapted a B cell pipeline for undocumented alleles, genotype, and haplotype inference for full and partial AIRR-seq TCR data sets. The pipeline also deals with gene assignment ambiguities, which is especially important in the analysis of data sets of partial sequences. RESULTS: From the full and partial AIRR-seq TCR data sets, we identified 39 undocumented polymorphisms in T cell receptor Beta V (TRBV) and 31 undocumented 5 ' UTR sequences. A subset of these inferences was also observed using independent genomic approaches. We found that a single nucleotide polymorphism differentiating between the two documented T cell receptor Beta D2 (TRBD2) alleles is strongly associated with dramatic changes in the expressed repertoire. CONCLUSIONS: We reveal a rich picture of germline variability and demonstrate how a single nucleotide polymorphism dramatically affects the composition of the whole repertoire. Our findings provide a basis for annotation of TCR repertoires for future basic and clinical studies.
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Receptores de Antígenos de Linfócitos T alfa-beta , Alelos , Células Germinativas , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Receptores de Antígenos de Linfócitos T/genética , Receptores de Antígenos de Linfócitos T alfa-beta/genéticaRESUMO
The immunoglobulin genes of inbred mouse strains that are commonly used in models of antibody-mediated human diseases are poorly characterized. This compromises data analysis. To infer the immunoglobulin genes of BALB/c mice, we used long-read SMRT sequencing to amplify VDJ-C sequences from F1 (BALB/c x C57BL/6) hybrid animals. Strain variations were identified in the Ighm and Ighg2b genes, and analysis of VDJ rearrangements led to the inference of 278 germline IGHV alleles. 169 alleles are not present in the C57BL/6 genome reference sequence. To establish a set of expressed BALB/c IGHV germline gene sequences, we computationally retrieved IGHV haplotypes from the IgM dataset. Haplotyping led to the confirmation of 162 BALB/c IGHV gene sequences. A musIGHV398 pseudogene variant also appears to be present in the BALB/cByJ substrain, while a functional musIGHV398 gene is highly expressed in the BALB/cJ substrain. Only four of the BALB/c alleles were also observed in the C57BL/6 haplotype. The full set of inferred BALB/c sequences has been used to establish a BALB/c IGHV reference set, hosted at https://ogrdb.airr-community.org. We assessed whether assemblies from the Mouse Genome Project (MGP) are suitable for the determination of the genes of the IGH loci. Only 37 (43.5%) of the 85 confirmed IMGT-named BALB/c IGHV and 33 (42.9%) of the 77 confirmed non-IMGT IGHV were found in a search of the MGP BALB/cJ genome assembly. This suggests that current MGP assemblies are unsuitable for the comprehensive documentation of germline IGHVs and more efforts will be needed to establish strain-specific reference sets.
Assuntos
Cadeias Pesadas de Imunoglobulinas , Região Variável de Imunoglobulina , Animais , Haplótipos , Cadeias Pesadas de Imunoglobulinas/genética , Região Variável de Imunoglobulina/genética , Camundongos , Camundongos Endogâmicos BALB C , Camundongos Endogâmicos C57BL , Análise de Sequência de DNARESUMO
Complete and accurate knowledge of the genes and allelic variants of the human immunoglobulin gene loci is critical for studies of B cell repertoire development and somatic point mutation, but evidence from studies of VDJ rearrangements suggests that our knowledge of the available immunoglobulin gene repertoire is far from complete. The reported repertoire has changed little over the last 15 years. This is, in part, a consequence of the inefficiencies involved in searching for new members of large, multigenic gene families by cloning and sequencing. The advent of high-throughput sequencing provides a new avenue by which the germline repertoire can be explored. In this report, we describe pyrosequencing studies of the heavy chain IGHV1, IGHV3 and IGHV4 gene subgroups in ten Papua New Guineans. Thousands of 454 reads aligned with complete identity to 51 previously reported functional IGHV genes and allelic variants. A new gene, IGHV3-NL1*01, was identified, which differs from the nearest previously reported gene by 15 nucleotides. Sixteen new IGHV alleles were also identified, 15 of which varied from previously reported functional IGHV genes by between one and four nucleotides, while one sequence appears to be a functional variant of the pseudogene IGHV3-25. BLAST searches suggest that at least six of these new genes are carried within the relatively well-studied populations of North America, Europe or Asia. This study substantially expands the known immunoglobulin gene repertoire and demonstrates that genetic variation of immunoglobulin genes can now be efficiently explored in different human populations using high-throughput pyrosequencing.