Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 7 de 7
1.
Front Immunol ; 10: 129, 2019.
Article En | MEDLINE | ID: mdl-30814994

The adaptive immune receptor repertoire (AIRR) contains information on an individuals' immune past, present and potential in the form of the evolving sequences that encode the B cell receptor (BCR) repertoire. AIRR sequencing (AIRR-seq) studies rely on databases of known BCR germline variable (V), diversity (D), and joining (J) genes to detect somatic mutations in AIRR-seq data via comparison to the best-aligning database alleles. However, it has been shown that these databases are far from complete, leading to systematic misidentification of mutated positions in subsets of sample sequences. We previously presented TIgGER, a computational method to identify subject-specific V gene genotypes, including the presence of novel V gene alleles, directly from AIRR-seq data. However, the original algorithm was unable to detect alleles that differed by more than 5 single nucleotide polymorphisms (SNPs) from a database allele. Here we present and apply an improved version of the TIgGER algorithm which can detect alleles that differ by any number of SNPs from the nearest database allele, and can construct subject-specific genotypes with minimal prior information. TIgGER predictions are validated both computationally (using a leave-one-out strategy) and experimentally (using genomic sequencing), resulting in the addition of three new immunoglobulin heavy chain V (IGHV) gene alleles to the IMGT repertoire. Finally, we develop a Bayesian strategy to provide a confidence estimate associated with genotype calls. All together, these methods allow for much higher accuracy in germline allele assignment, an essential step in AIRR-seq studies.


Immunoglobulins/genetics , Algorithms , Alleles , Bayes Theorem , Genotype , Humans , Myasthenia Gravis/immunology , Sequence Analysis, DNA
2.
Nat Commun ; 7: 11112, 2016 Mar 23.
Article En | MEDLINE | ID: mdl-27005435

The adaptive immune system's capability to protect the body requires a highly diverse lymphocyte antigen receptor repertoire. However, the influence of individual genetic and epigenetic differences on these repertoires is not typically measured. By leveraging the unique characteristics of B, CD4(+) T and CD8(+) T-lymphocyte subsets from monozygotic twins, we quantify the impact of heritable factors on both the V(D)J recombination process and on thymic selection. We show that the resulting biases in both V(D)J usage and N/P addition lengths, which are found in naïve and antigen experienced cells, contribute to significant variation in the CDR3 region. Moreover, we show that the relative usage of V and J gene segments is chromosomally biased, with ∼1.5 times as many rearrangements originating from a single chromosome. These data refine our understanding of the heritable mechanisms affecting the repertoire, and show that biases are evident on a chromosome-wide level.


B-Lymphocytes/metabolism , CD4-Positive T-Lymphocytes/metabolism , CD8-Positive T-Lymphocytes/metabolism , Genes, Immunoglobulin/genetics , Genes, T-Cell Receptor/genetics , Twins, Monozygotic/genetics , V(D)J Recombination/genetics , Adaptive Immunity/genetics , Humans , Reverse Transcriptase Polymerase Chain Reaction
3.
Bioinformatics ; 31(20): 3356-8, 2015 Oct 15.
Article En | MEDLINE | ID: mdl-26069265

UNLABELLED: Advances in high-throughput sequencing technologies now allow for large-scale characterization of B cell immunoglobulin (Ig) repertoires. The high germline and somatic diversity of the Ig repertoire presents challenges for biologically meaningful analysis, which requires specialized computational methods. We have developed a suite of utilities, Change-O, which provides tools for advanced analyses of large-scale Ig repertoire sequencing data. Change-O includes tools for determining the complete set of Ig variable region gene segment alleles carried by an individual (including novel alleles), partitioning of Ig sequences into clonal populations, creating lineage trees, inferring somatic hypermutation targeting models, measuring repertoire diversity, quantifying selection pressure, and calculating sequence chemical properties. All Change-O tools utilize a common data format, which enables the seamless integration of multiple analyses into a single workflow. AVAILABILITY AND IMPLEMENTATION: Change-O is freely available for non-commercial use and may be downloaded from http://clip.med.yale.edu/changeo. CONTACT: steven.kleinstein@yale.edu.


B-Lymphocytes/chemistry , Gene Rearrangement, B-Lymphocyte , Genes, Immunoglobulin/genetics , High-Throughput Nucleotide Sequencing/methods , Immunoglobulin Variable Region/genetics , Mutation/genetics , Software , Alleles , Databases, Genetic , Humans
4.
Proc Natl Acad Sci U S A ; 112(8): E862-70, 2015 Feb 24.
Article En | MEDLINE | ID: mdl-25675496

Individual variation in germline and expressed B-cell immunoglobulin (Ig) repertoires has been associated with aging, disease susceptibility, and differential response to infection and vaccination. Repertoire properties can now be studied at large-scale through next-generation sequencing of rearranged Ig genes. Accurate analysis of these repertoire-sequencing (Rep-Seq) data requires identifying the germline variable (V), diversity (D), and joining (J) gene segments used by each Ig sequence. Current V(D)J assignment methods work by aligning sequences to a database of known germline V(D)J segment alleles. However, existing databases are likely to be incomplete and novel polymorphisms are hard to differentiate from the frequent occurrence of somatic hypermutations in Ig sequences. Here we develop a Tool for Ig Genotype Elucidation via Rep-Seq (TIgGER). TIgGER analyzes mutation patterns in Rep-Seq data to identify novel V segment alleles, and also constructs a personalized germline database containing the specific set of alleles carried by a subject. This information is then used to improve the initial V segment assignments from existing tools, like IMGT/HighV-QUEST. The application of TIgGER to Rep-Seq data from seven subjects identified 11 novel V segment alleles, including at least one in every subject examined. These novel alleles constituted 13% of the total number of unique alleles in these subjects, and impacted 3% of V(D)J segment assignments. These results reinforce the highly polymorphic nature of human Ig V genes, and suggest that many novel alleles remain to be discovered. The integration of TIgGER into Rep-Seq processing pipelines will increase the accuracy of V segment assignments, thus improving B-cell repertoire analyses.


Alleles , Automation , B-Lymphocytes/metabolism , Genes, Immunoglobulin , High-Throughput Nucleotide Sequencing/methods , Immunoglobulin Variable Region/genetics , Base Sequence , Databases, Genetic , Gene Rearrangement, B-Lymphocyte , Genotype , Humans , Mutation/genetics , Mutation Rate , Polymorphism, Genetic , Software , V(D)J Recombination/genetics
5.
Proc Natl Acad Sci U S A ; 111(13): 4928-33, 2014 Apr 01.
Article En | MEDLINE | ID: mdl-24639495

The adaptive immune system confers protection by generating a diverse repertoire of antibody receptors that are rapidly expanded and contracted in response to specific targets. Next-generation DNA sequencing now provides the opportunity to survey this complex and vast repertoire. In the present work, we describe a set of tools for the analysis of antibody repertoires and their application to elucidating the dynamics of the response to viral vaccination in human volunteers. By analyzing data from 38 separate blood samples across 2 y, we found that the use of the germ-line library of V and J segments is conserved between individuals over time. Surprisingly, there appeared to be no correlation between the use level of a particular VJ combination and degree of expansion. We found the antibody RNA repertoire in each volunteer to be highly dynamic, with each individual displaying qualitatively different response dynamics. By using combinatorial phage display, we screened selected VH genes paired with their corresponding VL library for affinity against the vaccine antigens. Altogether, this work presents an additional set of tools for profiling the human antibody repertoire and demonstrates characterization of the fast repertoire dynamics through time in multiple individuals responding to an immune challenge.


Antibodies/immunology , Immunity/immunology , Viral Vaccines/immunology , Clone Cells , Genetic Vectors , Healthy Volunteers , Humans , Immunoglobulin Variable Region/genetics , Male , Mutation/genetics , Reproducibility of Results , Time Factors , V(D)J Recombination/genetics , Vaccination
6.
Front Immunol ; 4: 358, 2013.
Article En | MEDLINE | ID: mdl-24298272

Analyses of somatic hypermutation (SHM) patterns in B cell immunoglobulin (Ig) sequences contribute to our basic understanding of adaptive immunity, and have broad applications not only for understanding the immune response to pathogens, but also to determining the role of SHM in autoimmunity and B cell cancers. Although stochastic, SHM displays intrinsic biases that can confound statistical analysis, especially when combined with the particular codon usage and base composition in Ig sequences. Analysis of B cell clonal expansion, diversification, and selection processes thus critically depends on an accurate background model for SHM micro-sequence targeting (i.e., hot/cold-spots) and nucleotide substitution. Existing models are based on small numbers of sequences/mutations, in part because they depend on data from non-coding regions or non-functional sequences to remove the confounding influences of selection. Here, we combine high-throughput Ig sequencing with new computational analysis methods to produce improved models of SHM targeting and substitution that are based only on synonymous mutations, and are thus independent of selection. The resulting "S5F" models are based on 806,860 Synonymous mutations in 5-mer motifs from 1,145,182 Functional sequences and account for dependencies on the adjacent four nucleotides (two bases upstream and downstream of the mutation). The estimated profiles can explain almost half of the variance in observed mutation patterns, and clearly show that both mutation targeting and substitution are significantly influenced by neighboring bases. While mutability and substitution profiles were highly conserved across individuals, the variability across motifs was found to be much larger than previously estimated. The model and method source code are made available at http://clip.med.yale.edu/SHM.

7.
Genetics ; 195(1): 275-87, 2013 Sep.
Article En | MEDLINE | ID: mdl-23852385

Whole-genome sequencing, particularly in fungi, has progressed at a tremendous rate. More difficult, however, is experimental testing of the inferences about gene function that can be drawn from comparative sequence analysis alone. We present a genome-wide functional characterization of a sequenced but experimentally understudied budding yeast, Saccharomyces bayanus var. uvarum (henceforth referred to as S. bayanus), allowing us to map changes over the 20 million years that separate this organism from S. cerevisiae. We first created a suite of genetic tools to facilitate work in S. bayanus. Next, we measured the gene-expression response of S. bayanus to a diverse set of perturbations optimized using a computational approach to cover a diverse array of functionally relevant biological responses. The resulting data set reveals that gene-expression patterns are largely conserved, but significant changes may exist in regulatory networks such as carbohydrate utilization and meiosis. In addition to regulatory changes, our approach identified gene functions that have diverged. The functions of genes in core pathways are highly conserved, but we observed many changes in which genes are involved in osmotic stress, peroxisome biogenesis, and autophagy. A surprising number of genes specific to S. bayanus respond to oxidative stress, suggesting the organism may have evolved under different selection pressures than S. cerevisiae. This work expands the scope of genome-scale evolutionary studies from sequence-based analysis to rapid experimental characterization and could be adopted for functional mapping in any lineage of interest. Furthermore, our detailed characterization of S. bayanus provides a valuable resource for comparative functional genomics studies in yeast.


Genome, Fungal , Saccharomyces/genetics , Fungal Proteins/genetics , Fungal Proteins/metabolism , Gene Expression Profiling , Molecular Sequence Annotation , Oxidative Stress , Saccharomyces/metabolism
...