Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 12 de 12
Filter
Add more filters










Publication year range
1.
Front Immunol ; 10: 435, 2019.
Article in English | MEDLINE | ID: mdl-30936866

ABSTRACT

Immunoglobulins or antibodies are the main effector molecules of the B-cell lineage and are encoded by hundreds of variable (V), diversity (D), and joining (J) germline genes, which recombine to generate enormous IG diversity. Recently, high-throughput adaptive immune receptor repertoire sequencing (AIRR-seq) of recombined V-(D)-J genes has offered unprecedented insights into the dynamics of IG repertoires in health and disease. Faithful biological interpretation of AIRR-seq studies depends upon the annotation of raw AIRR-seq data, using reference germline gene databases to identify the germline genes within each rearrangement. Existing reference databases are incomplete, as shown by recent AIRR-seq studies that have inferred the existence of many previously unreported polymorphisms. Completing the documentation of genetic variation in germline gene databases is therefore of crucial importance. Lymphocyte receptor genes and alleles are currently assigned by the Immunoglobulins, T cell Receptors and Major Histocompatibility Nomenclature Subcommittee of the International Union of Immunological Societies (IUIS) and managed in IMGT®, the international ImMunoGeneTics information system® (IMGT). In 2017, the IMGT Group reached agreement with a group of AIRR-seq researchers on the principles of a streamlined process for identifying and naming inferred allelic sequences, for their incorporation into IMGT®. These researchers represented the AIRR Community, a network of over 300 researchers whose objective is to promote all aspects of immunoglobulin and T-cell receptor repertoire studies, including the standardization of experimental and computational aspects of AIRR-seq data generation and analysis. The Inferred Allele Review Committee (IARC) was established by the AIRR Community to devise policies, criteria, and procedures to perform this function. Formalized evaluations of novel inferred sequences have now begun and submissions are invited via a new dedicated portal (https://ogrdb.airr-community.org). Here, we summarize recommendations developed by the IARC-focusing, to begin with, on human IGHV genes-with the goal of facilitating the acceptance of inferred allelic variants of germline IGHV genes. We believe that this initiative will improve the quality of AIRR-seq studies by facilitating the description of human IG germline gene variation, and that in time, it will expand to the documentation of TR and IG genes in many vertebrate species.


Subject(s)
Alleles , Genes, Immunoglobulin , Genetic Variation/genetics , Terminology as Topic , V(D)J Recombination , Base Sequence , Databases, Genetic , Datasets as Topic , Gene Library , Germ-Line Mutation , High-Throughput Nucleotide Sequencing , Humans , Immunoglobulin Heavy Chains/genetics , Immunoglobulin Variable Region/genetics , Polymerase Chain Reaction/methods , Sequence Alignment , Sequence Homology, Nucleic Acid , VDJ Exons/genetics
2.
Front Immunol ; 9: 2206, 2018.
Article in English | MEDLINE | ID: mdl-30323809

ABSTRACT

Increased interest in the immune system's involvement in pathophysiological phenomena coupled with decreased DNA sequencing costs have led to an explosion of antibody and T cell receptor sequencing data collectively termed "adaptive immune receptor repertoire sequencing" (AIRR-seq or Rep-Seq). The AIRR Community has been actively working to standardize protocols, metadata, formats, APIs, and other guidelines to promote open and reproducible studies of the immune repertoire. In this paper, we describe the work of the AIRR Community's Data Representation Working Group to develop standardized data representations for storing and sharing annotated antibody and T cell receptor data. Our file format emphasizes ease-of-use, accessibility, scalability to large data sets, and a commitment to open and transparent science. It is composed of a tab-delimited format with a specific schema. Several popular repertoire analysis tools and data repositories already utilize this AIRR-seq data format. We hope that others will follow suit in the interest of promoting interoperable standards.


Subject(s)
Antibodies/genetics , Base Sequence , Database Management Systems , Information Dissemination/methods , Receptors, Antigen, T-Cell/genetics , Adaptive Immunity/genetics , Databases, Genetic , Datasets as Topic , High-Throughput Nucleotide Sequencing/economics , Humans , Receptors, Immunologic/genetics , Research Design
3.
Immunol Rev ; 284(1): 24-41, 2018 07.
Article in English | MEDLINE | ID: mdl-29944754

ABSTRACT

Next-generation sequencing allows the characterization of the adaptive immune receptor repertoire (AIRR) in exquisite detail. These large-scale AIRR-seq data sets have rapidly become critical to vaccine development, understanding the immune response in autoimmune and infectious disease, and monitoring novel therapeutics against cancer. However, at present there is no easy way to compare these AIRR-seq data sets across studies and institutions. The ability to combine and compare information for different disease conditions will greatly enhance the value of AIRR-seq data for improving biomedical research and patient care. The iReceptor Data Integration Platform (gateway.ireceptor.org) provides one implementation of the AIRR Data Commons envisioned by the AIRR Community (airr-community.org), an initiative that is developing protocols to facilitate sharing and comparing AIRR-seq data. The iReceptor Scientific Gateway links distributed (federated) AIRR-seq repositories, allowing sequence searches or metadata queries across multiple studies at multiple institutions, returning sets of sequences fulfilling specific criteria. We present a review of the development of iReceptor, and how it fits in with the general trend toward sharing genomic and health data, and the development of standards for describing and reporting AIRR-seq data. Researchers interested in integrating their repositories of AIRR-seq data into the iReceptor Platform are invited to contact support@ireceptor.org.


Subject(s)
Antibodies/genetics , Databases, Genetic , Information Dissemination/methods , Receptors, Antigen, B-Cell/genetics , Receptors, Antigen, T-Cell/genetics , Antibodies/immunology , Genomics/methods , High-Throughput Nucleotide Sequencing , Humans , Internet
5.
BMC Bioinformatics ; 17(Suppl 13): 333, 2016 Oct 06.
Article in English | MEDLINE | ID: mdl-27766961

ABSTRACT

BACKGROUND: The genes that produce antibodies and the immune receptors expressed on lymphocytes are not germline encoded; rather, they are somatically generated in each developing lymphocyte by a process called V(D)J recombination, which assembles specific, independent gene segments into mature composite genes. The full set of composite genes in an individual at a single point in time is referred to as the immune repertoire. V(D)J recombination is the distinguishing feature of adaptive immunity and enables effective immune responses against an essentially infinite array of antigens. Characterization of immune repertoires is critical in both basic research and clinical contexts. Recent technological advances in repertoire profiling via high-throughput sequencing have resulted in an explosion of research activity in the field. This has been accompanied by a proliferation of software tools for analysis of repertoire sequencing data. Despite the widespread use of immune repertoire profiling and analysis software, there is currently no standardized format for output files from V(D)J analysis. Researchers utilize software such as IgBLAST and IMGT/High V-QUEST to perform V(D)J analysis and infer the structure of germline rearrangements. However, each of these software tools produces results in a different file format, and can annotate the same result using different labels. These differences make it challenging for users to perform additional downstream analyses. RESULTS: To help address this problem, we propose a standardized file format for representing V(D)J analysis results. The proposed format, VDJML, provides a common standardized format for different V(D)J analysis applications to facilitate downstream processing of the results in an application-agnostic manner. The VDJML file format specification is accompanied by a support library, written in C++ and Python, for reading and writing the VDJML file format. CONCLUSIONS: The VDJML suite will allow users to streamline their V(D)J analysis and facilitate the sharing of scientific knowledge within the community. The VDJML suite and documentation are available from https://vdjserver.org/vdjml/ . We welcome participation from the community in developing the file format standard, as well as code contributions.


Subject(s)
Genomics/methods , Receptors, Immunologic/genetics , Software , V(D)J Recombination , Humans , Information Dissemination
6.
Stand Genomic Sci ; 5(2): 224-9, 2011 Nov 30.
Article in English | MEDLINE | ID: mdl-22180825

ABSTRACT

Genotyping experiments are widely used in clinical and basic research laboratories to identify associations between genetic variations and normal/abnormal phenotypes. Genotyping assay techniques vary from single genomic regions that are interrogated using PCR reactions to high throughput assays examining genome-wide sequence and structural variation. The resulting genotype data may include millions of markers of thousands of individuals, requiring various statistical, modeling or other data analysis methodologies to interpret the results. To date, there are no standards for reporting genotyping experiments. Here we present the Minimum Information about a Genotyping Experiment (MIGen) standard, defining the minimum information required for reporting genotyping experiments. MIGen standard covers experimental design, subject description, genotyping procedure, quality control and data analysis. MIGen is a registered project under MIBBI (Minimum Information for Biological and Biomedical Investigations) and is being developed by an interdisciplinary group of experts in basic biomedical science, clinical science, biostatistics and bioinformatics. To accommodate the wide variety of techniques and methodologies applied in current and future genotyping experiment, MIGen leverages foundational concepts from the Ontology for Biomedical Investigations (OBI) for the description of the various types of planned processes and implements a hierarchical document structure. The adoption of MIGen by the research community will facilitate consistent genotyping data interpretation and independent data validation. MIGen can also serve as a framework for the development of data models for capturing and storing genotyping results and experiment metadata in a structured way, to facilitate the exchange of metadata.

7.
Pac Symp Biocomput ; : 359-70, 2010.
Article in English | MEDLINE | ID: mdl-19908388

ABSTRACT

The immune response HLA class II DRB1 gene provides the major genetic contribution to Juvenile Idiopathic Arthritis (JIA), with a hierarchy of predisposing through intermediate to protective effects. With JIA, and the many other HLA associated diseases, it is difficult to identify the combinations of biologically relevant amino acid (AA) residues directly involved in disease due to the high level of HLA polymorphism, the pattern of AA variability, including varying degrees of linkage disequilibrium (LD), and the fact that most HLA variation occurs at functionally important sites. In a subset of JIA patients with the clinical phenotype oligoarticular-persistent (OP), we have applied a recently developed novel approach to genetic association analyses with genes/proteins sub-divided into biologically relevant smaller sequence features (SFs), and their "alleles" which are called variant types (VTs). With SFVT analysis, association tests are performed on variation at biologically relevant SFs based on structural (e.g., beta-strand 1) and functional (e.g., peptide binding site) features of the protein. We have extended the SFVT analysis pipeline to additionally include pairwise comparisons of DRB1 alleles within serogroup classes, our extension of the Salamon Unique Combinations algorithm, and LD patterns of AA variability to evaluate the SFVT results; all of which contributed additional complementary information. With JIA-OP, we identified a set of single AA SFs, and SFs in which they occur, particularly pockets of the peptide binding site, that account for the major disease risk attributable to HLA DRB1. These are (in numeric order): AAs 13 (pockets 4 and 6), 37 and 57 (both pocket 9), 67 (pocket 7), 74 (pocket 4), and 86 (pocket 1), and to a lesser extent 30 (pockets 6 and 7) and 71 (pockets 4, 5, and 7).


Subject(s)
Arthritis, Juvenile/genetics , Arthritis, Juvenile/immunology , HLA-DRB1 Chains/genetics , Case-Control Studies , Child , Computational Biology , Gene Frequency , Genetic Association Studies/statistics & numerical data , Genetic Predisposition to Disease , Genetic Variation , HLA-DRB1 Chains/chemistry , Haplotypes , Humans , Linkage Disequilibrium
8.
Hum Mol Genet ; 19(4): 707-19, 2010 Feb 15.
Article in English | MEDLINE | ID: mdl-19933168

ABSTRACT

We describe a novel approach to genetic association analyses with proteins sub-divided into biologically relevant smaller sequence features (SFs), and their variant types (VTs). SFVT analyses are particularly informative for study of highly polymorphic proteins such as the human leukocyte antigen (HLA), given the nature of its genetic variation: the high level of polymorphism, the pattern of amino acid variability, and that most HLA variation occurs at functionally important sites, as well as its known role in organ transplant rejection, autoimmune disease development and response to infection. Further, combinations of variable amino acid sites shared by several HLA alleles (shared epitopes) are most likely better descriptors of the actual causative genetic variants. In a cohort of systemic sclerosis patients/controls, SFVT analysis shows that a combination of SFs implicating specific amino acid residues in peptide binding pockets 4 and 7 of HLA-DRB1 explains much of the molecular determinant of risk.


Subject(s)
Genetic Variation , HLA Antigens/genetics , Scleroderma, Systemic/genetics , HLA Antigens/chemistry , HLA-DR Antigens/chemistry , HLA-DR Antigens/genetics , HLA-DRB1 Chains , Humans , Molecular Conformation
10.
Chem Commun (Camb) ; (5): 581-3, 2005 Feb 07.
Article in English | MEDLINE | ID: mdl-15672142

ABSTRACT

We report here a strategy for the photolithographic synthesis of diverse, spatially addressable arrays of cyclic peptides which employs a differential deprotection strategy for the combinatorial addition of side chains to a pre-fabricated cyclic core.


Subject(s)
Peptide Library , Peptides, Cyclic/chemical synthesis , Printing , Protein Array Analysis/methods , Ultraviolet Rays , Molecular Structure , Time Factors
11.
Biotechniques ; 37(4): 654-8, 660, 2004 Oct.
Article in English | MEDLINE | ID: mdl-15517977

ABSTRACT

There has been increasing interest and efforts devoted to developing biosensor technologies for identifying pathogens, particularly in the biothreat area. In this study, a universal set of short 12- and 13-mer oligonucleotide probes was derived independently of a priori genomic sequence information and used to generate unique species-dependent genomic hybridization signatures. The probe set sequences were algorithmically generated to be maximally distant in sequence space and not dependent on the sequence of any particular genome. The probe set is universally applicable because it is unbiased and independent of hybridization predictions based upon simplified assumptions regarding probe-target duplex formation from linear sequence analysis. Tests were conducted on microarrays containing 14,283 unique probes synthesized using an in situ light-directed synthesis methodology. The genomic DNA hybridization intensity patterns reproducibly differentiated various organisms (Bacillus subtilis, Yersinia pestis, Streptococcus pneumonia, Bacillus anthracis, and Homo sapiens), including the correct identification of a blinded "unknown" sample. Applications of this method include not only pathological and forensic genome identification in medicine and basic science, but also potentially a novel method for the discovery of unknown targets and associations inherent in dynamic nucleic acid populations such as represented by differential gene expression.


Subject(s)
DNA Probes , DNA, Bacterial/genetics , Gene Expression Profiling/methods , Oligonucleotide Array Sequence Analysis/instrumentation , Oligonucleotide Probes , Bacillus anthracis/genetics , Bacillus subtilis/genetics , Bioterrorism , Gene Expression Profiling/instrumentation , Molecular Sequence Data , Nucleic Acid Hybridization/methods , Oligonucleotide Array Sequence Analysis/methods , Streptococcus pneumoniae/genetics , Yersinia pestis/genetics
12.
J Am Chem Soc ; 126(13): 4088-9, 2004 Apr 07.
Article in English | MEDLINE | ID: mdl-15053581

ABSTRACT

We describe a novel photolithographic approach to the synthesis of peptoids (oligo-N-substituted glycines). This strategy enables the construction of a spatially addressable peptoid microarray, thus providing a potentially powerful tool for the discovery of protein ligands.

SELECTION OF CITATIONS
SEARCH DETAIL
...