Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 321
Filtrar
1.
Nat Immunol ; 25(6): 1073-1082, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38816615

RESUMO

A key barrier to the development of vaccines that induce broadly neutralizing antibodies (bnAbs) against human immunodeficiency virus (HIV) and other viruses of high antigenic diversity is the design of priming immunogens that induce rare bnAb-precursor B cells. The high neutralization breadth of the HIV bnAb 10E8 makes elicitation of 10E8-class bnAbs desirable; however, the recessed epitope within gp41 makes envelope trimers poor priming immunogens and requires that 10E8-class bnAbs possess a long heavy chain complementarity determining region 3 (HCDR3) with a specific binding motif. We developed germline-targeting epitope scaffolds with affinity for 10E8-class precursors and engineered nanoparticles for multivalent display. Scaffolds exhibited epitope structural mimicry and bound bnAb-precursor human naive B cells in ex vivo screens, protein nanoparticles induced bnAb-precursor responses in stringent mouse models and rhesus macaques, and mRNA-encoded nanoparticles triggered similar responses in mice. Thus, germline-targeting epitope scaffold nanoparticles can elicit rare bnAb-precursor B cells with predefined binding specificities and HCDR3 features.


Assuntos
Vacinas contra a AIDS , Anticorpos Neutralizantes , Anticorpos Anti-HIV , Proteína gp41 do Envelope de HIV , Infecções por HIV , HIV-1 , Macaca mulatta , Animais , Humanos , Proteína gp41 do Envelope de HIV/imunologia , Anticorpos Anti-HIV/imunologia , Camundongos , Vacinas contra a AIDS/imunologia , Anticorpos Neutralizantes/imunologia , HIV-1/imunologia , Infecções por HIV/imunologia , Infecções por HIV/prevenção & controle , Infecções por HIV/virologia , Vacinação , Anticorpos Amplamente Neutralizantes/imunologia , Linfócitos B/imunologia , Nanopartículas/química , Feminino , Regiões Determinantes de Complementaridade/imunologia , Epitopos/imunologia
2.
Cell ; 177(5): 1153-1171.e28, 2019 05 16.
Artigo em Inglês | MEDLINE | ID: mdl-31080066

RESUMO

Conventional immunization strategies will likely be insufficient for the development of a broadly neutralizing antibody (bnAb) vaccine for HIV or other difficult pathogens because of the immunological hurdles posed, including B cell immunodominance and germinal center (GC) quantity and quality. We found that two independent methods of slow delivery immunization of rhesus monkeys (RMs) resulted in more robust T follicular helper (TFH) cell responses and GC B cells with improved Env-binding, tracked by longitudinal fine needle aspirates. Improved GCs correlated with the development of >20-fold higher titers of autologous nAbs. Using a new RM genomic immunoglobulin locus reference, we identified differential IgV gene use between immunization modalities. Ab mapping demonstrated targeting of immunodominant non-neutralizing epitopes by conventional bolus-immunized animals, whereas slow delivery-immunized animals targeted a more diverse set of epitopes. Thus, alternative immunization strategies can enhance nAb development by altering GCs and modulating the immunodominance of non-neutralizing epitopes.


Assuntos
Anticorpos Neutralizantes/imunologia , Linfócitos B/imunologia , Centro Germinativo/imunologia , Anticorpos Anti-HIV/imunologia , HIV-1/imunologia , Imunização Passiva , Linfócitos T Auxiliares-Indutores/imunologia , Animais , Linfócitos B/patologia , Feminino , Centro Germinativo/patologia , Centro Germinativo/virologia , Macaca mulatta , Masculino , Linfócitos T Auxiliares-Indutores/patologia , Produtos do Gene env do Vírus da Imunodeficiência Humana/imunologia
5.
Proc Natl Acad Sci U S A ; 121(11): e2321700121, 2024 Mar 12.
Artigo em Inglês | MEDLINE | ID: mdl-38442159

RESUMO

Ribosomes are often used in synthetic biology as a tool to produce desired proteins with enhanced properties or entirely new functions. However, repurposing ribosomes for producing designer proteins is challenging due to the limited number of engineering solutions available to alter the natural activity of these enzymes. In this study, we advance ribosome engineering by describing a novel strategy based on functional fusions of ribosomal RNA (rRNA) with messenger RNA (mRNA). Specifically, we create an mRNA-ribosome fusion called RiboU, where the 16S rRNA is covalently attached to selenocysteine insertion sequence (SECIS), a regulatory RNA element found in mRNAs encoding selenoproteins. When SECIS sequences are present in natural mRNAs, they instruct ribosomes to decode UGA codons as selenocysteine (Sec, U) codons instead of interpreting them as stop codons. This enables ribosomes to insert Sec into the growing polypeptide chain at the appropriate site. Our work demonstrates that the SECIS sequence maintains its functionality even when inserted into the ribosome structure. As a result, the engineered ribosomes RiboU interpret UAG codons as Sec codons, allowing easy and site-specific insertion of Sec in a protein of interest with no further modification to the natural machinery of protein synthesis. To validate this approach, we use RiboU ribosomes to produce three functional target selenoproteins in Escherichia coli by site-specifically inserting Sec into the proteins' active sites. Overall, our work demonstrates the feasibility of creating functional mRNA-rRNA fusions as a strategy for ribosome engineering, providing a novel tool for producing Sec-containing proteins in live bacterial cells.


Assuntos
Magnoliopsida , Selenocisteína , RNA Mensageiro/genética , RNA Ribossômico 16S , Selenoproteínas/genética , Ribossomos/genética , Códon de Terminação/genética , Escherichia coli/genética
6.
Trends Immunol ; 44(1): 7-21, 2023 01.
Artigo em Inglês | MEDLINE | ID: mdl-36470826

RESUMO

The recombination between immunoglobulin (IG) gene segments determines an individual's naïve antibody repertoire and, consequently, (auto)antigen recognition. Emerging evidence suggests that mammalian IG germline variation impacts humoral immune responses associated with vaccination, infection, and autoimmunity - from the molecular level of epitope specificity, up to profound changes in the architecture of antibody repertoires. These links between IG germline variants and immunophenotype raise the question on the evolutionary causes and consequences of diversity within IG loci. We discuss why the extreme diversity in IG loci remains a mystery, why resolving this is important for the design of more effective vaccines and therapeutics, and how recent evidence from multiple lines of inquiry may help us do so.


Assuntos
Genes de Imunoglobulinas , Mutação em Linhagem Germinativa , Animais , Humanos , Genes de Imunoglobulinas/genética , Imunidade Humoral/genética , Evolução Biológica , Células Germinativas , Mamíferos
7.
J Immunol ; 2024 Jul 15.
Artigo em Inglês | MEDLINE | ID: mdl-39007649

RESUMO

The expressed Ab repertoire is a critical determinant of immune-related phenotypes. Ab-encoding transcripts are distinct from other expressed genes because they are transcribed from somatically rearranged gene segments. Human Abs are composed of two identical H and L chain polypeptides derived from genes in IGH locus and one of two L chain loci. The combinatorial diversity that results from Ab gene rearrangement and the pairing of different H and L chains contributes to the immense diversity of the baseline Ab repertoire. During rearrangement, Ab gene selection is mediated by factors that influence chromatin architecture, promoter/enhancer activity, and V(D)J recombination. Interindividual variation in the composition of the Ab repertoire associates with germline variation in IGH, implicating polymorphism in Ab gene regulation. Determining how IGH variants directly mediate gene regulation will require integration of these variants with other functional genomic datasets. In this study, we argue that standard approaches using short reads have limited utility for characterizing regulatory regions in IGH at haplotype resolution. Using simulated and chromatin immunoprecipitation sequencing reads, we define features of IGH that limit use of short reads and a single reference genome, namely 1) the highly duplicated nature of the DNA sequence in IGH and 2) structural polymorphisms that are frequent in the population. We demonstrate that personalized diploid references enhance performance of short-read data for characterizing mappable portions of the locus, while also showing that long-read profiling tools will ultimately be needed to fully resolve functional impacts of IGH germline variation on expressed Ab repertoires.

8.
Am J Hum Genet ; 109(6): 1065-1076, 2022 06 02.
Artigo em Inglês | MEDLINE | ID: mdl-35609568

RESUMO

The human genome contains tens of thousands of large tandem repeats and hundreds of genes that show common and highly variable copy-number changes. Due to their large size and repetitive nature, these variable number tandem repeats (VNTRs) and multicopy genes are generally recalcitrant to standard genotyping approaches and, as a result, this class of variation is poorly characterized. However, several recent studies have demonstrated that copy-number variation of VNTRs can modify local gene expression, epigenetics, and human traits, indicating that many have a functional role. Here, using read depth from whole-genome sequencing to profile copy number, we report results of a phenome-wide association study (PheWAS) of VNTRs and multicopy genes in a discovery cohort of ∼35,000 samples, identifying 32 traits associated with copy number of 38 VNTRs and multicopy genes at 1% FDR. We replicated many of these signals in an independent cohort and observed that VNTRs showing trait associations were significantly enriched for expression QTLs with nearby genes, providing strong support for our results. Fine-mapping studies indicated that in the majority (∼90%) of cases, the VNTRs and multicopy genes we identified represent the causal variants underlying the observed associations. Furthermore, several lie in regions where prior SNV-based GWASs have failed to identify any significant associations with these traits. Our study indicates that copy number of VNTRs and multicopy genes contributes to diverse human traits and suggests that complex structural variants potentially explain some of the so-called "missing heritability" of SNV-based GWASs.


Assuntos
Variações do Número de Cópias de DNA , Repetições Minissatélites , Variações do Número de Cópias de DNA/genética , Genoma Humano , Estudo de Associação Genômica Ampla , Humanos , Repetições Minissatélites/genética , Fenótipo
9.
J Immunol ; 210(10): 1607-1619, 2023 05 15.
Artigo em Inglês | MEDLINE | ID: mdl-37027017

RESUMO

Current Adaptive Immune Receptor Repertoire sequencing (AIRR-seq) using short-read sequencing strategies resolve expressed Ab transcripts with limited resolution of the C region. In this article, we present the near-full-length AIRR-seq (FLAIRR-seq) method that uses targeted amplification by 5' RACE, combined with single-molecule, real-time sequencing to generate highly accurate (99.99%) human Ab H chain transcripts. FLAIRR-seq was benchmarked by comparing H chain V (IGHV), D (IGHD), and J (IGHJ) gene usage, complementarity-determining region 3 length, and somatic hypermutation to matched datasets generated with standard 5' RACE AIRR-seq using short-read sequencing and full-length isoform sequencing. Together, these data demonstrate robust FLAIRR-seq performance using RNA samples derived from PBMCs, purified B cells, and whole blood, which recapitulated results generated by commonly used methods, while additionally resolving H chain gene features not documented in IMGT at the time of submission. FLAIRR-seq data provide, for the first time, to our knowledge, simultaneous single-molecule characterization of IGHV, IGHD, IGHJ, and IGHC region genes and alleles, allele-resolved subisotype definition, and high-resolution identification of class switch recombination within a clonal lineage. In conjunction with genomic sequencing and genotyping of IGHC genes, FLAIRR-seq of the IgM and IgG repertoires from 10 individuals resulted in the identification of 32 unique IGHC alleles, 28 (87%) of which were previously uncharacterized. Together, these data demonstrate the capabilities of FLAIRR-seq to characterize IGHV, IGHD, IGHJ, and IGHC gene diversity for the most comprehensive view of bulk-expressed Ab repertoires to date.


Assuntos
Regiões Determinantes de Complementaridade , Humanos , Regiões Determinantes de Complementaridade/genética , Sequência de Bases
10.
Nucleic Acids Res ; 51(16): e86, 2023 09 08.
Artigo em Inglês | MEDLINE | ID: mdl-37548401

RESUMO

In adaptive immune receptor repertoire analysis, determining the germline variable (V) allele associated with each T- and B-cell receptor sequence is a crucial step. This process is highly impacted by allele annotations. Aligning sequences, assigning them to specific germline alleles, and inferring individual genotypes are challenging when the repertoire is highly mutated, or sequence reads do not cover the whole V region. Here, we propose an alternative naming scheme for the V alleles, as well as a novel method to infer individual genotypes. We demonstrate the strengths of the two by comparing their outcomes to other genotype inference methods. We validate the genotype approach with independent genomic long-read data. The naming scheme is compatible with current annotation tools and pipelines. Analysis results can be converted from the proposed naming scheme to the nomenclature determined by the International Union of Immunological Societies (IUIS). Both the naming scheme and the genotype procedure are implemented in a freely available R package (PIgLET https://bitbucket.org/yaarilab/piglet). To allow researchers to further explore the approach on real data and to adapt it for their uses, we also created an interactive website (https://yaarilab.github.io/IGHV_reference_book).


Assuntos
Genômica , Cadeias Pesadas de Imunoglobulinas , Receptores de Antígenos de Linfócitos B , Alelos , Genótipo , Receptores de Antígenos de Linfócitos B/genética , Cadeias Pesadas de Imunoglobulinas/genética
11.
Genes Immun ; 2024 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-38844673

RESUMO

Immunoglobulins (IGs), critical components of the human immune system, are composed of heavy and light protein chains encoded at three genomic loci. The IG Kappa (IGK) chain locus consists of two large, inverted segmental duplications. The complexity of the IG loci has hindered use of standard high-throughput methods for characterizing genetic variation within these regions. To overcome these limitations, we use long-read sequencing to create haplotype-resolved IGK assemblies in an ancestrally diverse cohort (n = 36), representing the first comprehensive description of IGK haplotype variation. We identify extensive locus polymorphism, including novel single nucleotide variants (SNVs) and novel structural variants harboring functional IGKV genes. Among 47 functional IGKV genes, we identify 145 alleles, 67 of which were not previously curated. We report inter-population differences in allele frequencies for 10 IGKV genes, including alleles unique to specific populations within this dataset. We identify haplotypes carrying signatures of gene conversion that associate with SNV enrichment in the IGK distal region, and a haplotype with an inversion spanning the proximal and distal regions. These data provide a critical resource of curated genomic reference information from diverse ancestries, laying a foundation for advancing our understanding of population-level genetic variation in the IGK locus.

12.
J Biol Chem ; 299(7): 104852, 2023 07.
Artigo em Inglês | MEDLINE | ID: mdl-37224963

RESUMO

The correct coupling of amino acids with transfer RNAs (tRNAs) is vital for translating genetic information into functional proteins. Errors during this process lead to mistranslation, where a codon is translated using the wrong amino acid. While unregulated and prolonged mistranslation is often toxic, growing evidence suggests that organisms, from bacteria to humans, can induce and use mistranslation as a mechanism to overcome unfavorable environmental conditions. Most known cases of mistranslation are caused by translation factors with poor substrate specificity or when substrate discrimination is sensitive to molecular changes such as mutations or posttranslational modifications. Here we report two novel families of tRNAs, encoded by bacteria from the Streptomyces and Kitasatospora genera, that adopted dual identities by integrating the anticodons AUU (for Asn) or AGU (for Thr) into the structure of a distinct proline tRNA. These tRNAs are typically encoded next to a full-length or truncated version of a distinct isoform of bacterial-type prolyl-tRNA synthetase. Using two protein reporters, we showed that these tRNAs translate asparagine and threonine codons with proline. Moreover, when expressed in Escherichia coli, the tRNAs cause varying growth defects due to global Asn-to-Pro and Thr-to-Pro mutations. Yet, proteome-wide substitutions of Asn with Pro induced by tRNA expression increased cell tolerance to the antibiotic carbenicillin, indicating that Pro mistranslation can be beneficial under certain conditions. Collectively, our results significantly expand the catalog of organisms known to possess dedicated mistranslation machinery and support the concept that mistranslation is a mechanism for cellular resiliency against environmental stress.


Assuntos
Código Genético , Biossíntese de Proteínas , RNA de Transferência , Humanos , Aminoácidos/metabolismo , Códon/metabolismo , Escherichia coli/genética , Escherichia coli/metabolismo , Prolina/metabolismo , Biossíntese de Proteínas/genética , Proteínas/metabolismo , RNA de Transferência/genética , RNA de Transferência/metabolismo , Treonina/metabolismo , Streptomyces/genética , Mutação , Proteoma
13.
Am J Hum Genet ; 108(5): 809-824, 2021 05 06.
Artigo em Inglês | MEDLINE | ID: mdl-33794196

RESUMO

Variable number tandem repeats (VNTRs) are composed of large tandemly repeated motifs, many of which are highly polymorphic in copy number. However, because of their large size and repetitive nature, they remain poorly studied. To investigate the regulatory potential of VNTRs, we used read-depth data from Illumina whole-genome sequencing to perform association analysis between copy number of ∼70,000 VNTRs (motif size ≥ 10 bp) with both gene expression (404 samples in 48 tissues) and DNA methylation (235 samples in peripheral blood), identifying thousands of VNTRs that are associated with local gene expression (eVNTRs) and DNA methylation levels (mVNTRs). Using an independent cohort, we validated 73%-80% of signals observed in the two discovery cohorts, while allelic analysis of VNTR length and CpG methylation in 30 Oxford Nanopore genomes gave additional support for mVNTR loci, thus providing robust evidence to support that these represent genuine associations. Further, conditional analysis indicated that many eVNTRs and mVNTRs act as QTLs independently of other local variation. We also observed strong enrichments of eVNTRs and mVNTRs for regulatory features such as enhancers and promoters. Using the Human Genome Diversity Panel, we define sets of VNTRs that show highly divergent copy numbers among human populations and show that these are enriched for regulatory effects and preferentially associate with genes that have been linked with human phenotypes through GWASs. Our study provides strong evidence supporting functional variation at thousands of VNTRs and defines candidate sets of VNTRs, copy number variation of which potentially plays a role in numerous human phenotypes.


Assuntos
Variações do Número de Cópias de DNA/genética , Metilação de DNA , Regulação da Expressão Gênica , Repetições Minissatélites/genética , Locos de Características Quantitativas/genética , Adolescente , Adulto , Algoritmos , Criança , Pré-Escolar , Cromossomos Humanos X/genética , Estudos de Coortes , Ilhas de CpG/genética , Elementos Facilitadores Genéticos/genética , Feminino , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Lactente , Recém-Nascido , Masculino , Pessoa de Meia-Idade , Fenótipo , Regiões Promotoras Genéticas/genética , Adulto Jovem
14.
Nucleic Acids Res ; 50(18): 10201-10211, 2022 10 14.
Artigo em Inglês | MEDLINE | ID: mdl-35882385

RESUMO

Ribosomes are remarkable in their malleability to accept diverse aminoacyl-tRNA substrates from both the same organism and other organisms or domains of life. This is a critical feature of the ribosome that allows the use of orthogonal translation systems for genetic code expansion. Optimization of these orthogonal translation systems generally involves focusing on the compatibility of the tRNA, aminoacyl-tRNA synthetase, and a non-canonical amino acid with each other. As we expand the diversity of tRNAs used to include non-canonical structures, the question arises as to the tRNA suitability on the ribosome. Specifically, we investigated the ribosomal translation of allo-tRNAUTu1, a uniquely shaped (9/3) tRNA exploited for site-specific selenocysteine insertion, using single-molecule fluorescence. With this technique we identified ribosomal disassembly occurring from translocation of allo-tRNAUTu1 from the A to the P site. Using cryo-EM to capture the tRNA on the ribosome, we pinpointed a distinct tertiary interaction preventing fluid translocation. Through a single nucleotide mutation, we disrupted this tertiary interaction and relieved the translation roadblock. With the continued diversification of genetic code expansion, our work highlights a targeted approach to optimize translation by distinct tRNAs as they move through the ribosome.


Continued expansion of the genetic code has required the use of synthetic tRNAs for decoding. Some of these synthetic tRNAs have unique structural features that are not observed in canonical tRNAs. Here, the authors applied single-molecule, biochemical and structural methods to determine whether these distinct features were deleterious for efficient protein translation on the ribosome. With a focus on selenocysteine insertion, the authors explored an allo-tRNA with a 9/3 acceptor domain. They observed a translational roadblock that occurred in A to P site tRNA translocation. This block was mediated by a tertiary interaction across the tRNA core, directing the variable arm position into an unfavorable conformation. A single-nucleotide mutation disrupted this interaction, providing flexibility in the variable arm and promoting efficient protein production.


Assuntos
Biossíntese de Proteínas , RNA de Transferência/ultraestrutura , Ribossomos/ultraestrutura , Aminoácidos/genética , Aminoacil-tRNA Sintetases/genética , Nucleotídeos/metabolismo , RNA de Transferência/metabolismo , Ribossomos/metabolismo , Selenocisteína/química
15.
Nucleic Acids Res ; 50(8): 4601-4615, 2022 05 06.
Artigo em Inglês | MEDLINE | ID: mdl-35466371

RESUMO

Site-specific incorporation of distinct non-canonical amino acids into proteins via genetic code expansion requires mutually orthogonal aminoacyl-tRNA synthetase/tRNA pairs. Pyrrolysyl-tRNA synthetase (PylRS)/tRNAPyl pairs are ideal for genetic code expansion and have been extensively engineered for developing mutually orthogonal pairs. Here, we identify two novel wild-type PylRS/tRNAPyl pairs simultaneously present in the deep-rooted extremely halophilic euryarchaeal methanogen Candidatus Methanohalarchaeum thermophilum HMET1, and show that both pairs are functional in the model halophilic archaeon Haloferax volcanii. These pairs consist of two different PylRS enzymes and two distinct tRNAs with dissimilar discriminator bases. Surprisingly, these two PylRS/tRNAPyl pairs display mutual orthogonality enabled by two unique features, the A73 discriminator base of tRNAPyl2 and a shorter motif 2 loop in PylRS2. In vivo translation experiments show that tRNAPyl2 charging by PylRS2 is defined by the enzyme's shortened motif 2 loop. Finally, we demonstrate that the two HMET1 PylRS/tRNAPyl pairs can simultaneously decode UAG and UAA codons for incorporation of two distinct noncanonical amino acids into protein. This example of a single base change in a tRNA leading to additional coding capacity suggests that the growth of the genetic code is not yet limited by the number of identity elements fitting into the tRNA structure.


Assuntos
Aminoacil-tRNA Sintetases , Euryarchaeota , Aminoacil-tRNA Sintetases/metabolismo , Lisina/metabolismo , RNA de Transferência/genética , RNA de Transferência/metabolismo , Código Genético , Euryarchaeota/genética , Aminoácidos/genética
16.
Proc Natl Acad Sci U S A ; 118(35)2021 08 31.
Artigo em Inglês | MEDLINE | ID: mdl-34413202

RESUMO

Inaccurate expression of the genetic code, also known as mistranslation, is an emerging paradigm in microbial studies. Growing evidence suggests that many microbial pathogens can deliberately mistranslate their genetic code to help invade a host or evade host immune responses. However, discovering different capacities for deliberate mistranslation remains a challenge because each group of pathogens typically employs a unique mistranslation mechanism. In this study, we address this problem by studying duplicated genes of aminoacyl-transfer RNA (tRNA) synthetases. Using bacterial prolyl-tRNA synthetase (ProRS) genes as an example, we identify an anomalous ProRS isoform, ProRSx, and a corresponding tRNA, tRNAProA, that are predominately found in plant pathogens from Streptomyces species. We then show that tRNAProA has an unusual hybrid structure that allows this tRNA to mistranslate alanine codons as proline. Finally, we provide biochemical, genetic, and mass spectrometric evidence that cells which express ProRSx and tRNAProA can translate GCU alanine codons as both alanine and proline. This dual use of alanine codons creates a hidden proteome diversity due to stochastic Ala→Pro mutations in protein sequences. Thus, we show that important plant pathogens are equipped with a tool to alter the identity of their sense codons. This finding reveals the initial example of a natural tRNA synthetase/tRNA pair for dedicated mistranslation of sense codons.


Assuntos
Aminoacil-tRNA Sintetases/metabolismo , Códon , Escherichia coli/metabolismo , Código Genético , Biossíntese de Proteínas , Aminoacil-RNA de Transferência/metabolismo , Streptomyces/metabolismo , Alanina/genética , Alanina/metabolismo , Sequência de Aminoácidos , Aminoacil-tRNA Sintetases/genética , Escherichia coli/genética , Escherichia coli/crescimento & desenvolvimento , Prolina/genética , Prolina/metabolismo , Aminoacil-RNA de Transferência/genética , Homologia de Sequência , Streptomyces/genética , Streptomyces/crescimento & desenvolvimento , Especificidade por Substrato
17.
Genes Immun ; 24(1): 21-31, 2023 02.
Artigo em Inglês | MEDLINE | ID: mdl-36539592

RESUMO

Immunoglobulins (IGs), crucial components of the adaptive immune system, are encoded by three genomic loci. However, the complexity of the IG loci severely limits the effective use of short read sequencing, limiting our knowledge of population diversity in these loci. We leveraged existing long read whole-genome sequencing (WGS) data, fosmid technology, and IG targeted single-molecule, real-time (SMRT) long-read sequencing (IG-Cap) to create haplotype-resolved assemblies of the IG Lambda (IGL) locus from 6 ethnically diverse individuals. In addition, we generated 10 diploid assemblies of IGL from a diverse cohort of individuals utilizing IG-Cap. From these 16 individuals, we identified significant allelic diversity, including 36 novel IGLV alleles. In addition, we observed highly elevated single nucleotide variation (SNV) in IGLV genes relative to IGL intergenic and genomic background SNV density. By comparing SNV calls between our high quality assemblies and existing short read datasets from the same individuals, we show a high propensity for false-positives in the short read datasets. Finally, for the first time, we nucleotide-resolved common 5-10 Kb duplications in the IGLC region that contain functional IGLJ and IGLC genes. Together these data represent a significant advancement in our understanding of genetic variation and population diversity in the IGL locus.


Assuntos
Genes de Imunoglobulinas , Cadeias lambda de Imunoglobulina , Humanos , Cadeias lambda de Imunoglobulina/genética , Genômica , Variação Genética , Nucleotídeos
18.
BMC Bioinformatics ; 24(1): 403, 2023 Oct 27.
Artigo em Inglês | MEDLINE | ID: mdl-37891497

RESUMO

BACKGROUND: Quality control of DNA sequences is an important data preprocessing step in many genomic analyses. However, all existing parallel tools for this purpose are based on a batch processing model, needing to have the complete genetic dataset before processing can even begin. This limitation clearly hinders quality control performance in those scenarios where the dataset must be downloaded from a remote repository and/or copied to a distributed file system for its parallel processing. RESULTS: In this paper we present SeQual-Stream, a streaming tool that allows performing multiple quality control operations on genomic datasets in a fast, distributed and scalable way. To do so, our approach relies on the Apache Spark framework and the Hadoop Distributed File System (HDFS) to fully exploit the stream paradigm and accelerate the preprocessing of large datasets as they are being downloaded and/or copied to HDFS. The experimental results have shown significant improvements in the execution times of SeQual-Stream when compared to a batch processing tool with similar quality control features, providing a maximum speedup of 2.7[Formula: see text] when processing a dataset with more than 250 million DNA sequences, while also demonstrating good scalability features. CONCLUSION: Our solution provides a more scalable and higher performance way to carry out quality control of large genomic datasets by taking advantage of stream processing features. The tool is distributed as free open-source software released under the GNU AGPLv3 license and is publicly available to download at https://github.com/UDC-GAC/SeQual-Stream .


Assuntos
Genômica , Software , Genômica/métodos , Genoma , Sequência de Bases , Algoritmos , Sequenciamento de Nucleotídeos em Larga Escala/métodos
19.
Am J Hum Genet ; 107(4): 654-669, 2020 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-32937144

RESUMO

There is growing recognition that epivariations, most often recognized as promoter hypermethylation events that lead to gene silencing, are associated with a number of human diseases. However, little information exists on the prevalence and distribution of rare epigenetic variation in the human population. In order to address this, we performed a survey of methylation profiles from 23,116 individuals using the Illumina 450k array. Using a robust outlier approach, we identified 4,452 unique autosomal epivariations, including potentially inactivating promoter methylation events at 384 genes linked to human disease. For example, we observed promoter hypermethylation of BRCA1 and LDLR at population frequencies of ∼1 in 3,000 and ∼1 in 6,000, respectively, suggesting that epivariations may underlie a fraction of human disease which would be missed by purely sequence-based approaches. Using expression data, we confirmed that many epivariations are associated with outlier gene expression. Analysis of variation data and monozygous twin pairs suggests that approximately two-thirds of epivariations segregate in the population secondary to underlying sequence mutations, while one-third are likely sporadic events that occur post-zygotically. We identified 25 loci where rare hypermethylation coincided with the presence of an unstable CGG tandem repeat, validated the presence of CGG expansions at several loci, and identified the putative molecular defect underlying most of the known folate-sensitive fragile sites in the genome. Our study provides a catalog of rare epigenetic changes in the human genome, gives insight into the underlying origins and consequences of epivariations, and identifies many hypermethylated CGG repeat expansions.


Assuntos
Proteína BRCA1/genética , Epigênese Genética , Doenças Genéticas Inatas/genética , Genoma Humano , Receptores de LDL/genética , Expansão das Repetições de Trinucleotídeos , Proteína BRCA1/metabolismo , Metilação de DNA , Feminino , Ácido Fólico/metabolismo , Inativação Gênica , Doenças Genéticas Inatas/diagnóstico , Doenças Genéticas Inatas/patologia , Loci Gênicos , Variação Genética , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Regiões Promotoras Genéticas , Receptores de LDL/metabolismo , Gêmeos Monozigóticos
20.
J Urol ; 209(5): 890-900, 2023 05.
Artigo em Inglês | MEDLINE | ID: mdl-37026631

RESUMO

PURPOSE: Half of patients with muscle-invasive bladder cancer worldwide may not receive curative-intent therapy. Elderly or frail patients are most affected by this unmet need. TAR-200 is a novel, intravesical drug delivery system that provides sustained, local release of gemcitabine into the bladder over a 21-day dosing cycle. The phase 1 TAR-200-103 study evaluated the safety, tolerability, and preliminary efficacy of TAR-200 in patients with muscle-invasive bladder cancer who either refused or were unfit for curative-intent therapy. MATERIALS AND METHODS: Eligible patients had cT2-cT3bN0M0 urothelial carcinoma of the bladder. TAR-200 was inserted for 4 consecutive 21-day cycles over 84 days. The primary end points were safety and tolerability at 84 days. Secondary end points included rates of clinical complete response and partial response as determined by cystoscopy, biopsy, and imaging; duration of response; and overall survival. RESULTS: Median age of the 35 enrolled patients was 84 years, and most were male (24/35, 68.6%). Treatment-emergent adverse events related to TAR-200 occurred in 15 patients. Two patients experienced treatment-emergent adverse events leading to removal of TAR-200. At 3 months, complete response and partial response rates were 31.4% (11/35) and 8.6% (3/35), respectively, yielding an overall response rate of 40.0% (14/35; 95% CI 23.9-57.9). Median overall survival and duration of response were 27.3 months (95% CI 10.1-not estimable) and 14 months (95% CI 10.6-22.7), respectively. Progression-free rate at 12 months was 70.5%. CONCLUSIONS: TAR-200 was generally safe, well tolerated, and had beneficial preliminary efficacy in this elderly and frail cohort with limited treatment options.


Assuntos
Carcinoma de Células de Transição , Sistemas de Liberação de Medicamentos , Neoplasias da Bexiga Urinária , Idoso , Idoso de 80 Anos ou mais , Feminino , Humanos , Masculino , Administração Intravesical , Carcinoma de Células de Transição/tratamento farmacológico , Desoxicitidina , Músculos/patologia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA