ABSTRACT
Regulation of transcript structure generates transcript diversity and plays an important role in human disease [1-7]. The advent of long-read sequencing technologies offers the opportunity to study the role of genetic variation in transcript structure [8-16]. In this Article, we present a large human long-read RNA-seq dataset using the Oxford Nanopore Technologies platform from 88 samples from Genotype-Tissue Expression (GTEx) tissues and cell lines, complementing the GTEx resource. We identified just over 70,000 novel transcripts for annotated genes, and validated the protein expression of 10% of novel transcripts. We developed a new computational package, LORALS, to analyse the genetic effects of rare and common variants on the transcriptome by allele-specific analysis of long reads. We characterized allele-specific expression and transcript structure events, providing new insights into the specific transcript alterations caused by common and rare genetic variants and highlighting the resolution gained from long-read data. We were able to perturb the transcript structure upon knockdown of PTBP1, an RNA binding protein that mediates splicing, thereby finding genetic regulatory effects that are modified by the cellular environment. Finally, we used this dataset to enhance variant interpretation and study rare variants leading to aberrant splicing patterns.
Subjects
Alleles; Gene Expression Profiling; Organ Specificity; RNA-Seq; Transcriptome; Alternative Splicing/genetics; Cell Line; Datasets as Topic; Genotype; Heterogeneous-Nuclear Ribonucleoproteins/deficiency; Heterogeneous-Nuclear Ribonucleoproteins/genetics; Humans; Organ Specificity/genetics; Polypyrimidine Tract-Binding Protein/deficiency; Polypyrimidine Tract-Binding Protein/genetics; Reproducibility of Results; Transcriptome/genetics
ABSTRACT
Genetic variations underlying susceptibility to complex autoimmune and allergic diseases are concentrated within noncoding regulatory elements termed enhancers [1]. The functions of a large majority of disease-associated enhancers are unknown, in part owing to their distance from the genes they regulate, a lack of understanding of the cell types in which they operate, and our inability to recapitulate the biology of immune diseases in vitro. Here, using shared synteny to guide loss-of-function analysis of homologues of human enhancers in mice, we show that the prominent autoimmune and allergic disease risk locus at chromosome 11q13.5 [2-7] contains a distal enhancer that is functional in CD4+ regulatory T (Treg) cells and required for Treg-mediated suppression of colitis. The enhancer recruits the transcription factors STAT5 and NF-κB to mediate signal-driven expression of Lrrc32, which encodes the protein glycoprotein A repetitions predominant (GARP). Whereas disruption of the Lrrc32 gene results in early lethality, mice lacking the enhancer are viable but lack GARP expression in Foxp3+ Treg cells, which are unable to control colitis in a cell-transfer model of the disease. In human Treg cells, the enhancer forms conformational interactions with the promoter of LRRC32 and enhancer risk variants are associated with reduced histone acetylation and GARP expression. Finally, functional fine-mapping of 11q13.5 using CRISPR-activation (CRISPRa) identifies a CRISPRa-responsive element in the vicinity of risk variant rs11236797 capable of driving GARP expression. These findings provide a mechanistic basis for association of the 11q13.5 risk locus with immune-mediated diseases and identify GARP as a potential target in their therapy.
Subjects
Chromosomes, Human, Pair 11/genetics; Colitis/genetics; Colitis/immunology; Enhancer Elements, Genetic/genetics; Genetic Predisposition to Disease/genetics; T-Lymphocytes, Regulatory/immunology; Acetylation; Alleles; Animals; Chromosomes, Mammalian/genetics; Female; Forkhead Transcription Factors/metabolism; Histones/metabolism; Humans; Membrane Proteins/genetics; Membrane Proteins/metabolism; Mice; Synteny/genetics
ABSTRACT
T-cell activation is a critical driver of immune responses. CD28 costimulation is an essential regulator of CD4 T-cell responses; however, its relative importance in naive and memory T cells is not fully understood. Using different model systems, we observe that human memory T cells are more sensitive to CD28 costimulation than naive T cells. To deconvolute how the T-cell receptor (TCR) and CD28 orchestrate activation of human T cells, we stimulate cells using varying intensities of TCR and CD28 stimulation and profile gene expression. We show that genes involved in cell cycle progression and division are CD28-driven in memory cells, but under TCR control in naive cells. We further demonstrate that T-helper differentiation and cytokine expression are controlled by CD28. Using chromatin accessibility profiling, we observe that AP1 transcriptional regulation is enriched when both TCR and CD28 are engaged, whereas open chromatin near CD28-sensitive genes is enriched for NF-κB motifs. Lastly, we show that CD28-sensitive genes are enriched in GWAS regions associated with immune diseases, implicating a role for CD28 in disease development. Our study provides important insights into the differential role of costimulation in naive and memory T-cell responses and disease susceptibility.
Subjects
CD28 Antigens/immunology; CD4-Positive T-Lymphocytes/immunology; Immunologic Memory/genetics; Lymphocyte Activation/genetics; Transcriptome; Adult; Animals; CHO Cells; Cell Line; Cell Line, Tumor; Cells, Cultured; Cricetinae; Cricetulus; Cytokines/genetics; Cytokines/metabolism; Female; Humans; Male; Middle Aged; Receptors, Antigen, T-Cell/immunology
ABSTRACT
Mapping hundreds of genetic variants through genome-wide association studies has provided an opportunity to gain insights into the pathobiology of immune-mediated diseases. However, as most of the disease variants fall outside gene coding sequences, the exact functional role of the associated variants remains to be determined. The integration of disease-associated variants with large-scale genomic maps of cell-type-specific gene regulation at both chromatin and transcript levels delivers examples of functionally prioritized causal variants and genes. In particular, the enrichment of disease variants in regions marked by histone modifications can point towards the cell types most relevant to disease development. Furthermore, chromatin contact maps that directly link enhancers to promoter regions allow the identification of genes that can be regulated by the disease variants. Candidate genes implicated by such approaches can be further examined through the correlation of gene expression with genotypes. Additionally, in the context of immune-mediated diseases it is important to combine genomics with immunology approaches. Genotype correlations with the immune system as a whole, as well as with cellular responses to different stimuli, provide a valuable platform for understanding the functional impact of disease-associated variants. The intersection of immunogenomic resources with disease-associated variants paints a detailed picture of disease-causal mechanisms. Here, we provide an overview of recent studies that combine these approaches to identify disease-vulnerable pathways.
Subjects
Chromosome Mapping; Genetic Variation; Genomics/methods; Genotype; Immune System Diseases/genetics; Immune System Diseases/immunology; Animals; Humans
ABSTRACT
Most variants associated with complex traits and diseases identified by genome-wide association studies (GWAS) map to noncoding regions of the genome with unknown effects. Using ancestrally diverse, biobank-scale GWAS data, massively parallel CRISPR screens, and single-cell transcriptomic and proteomic sequencing, we discovered 124 cis-target genes of 91 noncoding blood trait GWAS loci. Using precise variant insertion through base editing, we connected specific variants with gene expression changes. We also identified trans-effect networks of noncoding loci when cis target genes encoded transcription factors or microRNAs. Networks were themselves enriched for GWAS variants and demonstrated polygenic contributions to complex traits. This platform enables massively parallel characterization of the target genes and mechanisms of human noncoding variants in both cis and trans.
Subjects
Disease; Genome-Wide Association Study; Multifactorial Inheritance; Quantitative Trait Loci; Single-Cell Analysis; Humans; Clustered Regularly Interspaced Short Palindromic Repeats; Genetic Predisposition to Disease; Polymorphism, Single Nucleotide; Proteomics; Blood Cells; RNA-Seq; Disease/genetics
ABSTRACT
Exonic variants present some of the strongest links between genotype and phenotype. However, these variants can have significant inter-individual pathogenicity differences, known as variable penetrance. In this study, we propose a model where genetically controlled mRNA splicing modulates the pathogenicity of exonic variants. By first cataloging exonic inclusion from RNA-seq data in GTEx v8, we find that pathogenic alleles are depleted on highly included exons. Using large-scale phased WGS data from the TOPMed consortium, we observe that this effect may be driven by common splice-regulatory genetic variants, and that natural selection acts on haplotype configurations that reduce the transcript inclusion of putatively pathogenic variants, especially when limiting to haploinsufficient genes. Finally, we test whether this effect may be relevant for autism risk using families from the Simons Simplex Collection, but find that splicing of pathogenic alleles has a penetrance-reducing effect here as well. Overall, our results indicate that common splice-regulatory variants may play a role in reducing the damaging effects of rare exonic variants.
ABSTRACT
Exonic variants present some of the strongest links between genotype and phenotype. However, these variants can have significant inter-individual pathogenicity differences, known as variable penetrance. In this study, we propose a model where genetically controlled mRNA splicing modulates the pathogenicity of exonic variants. By first cataloging exonic inclusion from RNA-sequencing data in GTEx V8, we find that pathogenic alleles are depleted on highly included exons. Using large-scale phased whole-genome sequencing data from the TOPMed consortium, we observe that this effect may be driven by common splice-regulatory genetic variants, and that natural selection acts on haplotype configurations that reduce the transcript inclusion of putatively pathogenic variants, especially when limiting to haploinsufficient genes. Finally, we test whether this effect may be relevant for autism risk using families from the Simons Simplex Collection, but find that splicing of pathogenic alleles has a penetrance-reducing effect here as well. Overall, our results indicate that common splice-regulatory variants may play a role in reducing the damaging effects of rare exonic variants.
Subjects
RNA Splice Sites; RNA Splicing; Penetrance; Exons; Genotype; RNA, Messenger/genetics; Alternative Splicing
ABSTRACT
Identifying cellular functions dysregulated by disease-associated variants could implicate novel pathways for drug targeting or modulation in cell therapies. However, follow-up studies can be challenging if disease-relevant cell types are difficult to sample. Variants associated with immune diseases point toward the role of CD4+ regulatory T cells (Treg cells). We mapped genetic regulation (quantitative trait loci [QTL]) of gene expression and chromatin activity in Treg cells, and we identified 133 colocalizing loci with immune disease variants. Colocalizations of immune disease genome-wide association study (GWAS) variants with expression QTLs (eQTLs) controlling the expression of CD28 and STAT5A, involved in Treg cell activation and interleukin-2 (IL-2) signaling, support the contribution of Treg cells to the pathobiology of immune diseases. Finally, we identified seven known drug targets suitable for drug repurposing and suggested 63 targets with drug tractability evidence among the GWAS signals that colocalized with Treg cell QTLs. Our study is the first in-depth characterization of immune disease variant effects on Treg cell gene expression modulation and dysregulation of Treg cell function.
ABSTRACT
The emergence of multidrug-resistant (MDR) typhoid is a major global health threat affecting many countries where the disease is endemic. Here whole-genome sequence analysis of 1,832 Salmonella enterica serovar Typhi (S. Typhi) identifies a single dominant MDR lineage, H58, that has emerged and spread throughout Asia and Africa over the last 30 years. Our analysis identifies numerous transmissions of H58, including multiple transfers from Asia to Africa and an ongoing, unrecognized MDR epidemic within Africa itself. Notably, our analysis indicates that H58 lineages are displacing antibiotic-sensitive isolates, transforming the global population structure of this pathogen. H58 isolates can harbor a complex MDR element residing either on transmissible IncHI1 plasmids or within multiple chromosomal integration sites. We also identify new mutations that define the H58 lineage. This phylogeographical analysis provides a framework to facilitate global management of MDR typhoid and is applicable to similar MDR lineages emerging in other bacterial species.