Results 1 - 20 of 144
1.
Nature ; 625(7996): 805-812, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38093011

ABSTRACT

CRISPR-enabled screening is a powerful tool for the discovery of genes that control T cell function and has nominated candidate targets for immunotherapies (refs. 1-6). However, new approaches are required to probe specific nucleotide sequences within key genes. Systematic mutagenesis in primary human T cells could reveal alleles that tune specific phenotypes. DNA base editors are powerful tools for introducing targeted mutations with high efficiency (refs. 7,8). Here we develop a large-scale base-editing mutagenesis platform with the goal of pinpointing nucleotides that encode amino acid residues that tune primary human T cell activation responses. We generated a library of around 117,000 single guide RNA molecules targeting base editors to protein-coding sites across 385 genes implicated in T cell function and systematically identified protein domains and specific amino acid residues that regulate T cell activation and cytokine production. We found a broad spectrum of alleles with variants encoding critical residues in proteins including PIK3CD, VAV1, LCP2, PLCG1 and DGKZ, including both gain-of-function and loss-of-function mutations. We validated the functional effects of many alleles and further demonstrated that base-editing hits could positively and negatively tune T cell cytotoxic function. Finally, higher-resolution screening using a base editor with relaxed protospacer-adjacent motif requirements (ref. 9; NG versus NGG) revealed specific structural domains and protein-protein interaction sites that can be targeted to tune T cell functions. Base-editing screens in primary immune cells thus provide biochemical insights with the potential to accelerate immunotherapy design.


Subjects
Alleles , Gene Editing , Mutagenesis , T-Lymphocytes , Humans , Amino Acids/genetics , CRISPR-Cas Systems/genetics , Mutagenesis/genetics , RNA, Guide, CRISPR-Cas Systems/genetics , T-Lymphocytes/immunology , T-Lymphocytes/metabolism , Lymphocyte Activation , Cytokines/biosynthesis , Cytokines/metabolism , Gain of Function Mutation , Loss of Function Mutation
2.
bioRxiv ; 2024 Apr 10.
Article in English | MEDLINE | ID: mdl-37292653

ABSTRACT

Measures of selective constraint on genes have been used for many applications including clinical interpretation of rare coding variants, disease gene discovery, and studies of genome evolution. However, widely used metrics are severely underpowered at detecting constraint for the shortest ~25% of genes, potentially causing important pathogenic mutations to be overlooked. We developed a framework combining a population genetics model with machine learning on gene features to enable accurate inference of an interpretable constraint metric, s_het. Our estimates outperform existing metrics for prioritizing genes important for cell essentiality, human disease, and other phenotypes, especially for short genes. Our new estimates of selective constraint should have wide utility for characterizing genes relevant to human disease. Finally, our inference framework, GeneBayes, provides a flexible platform that can improve estimation of many gene-level properties, such as rare variant burden or gene expression differences.

3.
Cell Genom ; 3(10): 100401, 2023 Oct 11.
Article in English | MEDLINE | ID: mdl-37868038

ABSTRACT

Each human genome has tens of thousands of rare genetic variants; however, identifying impactful rare variants remains a major challenge. We demonstrate how use of personal multi-omics can enable identification of impactful rare variants by using the Multi-Ethnic Study of Atherosclerosis, which included several hundred individuals, with whole-genome sequencing, transcriptomes, methylomes, and proteomes collected across two time points, 10 years apart. We evaluated each multi-omics phenotype's ability to separately and jointly inform functional rare variation. By combining expression and protein data, we observed rare stop variants 62 times and rare frameshift variants 216 times as frequently as controls, compared to 13-27 times as frequently for expression or protein effects alone. We extended a Bayesian hierarchical model, "Watershed," to prioritize specific rare variants underlying multi-omics signals across the regulatory cascade. With this approach, we identified rare variants that exhibited large effect sizes on multiple complex traits including height, schizophrenia, and Alzheimer's disease.

4.
Nat Genet ; 55(11): 1866-1875, 2023 Nov.
Article in English | MEDLINE | ID: mdl-37857933

ABSTRACT

Most signals in genome-wide association studies (GWAS) of complex traits implicate noncoding genetic variants with putative gene regulatory effects. However, currently identified regulatory variants, notably expression quantitative trait loci (eQTLs), explain only a small fraction of GWAS signals. Here, we show that GWAS and cis-eQTL hits are systematically different: eQTLs cluster strongly near transcription start sites, whereas GWAS hits do not. Genes near GWAS hits are enriched in key functional annotations, are under strong selective constraint and have complex regulatory landscapes across different tissue/cell types, whereas genes near eQTLs are depleted of most functional annotations, show relaxed constraint, and have simpler regulatory landscapes. We describe a model to understand these observations, including how natural selection on complex traits hinders discovery of functionally relevant eQTLs. Our results imply that GWAS and eQTL studies are systematically biased toward different types of variant, and support the use of complementary functional approaches alongside the next generation of eQTL studies.


Subjects
Genome-Wide Association Study , Multifactorial Inheritance , Gene Expression Regulation/genetics , Quantitative Trait Loci/genetics , Gene Expression , Polymorphism, Single Nucleotide/genetics
5.
Genetics ; 225(3), 2023 Nov 01.
Article in English | MEDLINE | ID: mdl-37724741

ABSTRACT

The discrete-time Wright-Fisher (DTWF) model and its diffusion limit are central to population genetics. These models can describe the forward-in-time evolution of allele frequencies in a population resulting from genetic drift, mutation, and selection. Computing likelihoods under the diffusion process is feasible, but the diffusion approximation breaks down for large samples or in the presence of strong selection. Existing methods for computing likelihoods under the DTWF model do not scale to current exome sequencing sample sizes in the hundreds of thousands. Here, we present a scalable algorithm that approximates the DTWF model with provably bounded error. Our approach relies on two key observations about the DTWF model. The first is that transition probabilities under the model are approximately sparse. The second is that transition distributions for similar starting allele frequencies are extremely close as distributions. Together, these observations enable approximate matrix-vector multiplication in linear (as opposed to the usual quadratic) time. We prove similar properties for Hypergeometric distributions, enabling fast computation of likelihoods for subsamples of the population. We show theoretically and in practice that this approximation is highly accurate and can scale to population sizes in the tens of millions, paving the way for rigorous biobank-scale inference. Finally, we use our results to estimate the impact of larger samples on estimating selection coefficients for loss-of-function variants. We find that increasing sample sizes beyond existing large exome sequencing cohorts will provide essentially no additional information except for genes with the most extreme fitness effects.


Subjects
Biological Specimen Banks , Genetics, Population , Gene Frequency , Genetic Drift , Probability , Models, Genetic , Selection, Genetic
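
Illustration for entry 5. The abstract's first observation, that DTWF transition probabilities are approximately sparse, can be sketched in a few lines. This is a minimal illustration and not the authors' algorithm: it assumes a neutral haploid population of n individuals with no mutation or selection, the function names are hypothetical, and the truncation width of 6 standard deviations is an arbitrary choice. The truncated update below costs roughly n times the square root of n, whereas the linear-time method described in the abstract also exploits the similarity of transition distributions for nearby starting frequencies, which is not shown here.

```python
import math


def binom_pmf(n, k, p):
    """Probability of k successes under Binomial(n, p)."""
    return math.comb(n, k) * p**k * (1.0 - p) ** (n - k)


def dtwf_step(dist, n):
    """Exact one-generation neutral DTWF update (quadratic in n).

    dist[i] is the probability that i of the n individuals carry the allele.
    """
    new = [0.0] * (n + 1)
    for i, mass in enumerate(dist):
        if mass == 0.0:
            continue
        p = i / n  # allele frequency in the parental generation
        for j in range(n + 1):
            new[j] += mass * binom_pmf(n, j, p)
    return new


def dtwf_step_truncated(dist, n, width=6.0):
    """Same update, skipping transitions far from the mean.

    Assumption for illustration: entries more than `width` binomial
    standard deviations from the expected count are treated as zero.
    """
    new = [0.0] * (n + 1)
    for i, mass in enumerate(dist):
        if mass == 0.0:
            continue
        p = i / n
        sd = math.sqrt(n * p * (1.0 - p))
        lo = max(0, math.floor(i - width * sd))
        hi = min(n, math.ceil(i + width * sd))
        for j in range(lo, hi + 1):
            new[j] += mass * binom_pmf(n, j, p)
    return new
```

Comparing the two functions on a small population (for example n = 50 with all mass on 10 copies) shows how little probability mass the truncation discards, which is the sense in which each row of the transition matrix is approximately sparse.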
6.
bioRxiv ; 2023 Oct 24.
Article in English | MEDLINE | ID: mdl-37745614

ABSTRACT

The effects of genetic variation on complex traits act mainly through changes in gene regulation. Although many genetic variants have been linked to target genes in cis, the trans-regulatory cascade mediating their effects remains largely uncharacterized. Mapping trans-regulators based on natural genetic variation, including eQTL mapping, has been challenging due to small effects. Experimental perturbation approaches offer a complementary and powerful approach to mapping trans-regulators. We used CRISPR knockouts of 84 genes in primary CD4+ T cells to perturb an immune cell gene network, targeting both inborn error of immunity (IEI) disease transcription factors (TFs) and background TFs matched in constraint and expression level, but without a known immune disease association. We developed a novel Bayesian structure learning method called Linear Latent Causal Bayes (LLCB) to estimate the gene regulatory network from perturbation data and observed 211 directed edges among the genes which could not be detected in existing CD4+ trans-eQTL data. We used LLCB to characterize the differences between the IEI and background TFs, finding that the gene groups were highly interconnected, but that IEI TFs were much more likely to regulate immune cell specific pathways and immune GWAS genes. We further characterized nine coherent gene programs based on downstream effects of the TFs and linked these modules to regulation of GWAS genes, finding that canonical JAK-STAT family members are regulated by KMT2A, a global epigenetic regulator. These analyses reveal the trans-regulatory cascade from upstream epigenetic regulator to intermediate TFs to downstream effector cytokines and elucidate the logic linking immune GWAS genes to key signaling pathways.

7.
Nature ; 621(7977): 188-195, 2023 Sep.
Article in English | MEDLINE | ID: mdl-37648854

ABSTRACT

γδ T cells are potent anticancer effectors with the potential to target tumours broadly, independent of patient-specific neoantigens or human leukocyte antigen background (refs. 1-5). γδ T cells can sense conserved cell stress signals prevalent in transformed cells (refs. 2,3), although the mechanisms behind the targeting of stressed target cells remain poorly characterized. Vγ9Vδ2 T cells, the most abundant subset of human γδ T cells (ref. 4), recognize a protein complex containing butyrophilin 2A1 (BTN2A1) and BTN3A1 (refs. 6-8), a widely expressed cell surface protein that is activated by phosphoantigens abundantly produced by tumour cells. Here we combined genome-wide CRISPR screens in target cancer cells to identify pathways that regulate γδ T cell killing and BTN3A cell surface expression. The screens showed previously unappreciated multilayered regulation of BTN3A abundance on the cell surface and triggering of γδ T cells through transcription, post-translational modifications and membrane trafficking. In addition, diverse genetic perturbations and inhibitors disrupting metabolic pathways in the cancer cells, particularly ATP-producing processes, were found to alter BTN3A levels. This induction of both BTN3A and BTN2A1 during metabolic crises is dependent on AMP-activated protein kinase (AMPK). Finally, small-molecule activation of AMPK in a cell line model and in patient-derived tumour organoids led to increased expression of the BTN2A1-BTN3A complex and increased Vγ9Vδ2 T cell receptor-mediated killing. This AMPK-dependent mechanism of metabolic stress-induced ligand upregulation deepens our understanding of γδ T cell stress surveillance and suggests new avenues available to enhance γδ T cell anticancer activity.


Subjects
CRISPR-Cas Systems , Gene Editing , Neoplasms , Receptors, Antigen, T-Cell, gamma-delta , T-Lymphocytes , Humans , AMP-Activated Protein Kinases/genetics , AMP-Activated Protein Kinases/metabolism , Cell Line , Cell Membrane/metabolism , Neoplasms/genetics , Neoplasms/immunology , Neoplasms/metabolism , Receptors, Antigen, T-Cell, gamma-delta/immunology , Receptors, Antigen, T-Cell, gamma-delta/metabolism , T-Lymphocytes/immunology , T-Lymphocytes/metabolism
8.
Nat Ecol Evol ; 7(9): 1515-1524, 2023 Sep.
Article in English | MEDLINE | ID: mdl-37592021

ABSTRACT

The Iron Age was a dynamic period in central Mediterranean history, with the expansion of Greek and Phoenician colonies and the growth of Carthage into the dominant maritime power of the Mediterranean. These events were facilitated by the ease of long-distance travel following major advances in seafaring. We know from the archaeological record that trade goods and materials were moving across great distances in unprecedented quantities, but it is unclear how these patterns correlate with human mobility. Here, to investigate population mobility and interactions directly, we sequenced the genomes of 30 ancient individuals from coastal cities around the central Mediterranean, in Tunisia, Sardinia and central Italy. We observe a meaningful contribution of autochthonous populations, as well as highly heterogeneous ancestry including many individuals with non-local ancestries from other parts of the Mediterranean region. These results highlight both the role of local populations and the extreme interconnectedness of populations in the Iron Age Mediterranean. By studying these trans-Mediterranean neighbours together, we explore the complex interplay between local continuity and mobility that shaped the Iron Age societies of the central Mediterranean.


Subjects
DNA, Ancient , Human Migration , Mediterranean Region , Archaeology , Human Migration/history , Humans , Principal Component Analysis , Human Genetics , DNA, Ancient/analysis , Sequence Analysis, DNA , Burial , Anthropology , History, Ancient
9.
Science ; 381(6658): eade6289, 2023 Aug 11.
Article in English | MEDLINE | ID: mdl-37561850

ABSTRACT

Skin color, one of the most diverse human traits, is determined by the quantity, type, and distribution of melanin. In this study, we leveraged the light-scattering properties of melanin to conduct a genome-wide screen for regulators of melanogenesis. We identified 169 functionally diverse genes that converge on melanosome biogenesis, endosomal transport, and gene regulation, of which 135 represented previously unknown associations with pigmentation. In agreement with their melanin-promoting function, the majority of screen hits were up-regulated in melanocytes from darkly pigmented individuals. We further unraveled functions of KLF6 as a transcription factor that regulates melanosome maturation and pigmentation in vivo, and of the endosomal trafficking protein COMMD3 in modulating melanosomal pH. Our study reveals a plethora of melanin-promoting genes, with broad implications for human variation, cell biology, and medicine.


Subjects
Adaptor Proteins, Signal Transducing , Kruppel-Like Factor 6 , Melanins , Melanocytes , Melanosomes , Skin Pigmentation , Humans , Melanins/biosynthesis , Melanins/genetics , Melanocytes/metabolism , Melanosomes/metabolism , Skin Pigmentation/genetics , Genome-Wide Association Study , Adaptor Proteins, Signal Transducing/genetics , Adaptor Proteins, Signal Transducing/metabolism , Kruppel-Like Factor 6/genetics , Kruppel-Like Factor 6/metabolism , Endosomes/metabolism , Animals , Mice , Cell Line, Tumor
10.
Res Sq ; 2023 Jun 13.
Article in English | MEDLINE | ID: mdl-37398424

ABSTRACT

Measures of selective constraint on genes have been used for many applications including clinical interpretation of rare coding variants, disease gene discovery, and studies of genome evolution. However, widely used metrics are severely underpowered at detecting constraint for the shortest ~25% of genes, potentially causing important pathogenic mutations to be overlooked. We developed a framework combining a population genetics model with machine learning on gene features to enable accurate inference of an interpretable constraint metric, s_het. Our estimates outperform existing metrics for prioritizing genes important for cell essentiality, human disease, and other phenotypes, especially for short genes. Our new estimates of selective constraint should have wide utility for characterizing genes relevant to human disease. Finally, our inference framework, GeneBayes, provides a flexible platform that can improve estimation of many gene-level properties, such as rare variant burden or gene expression differences.

11.
Genetics ; 224(3), 2023 Jul 06.
Article in English | MEDLINE | ID: mdl-37410594

ABSTRACT

Members of genetically admixed populations possess ancestry from multiple source groups, and studies of human genetic admixture frequently estimate ancestry components corresponding to fractions of individual genomes that trace to specific ancestral populations. However, the same numerical ancestry fraction can represent a wide array of admixture scenarios within an individual's genealogy. Using a mechanistic model of admixture, we consider admixture genealogically: how many ancestors from the source populations does the admixture represent? We consider African-Americans, for whom continent-level estimates produce a 75-85% value for African ancestry on average and 15-25% for European ancestry. Genetic studies together with key features of African-American demographic history suggest ranges for parameters of a simple three-epoch model. Considering parameter sets compatible with estimates of current ancestry levels, we infer that if all genealogical lines of a random African-American born during 1960-1965 are traced back until they reach members of source populations, the mean over parameter sets of the expected number of genealogical lines terminating with African individuals is 314 (interquartile range 240-376), and the mean of the expected number terminating in Europeans is 51 (interquartile range 32-69). Across discrete generations, the peak number of African genealogical ancestors occurs in birth cohorts from the early 1700s, and the probability exceeds 50% that at least one European ancestor was born more recently than 1835. Our genealogical perspective can contribute to further understanding the admixture processes that underlie admixed populations. For African-Americans, the results provide insight both on how many of the ancestors of a typical African-American might have been forcibly displaced in the Transatlantic Slave Trade and on how many separate European admixture events might exist in a typical African-American genealogy.


Subjects
Black People , Black or African American , Humans , Black People/genetics , Black or African American/genetics , Genetics, Population
12.
Elife ; 12, 2023 Jun 12.
Article in English | MEDLINE | ID: mdl-37306301

ABSTRACT

The formation of paralogs through gene duplication is a core evolutionary process. For paralogs that encode components of protein complexes such as the ribosome, a central question is whether they encode functionally distinct proteins or whether they exist to maintain appropriate total expression of equivalent proteins. Here, we systematically tested evolutionary models of paralog function using the ribosomal protein paralogs Rps27 (eS27) and Rps27l (eS27L) as a case study. Evolutionary analysis suggests that Rps27 and Rps27l likely arose during whole-genome duplication(s) in a common vertebrate ancestor. We show that Rps27 and Rps27l have inversely correlated mRNA abundance across mouse cell types, with the highest Rps27 in lymphocytes and the highest Rps27l in mammary alveolar cells and hepatocytes. By endogenously tagging the Rps27 and Rps27l proteins, we demonstrate that Rps27- and Rps27l-ribosomes associate preferentially with different transcripts. Furthermore, murine Rps27 and Rps27l loss-of-function alleles are homozygous lethal at different developmental stages. However, strikingly, expressing Rps27 protein from the endogenous Rps27l locus or vice versa completely rescues loss-of-function lethality and yields mice with no detectable deficits. Together, these findings suggest that Rps27 and Rps27l are evolutionarily retained because their subfunctionalized expression patterns render both genes necessary to achieve the requisite total expression of two equivalent proteins across cell types. Our work represents the most in-depth characterization of a mammalian ribosomal protein paralog to date and highlights the importance of considering both protein function and expression when investigating paralogs.


Subjects
Ribosomal Proteins , Ribosomes , Animals , Mice , Ribosomal Proteins/genetics , Ribosomal Proteins/metabolism , Ribosomes/metabolism , Vertebrates/genetics , Genome , Mammals/genetics
13.
bioRxiv ; 2023 May 22.
Article in English | MEDLINE | ID: mdl-37293115

ABSTRACT

The Discrete-Time Wright-Fisher (DTWF) model and its large population diffusion limit are central to population genetics. These models describe the forward-in-time evolution of the frequency of an allele in a population and can include the fundamental forces of genetic drift, mutation, and selection. Computing likelihoods under the diffusion process is feasible, but the diffusion approximation breaks down for large sample sizes or in the presence of strong selection. Unfortunately, existing methods for computing likelihoods under the DTWF model do not scale to current exome sequencing sample sizes in the hundreds of thousands. Here we present an algorithm that approximates the DTWF model with provably bounded error and runs in time linear in the size of the population. Our approach relies on two key observations about Binomial distributions. The first is that Binomial distributions are approximately sparse. The second is that Binomial distributions with similar success probabilities are extremely close as distributions, allowing us to approximate the DTWF Markov transition matrix as a very low rank matrix. Together, these observations enable matrix-vector multiplication in linear (as opposed to the usual quadratic) time. We prove similar properties for Hypergeometric distributions, enabling fast computation of likelihoods for subsamples of the population. We show theoretically and in practice that this approximation is highly accurate and can scale to population sizes in the billions, paving the way for rigorous biobank-scale population genetic inference. Finally, we use our results to estimate how increasing sample sizes will improve the estimation of selection coefficients acting on loss-of-function variants. We find that increasing sample sizes beyond existing large exome sequencing cohorts will provide essentially no additional information except for genes with the most extreme fitness effects.

14.
Genome Res ; 33(5): 689-702, 2023 May.
Article in English | MEDLINE | ID: mdl-37127331

ABSTRACT

Short tandem repeats (STRs) are a class of rapidly mutating genetic elements typically characterized by repeated units of 1-6 bp. We leveraged whole-genome sequencing data for 152 recombinant inbred (RI) strains from the BXD family of mice to map loci that modulate genome-wide patterns of new mutations arising during parent-to-offspring transmission at STRs. We defined quantitative phenotypes describing the numbers and types of germline STR mutations in each strain and performed quantitative trait locus (QTL) analyses for each of these phenotypes. We identified a locus on Chromosome 13 at which strains inheriting the C57BL/6J (B) haplotype have a higher rate of STR expansions than those inheriting the DBA/2J (D) haplotype. The strongest candidate gene in this locus is Msh3, a known modifier of STR stability in cancer and at pathogenic repeat expansions in mice and humans, as well as a current drug target against Huntington's disease. The D haplotype at this locus harbors a cluster of variants near the 5' end of Msh3, including multiple missense variants near the DNA mismatch recognition domain. In contrast, the B haplotype contains a unique retrotransposon insertion. The rate of expansion covaries positively with Msh3 expression, with higher expression from the B haplotype. Finally, detailed analysis of mutation patterns showed that strains carrying the B allele have higher expansion rates, but slightly lower overall total mutation rates, compared with those with the D allele, particularly at tetranucleotide repeats. Our results suggest an important role for inherited variants in Msh3 in modulating genome-wide patterns of germline mutations at STRs.


Subjects
Microsatellite Repeats , Quantitative Trait Loci , Animals , Mice , Haplotypes , Mice, Inbred C57BL , Mice, Inbred DBA
15.
Nat Genet ; 55(5): 841-851, 2023 May.
Article in English | MEDLINE | ID: mdl-37024583

ABSTRACT

Transcriptional regulation exhibits extensive robustness, but human genetics indicates sensitivity to transcription factor (TF) dosage. Reconciling such observations requires quantitative studies of TF dosage effects at trait-relevant ranges, largely lacking so far. TFs play central roles in both normal-range and disease-associated variation in craniofacial morphology; we therefore developed an approach to precisely modulate TF levels in human facial progenitor cells and applied it to SOX9, a TF associated with craniofacial variation and disease (Pierre Robin sequence (PRS)). Most SOX9-dependent regulatory elements (REs) are buffered against small decreases in SOX9 dosage, but REs directly and primarily regulated by SOX9 show heightened sensitivity to SOX9 dosage; these RE responses partially predict gene expression responses. Sensitive REs and genes preferentially affect functional chondrogenesis and PRS-like craniofacial shape variation. We propose that such REs and genes underlie the sensitivity of specific phenotypes to TF dosage, while buffering of other genes leads to robust, nonlinear dosage-to-phenotype relationships.


Subjects
Pierre Robin Syndrome , SOX9 Transcription Factor , Humans , SOX9 Transcription Factor/genetics , Pierre Robin Syndrome/genetics , Gene Expression Regulation , Regulatory Sequences, Nucleic Acid , Phenotype
16.
bioRxiv ; 2023 Dec 04.
Article in English | MEDLINE | ID: mdl-36778251

ABSTRACT

With hundreds of copies of ribosomal DNA (rDNA) it is unknown whether they possess sequence variations that ultimately form different types of ribosomes. Here, we developed an algorithm for variant-calling between paralog genes (termed RGA) and compared rDNA variations with rRNA variations from long-read sequencing of translating ribosomes (RIBO-RT). Our analyses identified dozens of highly abundant rRNA variants, largely indels, that are incorporated into translationally active ribosomes and assemble into distinct ribosome subtypes encoded on different chromosomes. We developed an in-situ rRNA sequencing method (SWITCH-seq) revealing that variants are co-expressed within individual cells and found that they possess different structures. Lastly, we observed tissue-specific rRNA-subtype expression and linked specific rRNA variants to cancer. This study therefore reveals the variation landscape of translating ribosomes within human cells.

17.
Elife ; 11, 2022 Sep 08.
Article in English | MEDLINE | ID: mdl-36073519

ABSTRACT

Pleiotropy and genetic correlation are widespread features in genome-wide association studies (GWAS), but they are often difficult to interpret at the molecular level. Here, we perform GWAS of 16 metabolites clustered at the intersection of amino acid catabolism, glycolysis, and ketone body metabolism in a subset of UK Biobank. We utilize the well-documented biochemistry jointly impacting these metabolites to analyze pleiotropic effects in the context of their pathways. Among the 213 lead GWAS hits, we find a strong enrichment for genes encoding pathway-relevant enzymes and transporters. We demonstrate that the effect directions of variants acting on biology between metabolite pairs often contrast with those of upstream or downstream variants as well as the polygenic background. Thus, we find that these outlier variants often reflect biology local to the traits. Finally, we explore the implications for interpreting disease GWAS, underscoring the potential of unifying biochemistry with dense metabolomics data to understand the molecular basis of pleiotropy in complex traits and diseases.


Subjects
Genetic Pleiotropy , Genome-Wide Association Study , Amino Acids/genetics , Ketones , Phenotype , Polymorphism, Single Nucleotide
18.
Nature ; 608(7923): 569-577, 2022 Aug.
Article in English | MEDLINE | ID: mdl-35922514

ABSTRACT

A major challenge in human genetics is to identify the molecular mechanisms of trait-associated and disease-associated variants. To achieve this, quantitative trait locus (QTL) mapping of genetic variants with intermediate molecular phenotypes such as gene expression and splicing has been widely adopted (refs. 1,2). However, despite successes, the molecular basis for a considerable fraction of trait-associated and disease-associated variants remains unclear (refs. 3,4). Here we show that ADAR-mediated adenosine-to-inosine RNA editing, a post-transcriptional event vital for suppressing cellular double-stranded RNA (dsRNA)-mediated innate immune interferon responses (refs. 5-11), is an important potential mechanism underlying genetic variants associated with common inflammatory diseases. We identified and characterized 30,319 cis-RNA editing QTLs (edQTLs) across 49 human tissues. These edQTLs were significantly enriched in genome-wide association study signals for autoimmune and immune-mediated diseases. Colocalization analysis of edQTLs with disease risk loci further pinpointed key, putatively immunogenic dsRNAs formed by expected inverted repeat Alu elements as well as unexpected, highly over-represented cis-natural antisense transcripts. Furthermore, inflammatory disease risk variants, in aggregate, were associated with reduced editing of nearby dsRNAs and induced interferon responses in inflammatory diseases. This unique directional effect agrees with the established mechanism that lack of RNA editing by ADAR1 leads to the specific activation of the dsRNA sensor MDA5 and subsequent interferon responses and inflammation (refs. 7-9). Our findings implicate cellular dsRNA editing and sensing as a previously underappreciated mechanism of common inflammatory diseases.


Subjects
Adenosine Deaminase , Genetic Predisposition to Disease , Immune System Diseases , Inflammation , RNA Editing , RNA, Double-Stranded , Adenosine/metabolism , Adenosine Deaminase/genetics , Adenosine Deaminase/metabolism , Alu Elements/genetics , Autoimmune Diseases/genetics , Autoimmune Diseases/immunology , Autoimmune Diseases/pathology , Genome-Wide Association Study , Humans , Immune System Diseases/genetics , Immune System Diseases/immunology , Immune System Diseases/pathology , Immunity, Innate , Inflammation/genetics , Inflammation/immunology , Inflammation/pathology , Inosine/metabolism , Interferon-Induced Helicase, IFIH1/metabolism , Interferons/genetics , Interferons/immunology , Quantitative Trait Loci/genetics , RNA Editing/genetics , RNA, Double-Stranded/genetics , RNA-Binding Proteins/metabolism
19.
Nat Genet ; 54(8): 1133-1144, 2022 Aug.
Article in English | MEDLINE | ID: mdl-35817986

ABSTRACT

Gene regulatory networks ensure that important genes are expressed at precise levels. When gene expression is sufficiently perturbed, it can lead to disease. To understand how gene expression disruptions percolate through a network, we must first map connections between regulatory genes and their downstream targets. However, we lack comprehensive knowledge of the upstream regulators of most genes. Here, we developed an approach for systematic discovery of upstream regulators of critical immune factors (IL2RA, IL-2 and CTLA4) in primary human T cells. Then, we mapped the network of the target genes of these regulators and putative cis-regulatory elements using CRISPR perturbations, RNA-seq and ATAC-seq. These regulators form densely interconnected networks with extensive feedback loops. Furthermore, this network is enriched for immune-associated disease variants and genes. These results provide insight into how immune-associated disease genes are regulated in T cells and broader principles about the structure of human gene regulatory networks.


Subjects
Gene Regulatory Networks , Genes, Regulator , T-Lymphocytes , Clustered Regularly Interspaced Short Palindromic Repeats , Gene Regulatory Networks/genetics , Humans , T-Lymphocytes/immunology
20.
Am J Hum Genet ; 109(7): 1286-1297, 2022 Jul 07.
Article in English | MEDLINE | ID: mdl-35716666

ABSTRACT

Despite the growing number of genome-wide association studies (GWASs), it remains unclear to what extent gene-by-gene and gene-by-environment interactions influence complex traits in humans. The magnitude of genetic interactions in complex traits has been difficult to quantify because GWASs are generally underpowered to detect individual interactions of small effect. Here, we develop a method to test for genetic interactions that aggregates information across all trait-associated loci. Specifically, we test whether SNPs in regions of European ancestry shared between European American and admixed African American individuals have the same causal effect sizes. We hypothesize that in African Americans, the presence of genetic interactions will drive the causal effect sizes of SNPs in regions of European ancestry to be more similar to those of SNPs in regions of African ancestry. We apply our method to two traits: gene expression in 296 African Americans and 482 European Americans in the Multi-Ethnic Study of Atherosclerosis (MESA) and low-density lipoprotein cholesterol (LDL-C) in 74K African Americans and 296K European Americans in the Million Veteran Program (MVP). We find significant evidence for genetic interactions in our analysis of gene expression; for LDL-C, we observe a similar point estimate, although this is not significant, most likely due to lower statistical power. These results suggest that gene-by-gene or gene-by-environment interactions modify the effect sizes of causal variants in human complex traits.


Subjects
Genome-Wide Association Study , Multifactorial Inheritance , Cholesterol, LDL , Gene Expression , Humans , Multifactorial Inheritance/genetics , Polymorphism, Single Nucleotide/genetics , White People/genetics