Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 144
Filtrar
1.
Nature ; 625(7996): 805-812, 2024 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-38093011

RESUMEN

CRISPR-enabled screening is a powerful tool for the discovery of genes that control T cell function and has nominated candidate targets for immunotherapies1-6. However, new approaches are required to probe specific nucleotide sequences within key genes. Systematic mutagenesis in primary human T cells could reveal alleles that tune specific phenotypes. DNA base editors are powerful tools for introducing targeted mutations with high efficiency7,8. Here we develop a large-scale base-editing mutagenesis platform with the goal of pinpointing nucleotides that encode amino acid residues that tune primary human T cell activation responses. We generated a library of around 117,000 single guide RNA molecules targeting base editors to protein-coding sites across 385 genes implicated in T cell function and systematically identified protein domains and specific amino acid residues that regulate T cell activation and cytokine production. We found a broad spectrum of alleles with variants encoding critical residues in proteins including PIK3CD, VAV1, LCP2, PLCG1 and DGKZ, including both gain-of-function and loss-of-function mutations. We validated the functional effects of many alleles and further demonstrated that base-editing hits could positively and negatively tune T cell cytotoxic function. Finally, higher-resolution screening using a base editor with relaxed protospacer-adjacent motif requirements9 (NG versus NGG) revealed specific structural domains and protein-protein interaction sites that can be targeted to tune T cell functions. Base-editing screens in primary immune cells thus provide biochemical insights with the potential to accelerate immunotherapy design.


Asunto(s)
Alelos , Edición Génica , Mutagénesis , Linfocitos T , Humanos , Aminoácidos/genética , Sistemas CRISPR-Cas/genética , Mutagénesis/genética , ARN Guía de Sistemas CRISPR-Cas/genética , Linfocitos T/inmunología , Linfocitos T/metabolismo , Activación de Linfocitos , Citocinas/biosíntesis , Citocinas/metabolismo , Mutación con Ganancia de Función , Mutación con Pérdida de Función
2.
bioRxiv ; 2024 Apr 10.
Artículo en Inglés | MEDLINE | ID: mdl-37292653

RESUMEN

Measures of selective constraint on genes have been used for many applications including clinical interpretation of rare coding variants, disease gene discovery, and studies of genome evolution. However, widely-used metrics are severely underpowered at detecting constraint for the shortest ~25% of genes, potentially causing important pathogenic mutations to be over-looked. We developed a framework combining a population genetics model with machine learning on gene features to enable accurate inference of an interpretable constraint metric, s het . Our estimates outperform existing metrics for prioritizing genes important for cell essentiality, human disease, and other phenotypes, especially for short genes. Our new estimates of selective constraint should have wide utility for characterizing genes relevant to human disease. Finally, our inference framework, GeneBayes, provides a flexible platform that can improve estimation of many gene-level properties, such as rare variant burden or gene expression differences.

3.
Nat Genet ; 55(11): 1866-1875, 2023 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-37857933

RESUMEN

Most signals in genome-wide association studies (GWAS) of complex traits implicate noncoding genetic variants with putative gene regulatory effects. However, currently identified regulatory variants, notably expression quantitative trait loci (eQTLs), explain only a small fraction of GWAS signals. Here, we show that GWAS and cis-eQTL hits are systematically different: eQTLs cluster strongly near transcription start sites, whereas GWAS hits do not. Genes near GWAS hits are enriched in key functional annotations, are under strong selective constraint and have complex regulatory landscapes across different tissue/cell types, whereas genes near eQTLs are depleted of most functional annotations, show relaxed constraint, and have simpler regulatory landscapes. We describe a model to understand these observations, including how natural selection on complex traits hinders discovery of functionally relevant eQTLs. Our results imply that GWAS and eQTL studies are systematically biased toward different types of variant, and support the use of complementary functional approaches alongside the next generation of eQTL studies.


Asunto(s)
Estudio de Asociación del Genoma Completo , Herencia Multifactorial , Regulación de la Expresión Génica/genética , Sitios de Carácter Cuantitativo/genética , Expresión Génica , Polimorfismo de Nucleótido Simple/genética
4.
Cell Genom ; 3(10): 100401, 2023 Oct 11.
Artículo en Inglés | MEDLINE | ID: mdl-37868038

RESUMEN

Each human genome has tens of thousands of rare genetic variants; however, identifying impactful rare variants remains a major challenge. We demonstrate how use of personal multi-omics can enable identification of impactful rare variants by using the Multi-Ethnic Study of Atherosclerosis, which included several hundred individuals, with whole-genome sequencing, transcriptomes, methylomes, and proteomes collected across two time points, 10 years apart. We evaluated each multi-omics phenotype's ability to separately and jointly inform functional rare variation. By combining expression and protein data, we observed rare stop variants 62 times and rare frameshift variants 216 times as frequently as controls, compared to 13-27 times as frequently for expression or protein effects alone. We extended a Bayesian hierarchical model, "Watershed," to prioritize specific rare variants underlying multi-omics signals across the regulatory cascade. With this approach, we identified rare variants that exhibited large effect sizes on multiple complex traits including height, schizophrenia, and Alzheimer's disease.

5.
bioRxiv ; 2023 Oct 24.
Artículo en Inglés | MEDLINE | ID: mdl-37745614

RESUMEN

The effects of genetic variation on complex traits act mainly through changes in gene regulation. Although many genetic variants have been linked to target genes in cis, the trans-regulatory cascade mediating their effects remains largely uncharacterized. Mapping trans-regulators based on natural genetic variation, including eQTL mapping, has been challenging due to small effects. Experimental perturbation approaches offer a complementary and powerful approach to mapping trans-regulators. We used CRISPR knockouts of 84 genes in primary CD4+ T cells to perturb an immune cell gene network, targeting both inborn error of immunity (IEI) disease transcription factors (TFs) and background TFs matched in constraint and expression level, but without a known immune disease association. We developed a novel Bayesian structure learning method called Linear Latent Causal Bayes (LLCB) to estimate the gene regulatory network from perturbation data and observed 211 directed edges among the genes which could not be detected in existing CD4+ trans-eQTL data. We used LLCB to characterize the differences between the IEI and background TFs, finding that the gene groups were highly interconnected, but that IEI TFs were much more likely to regulate immune cell specific pathways and immune GWAS genes. We further characterized nine coherent gene programs based on downstream effects of the TFs and linked these modules to regulation of GWAS genes, finding that canonical JAK-STAT family members are regulated by KMT2A, a global epigenetic regulator. These analyses reveal the trans-regulatory cascade from upstream epigenetic regulator to intermediate TFs to downstream effector cytokines and elucidate the logic linking immune GWAS genes to key signaling pathways.

6.
Genetics ; 225(3)2023 11 01.
Artículo en Inglés | MEDLINE | ID: mdl-37724741

RESUMEN

The discrete-time Wright-Fisher (DTWF) model and its diffusion limit are central to population genetics. These models can describe the forward-in-time evolution of allele frequencies in a population resulting from genetic drift, mutation, and selection. Computing likelihoods under the diffusion process is feasible, but the diffusion approximation breaks down for large samples or in the presence of strong selection. Existing methods for computing likelihoods under the DTWF model do not scale to current exome sequencing sample sizes in the hundreds of thousands. Here, we present a scalable algorithm that approximates the DTWF model with provably bounded error. Our approach relies on two key observations about the DTWF model. The first is that transition probabilities under the model are approximately sparse. The second is that transition distributions for similar starting allele frequencies are extremely close as distributions. Together, these observations enable approximate matrix-vector multiplication in linear (as opposed to the usual quadratic) time. We prove similar properties for Hypergeometric distributions, enabling fast computation of likelihoods for subsamples of the population. We show theoretically and in practice that this approximation is highly accurate and can scale to population sizes in the tens of millions, paving the way for rigorous biobank-scale inference. Finally, we use our results to estimate the impact of larger samples on estimating selection coefficients for loss-of-function variants. We find that increasing sample sizes beyond existing large exome sequencing cohorts will provide essentially no additional information except for genes with the most extreme fitness effects.


Asunto(s)
Bancos de Muestras Biológicas , Genética de Población , Frecuencia de los Genes , Flujo Genético , Probabilidad , Modelos Genéticos , Selección Genética
7.
Nat Ecol Evol ; 7(9): 1515-1524, 2023 09.
Artículo en Inglés | MEDLINE | ID: mdl-37592021

RESUMEN

The Iron Age was a dynamic period in central Mediterranean history, with the expansion of Greek and Phoenician colonies and the growth of Carthage into the dominant maritime power of the Mediterranean. These events were facilitated by the ease of long-distance travel following major advances in seafaring. We know from the archaeological record that trade goods and materials were moving across great distances in unprecedented quantities, but it is unclear how these patterns correlate with human mobility. Here, to investigate population mobility and interactions directly, we sequenced the genomes of 30 ancient individuals from coastal cities around the central Mediterranean, in Tunisia, Sardinia and central Italy. We observe a meaningful contribution of autochthonous populations, as well as highly heterogeneous ancestry including many individuals with non-local ancestries from other parts of the Mediterranean region. These results highlight both the role of local populations and the extreme interconnectedness of populations in the Iron Age Mediterranean. By studying these trans-Mediterranean neighbours together, we explore the complex interplay between local continuity and mobility that shaped the Iron Age societies of the central Mediterranean.


Asunto(s)
ADN Antiguo , Migración Humana , Región Mediterránea , Arqueología , Migración Humana/historia , Humanos , Análisis de Componente Principal , Genética Humana , ADN Antiguo/análisis , Análisis de Secuencia de ADN , Entierro , Antropología , Historia Antigua
8.
Science ; 381(6658): eade6289, 2023 08 11.
Artículo en Inglés | MEDLINE | ID: mdl-37561850

RESUMEN

Skin color, one of the most diverse human traits, is determined by the quantity, type, and distribution of melanin. In this study, we leveraged the light-scattering properties of melanin to conduct a genome-wide screen for regulators of melanogenesis. We identified 169 functionally diverse genes that converge on melanosome biogenesis, endosomal transport, and gene regulation, of which 135 represented previously unknown associations with pigmentation. In agreement with their melanin-promoting function, the majority of screen hits were up-regulated in melanocytes from darkly pigmented individuals. We further unraveled functions of KLF6 as a transcription factor that regulates melanosome maturation and pigmentation in vivo, and of the endosomal trafficking protein COMMD3 in modulating melanosomal pH. Our study reveals a plethora of melanin-promoting genes, with broad implications for human variation, cell biology, and medicine.


Asunto(s)
Proteínas Adaptadoras Transductoras de Señales , Factor 6 Similar a Kruppel , Melaninas , Melanocitos , Melanosomas , Pigmentación de la Piel , Humanos , Melaninas/biosíntesis , Melaninas/genética , Melanocitos/metabolismo , Melanosomas/metabolismo , Pigmentación de la Piel/genética , Estudio de Asociación del Genoma Completo , Proteínas Adaptadoras Transductoras de Señales/genética , Proteínas Adaptadoras Transductoras de Señales/metabolismo , Factor 6 Similar a Kruppel/genética , Factor 6 Similar a Kruppel/metabolismo , Endosomas/metabolismo , Animales , Ratones , Línea Celular Tumoral
9.
Nature ; 621(7977): 188-195, 2023 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-37648854

RESUMEN

γδ T cells are potent anticancer effectors with the potential to target tumours broadly, independent of patient-specific neoantigens or human leukocyte antigen background1-5. γδ T cells can sense conserved cell stress signals prevalent in transformed cells2,3, although the mechanisms behind the targeting of stressed target cells remain poorly characterized. Vγ9Vδ2 T cells-the most abundant subset of human γδ T cells4-recognize a protein complex containing butyrophilin 2A1 (BTN2A1) and BTN3A1 (refs. 6-8), a widely expressed cell surface protein that is activated by phosphoantigens abundantly produced by tumour cells. Here we combined genome-wide CRISPR screens in target cancer cells to identify pathways that regulate γδ T cell killing and BTN3A cell surface expression. The screens showed previously unappreciated multilayered regulation of BTN3A abundance on the cell surface and triggering of γδ T cells through transcription, post-translational modifications and membrane trafficking. In addition, diverse genetic perturbations and inhibitors disrupting metabolic pathways in the cancer cells, particularly ATP-producing processes, were found to alter BTN3A levels. This induction of both BTN3A and BTN2A1 during metabolic crises is dependent on AMP-activated protein kinase (AMPK). Finally, small-molecule activation of AMPK in a cell line model and in patient-derived tumour organoids led to increased expression of the BTN2A1-BTN3A complex and increased Vγ9Vδ2 T cell receptor-mediated killing. This AMPK-dependent mechanism of metabolic stress-induced ligand upregulation deepens our understanding of γδ T cell stress surveillance and suggests new avenues available to enhance γδ T cell anticancer activity.


Asunto(s)
Sistemas CRISPR-Cas , Edición Génica , Neoplasias , Receptores de Antígenos de Linfocitos T gamma-delta , Linfocitos T , Humanos , Proteínas Quinasas Activadas por AMP/genética , Proteínas Quinasas Activadas por AMP/metabolismo , Línea Celular , Membrana Celular/metabolismo , Neoplasias/genética , Neoplasias/inmunología , Neoplasias/metabolismo , Receptores de Antígenos de Linfocitos T gamma-delta/inmunología , Receptores de Antígenos de Linfocitos T gamma-delta/metabolismo , Linfocitos T/inmunología , Linfocitos T/metabolismo
10.
Res Sq ; 2023 Jun 13.
Artículo en Inglés | MEDLINE | ID: mdl-37398424

RESUMEN

Measures of selective constraint on genes have been used for many applications including clinical interpretation of rare coding variants, disease gene discovery, and studies of genome evolution. However, widely-used metrics are severely underpowered at detecting constraint for the shortest ~25% of genes, potentially causing important pathogenic mutations to be overlooked. We developed a framework combining a population genetics model with machine learning on gene features to enable accurate inference of an interpretable constraint metric, shet. Our estimates outperform existing metrics for prioritizing genes important for cell essentiality, human disease, and other phenotypes, especially for short genes. Our new estimates of selective constraint should have wide utility for characterizing genes relevant to human disease. Finally, our inference framework, GeneBayes, provides a flexible platform that can improve estimation of many gene-level properties, such as rare variant burden or gene expression differences.

11.
Genetics ; 224(3)2023 Jul 06.
Artículo en Inglés | MEDLINE | ID: mdl-37410594

RESUMEN

Members of genetically admixed populations possess ancestry from multiple source groups, and studies of human genetic admixture frequently estimate ancestry components corresponding to fractions of individual genomes that trace to specific ancestral populations. However, the same numerical ancestry fraction can represent a wide array of admixture scenarios within an individual's genealogy. Using a mechanistic model of admixture, we consider admixture genealogically: how many ancestors from the source populations does the admixture represent? We consider African-Americans, for whom continent-level estimates produce a 75-85% value for African ancestry on average and 15-25% for European ancestry. Genetic studies together with key features of African-American demographic history suggest ranges for parameters of a simple three-epoch model. Considering parameter sets compatible with estimates of current ancestry levels, we infer that if all genealogical lines of a random African-American born during 1960-1965 are traced back until they reach members of source populations, the mean over parameter sets of the expected number of genealogical lines terminating with African individuals is 314 (interquartile range 240-376), and the mean of the expected number terminating in Europeans is 51 (interquartile range 32-69). Across discrete generations, the peak number of African genealogical ancestors occurs in birth cohorts from the early 1700s, and the probability exceeds 50% that at least one European ancestor was born more recently than 1835. Our genealogical perspective can contribute to further understanding the admixture processes that underlie admixed populations. For African-Americans, the results provide insight both on how many of the ancestors of a typical African-American might have been forcibly displaced in the Transatlantic Slave Trade and on how many separate European admixture events might exist in a typical African-American genealogy.


Asunto(s)
Población Negra , Negro o Afroamericano , Humanos , Población Negra/genética , Negro o Afroamericano/genética , Genética de Población
12.
bioRxiv ; 2023 May 22.
Artículo en Inglés | MEDLINE | ID: mdl-37293115

RESUMEN

The Discrete-Time Wright Fisher (DTWF) model and its large population diffusion limit are central to population genetics. These models describe the forward-in-time evolution of the frequency of an allele in a population and can include the fundamental forces of genetic drift, mutation, and selection. Computing like-lihoods under the diffusion process is feasible, but the diffusion approximation breaks down for large sample sizes or in the presence of strong selection. Unfortunately, existing methods for computing likelihoods under the DTWF model do not scale to current exome sequencing sample sizes in the hundreds of thousands. Here we present an algorithm that approximates the DTWF model with provably bounded error and runs in time linear in the size of the population. Our approach relies on two key observations about Binomial distributions. The first is that Binomial distributions are approximately sparse. The second is that Binomial distributions with similar success probabilities are extremely close as distributions, allowing us to approximate the DTWF Markov transition matrix as a very low rank matrix. Together, these observations enable matrix-vector multiplication in linear (as opposed to the usual quadratic) time. We prove similar properties for Hypergeometric distributions, enabling fast computation of likelihoods for subsamples of the population. We show theoretically and in practice that this approximation is highly accurate and can scale to population sizes in the billions, paving the way for rigorous biobank-scale population genetic inference. Finally, we use our results to estimate how increasing sample sizes will improve the estimation of selection coefficients acting on loss-of-function variants. We find that increasing sample sizes beyond existing large exome sequencing cohorts will provide essentially no additional information except for genes with the most extreme fitness effects.

13.
Elife ; 122023 Jun 12.
Artículo en Inglés | MEDLINE | ID: mdl-37306301

RESUMEN

The formation of paralogs through gene duplication is a core evolutionary process. For paralogs that encode components of protein complexes such as the ribosome, a central question is whether they encode functionally distinct proteins or whether they exist to maintain appropriate total expression of equivalent proteins. Here, we systematically tested evolutionary models of paralog function using the ribosomal protein paralogs Rps27 (eS27) and Rps27l (eS27L) as a case study. Evolutionary analysis suggests that Rps27 and Rps27l likely arose during whole-genome duplication(s) in a common vertebrate ancestor. We show that Rps27 and Rps27l have inversely correlated mRNA abundance across mouse cell types, with the highest Rps27 in lymphocytes and the highest Rps27l in mammary alveolar cells and hepatocytes. By endogenously tagging the Rps27 and Rps27l proteins, we demonstrate that Rps27- and Rps27l-ribosomes associate preferentially with different transcripts. Furthermore, murine Rps27 and Rps27l loss-of-function alleles are homozygous lethal at different developmental stages. However, strikingly, expressing Rps27 protein from the endogenous Rps27l locus or vice versa completely rescues loss-of-function lethality and yields mice with no detectable deficits. Together, these findings suggest that Rps27 and Rps27l are evolutionarily retained because their subfunctionalized expression patterns render both genes necessary to achieve the requisite total expression of two equivalent proteins across cell types. Our work represents the most in-depth characterization of a mammalian ribosomal protein paralog to date and highlights the importance of considering both protein function and expression when investigating paralogs.


Asunto(s)
Proteínas Ribosómicas , Ribosomas , Animales , Ratones , Proteínas Ribosómicas/genética , Proteínas Ribosómicas/metabolismo , Ribosomas/metabolismo , Vertebrados/genética , Genoma , Mamíferos/genética
14.
Genome Res ; 33(5): 689-702, 2023 May.
Artículo en Inglés | MEDLINE | ID: mdl-37127331

RESUMEN

Short tandem repeats (STRs) are a class of rapidly mutating genetic elements typically characterized by repeated units of 1-6 bp. We leveraged whole-genome sequencing data for 152 recombinant inbred (RI) strains from the BXD family of mice to map loci that modulate genome-wide patterns of new mutations arising during parent-to-offspring transmission at STRs. We defined quantitative phenotypes describing the numbers and types of germline STR mutations in each strain and performed quantitative trait locus (QTL) analyses for each of these phenotypes. We identified a locus on Chromosome 13 at which strains inheriting the C57BL/6J (B) haplotype have a higher rate of STR expansions than those inheriting the DBA/2J (D) haplotype. The strongest candidate gene in this locus is Msh3, a known modifier of STR stability in cancer and at pathogenic repeat expansions in mice and humans, as well as a current drug target against Huntington's disease. The D haplotype at this locus harbors a cluster of variants near the 5' end of Msh3, including multiple missense variants near the DNA mismatch recognition domain. In contrast, the B haplotype contains a unique retrotransposon insertion. The rate of expansion covaries positively with Msh3 expression-with higher expression from the B haplotype. Finally, detailed analysis of mutation patterns showed that strains carrying the B allele have higher expansion rates, but slightly lower overall total mutation rates, compared with those with the D allele, particularly at tetranucleotide repeats. Our results suggest an important role for inherited variants in Msh3 in modulating genome-wide patterns of germline mutations at STRs.


Asunto(s)
Repeticiones de Microsatélite , Sitios de Carácter Cuantitativo , Animales , Ratones , Haplotipos , Ratones Endogámicos C57BL , Ratones Endogámicos DBA
15.
Nat Genet ; 55(5): 841-851, 2023 05.
Artículo en Inglés | MEDLINE | ID: mdl-37024583

RESUMEN

Transcriptional regulation exhibits extensive robustness, but human genetics indicates sensitivity to transcription factor (TF) dosage. Reconciling such observations requires quantitative studies of TF dosage effects at trait-relevant ranges, largely lacking so far. TFs play central roles in both normal-range and disease-associated variation in craniofacial morphology; we therefore developed an approach to precisely modulate TF levels in human facial progenitor cells and applied it to SOX9, a TF associated with craniofacial variation and disease (Pierre Robin sequence (PRS)). Most SOX9-dependent regulatory elements (REs) are buffered against small decreases in SOX9 dosage, but REs directly and primarily regulated by SOX9 show heightened sensitivity to SOX9 dosage; these RE responses partially predict gene expression responses. Sensitive REs and genes preferentially affect functional chondrogenesis and PRS-like craniofacial shape variation. We propose that such REs and genes underlie the sensitivity of specific phenotypes to TF dosage, while buffering of other genes leads to robust, nonlinear dosage-to-phenotype relationships.


Asunto(s)
Síndrome de Pierre Robin , Factor de Transcripción SOX9 , Humanos , Factor de Transcripción SOX9/genética , Síndrome de Pierre Robin/genética , Regulación de la Expresión Génica , Secuencias Reguladoras de Ácidos Nucleicos , Fenotipo
16.
bioRxiv ; 2023 Dec 04.
Artículo en Inglés | MEDLINE | ID: mdl-36778251

RESUMEN

With hundreds of copies of ribosomal DNA (rDNA) it is unknown whether they possess sequence variations that ultimately form different types of ribosomes. Here, we developed an algorithm for variant-calling between paralog genes (termed RGA) and compared rDNA variations with rRNA variations from long-read sequencing of translating ribosomes (RIBO-RT). Our analyses identified dozens of highly abundant rRNA variants, largely indels, that are incorporated into translationally active ribosomes and assemble into distinct ribosome subtypes encoded on different chromosomes. We developed an in-situ rRNA sequencing method (SWITCH-seq) revealing that variants are co-expressed within individual cells and found that they possess different structures. Lastly, we observed tissue-specific rRNA-subtype expression and linked specific rRNA variants to cancer. This study therefore reveals the variation landscape of translating ribosomes within human cells.

17.
Elife ; 112022 09 08.
Artículo en Inglés | MEDLINE | ID: mdl-36073519

RESUMEN

Pleiotropy and genetic correlation are widespread features in genome-wide association studies (GWAS), but they are often difficult to interpret at the molecular level. Here, we perform GWAS of 16 metabolites clustered at the intersection of amino acid catabolism, glycolysis, and ketone body metabolism in a subset of UK Biobank. We utilize the well-documented biochemistry jointly impacting these metabolites to analyze pleiotropic effects in the context of their pathways. Among the 213 lead GWAS hits, we find a strong enrichment for genes encoding pathway-relevant enzymes and transporters. We demonstrate that the effect directions of variants acting on biology between metabolite pairs often contrast with those of upstream or downstream variants as well as the polygenic background. Thus, we find that these outlier variants often reflect biology local to the traits. Finally, we explore the implications for interpreting disease GWAS, underscoring the potential of unifying biochemistry with dense metabolomics data to understand the molecular basis of pleiotropy in complex traits and diseases.


Asunto(s)
Pleiotropía Genética , Estudio de Asociación del Genoma Completo , Aminoácidos/genética , Cetonas , Fenotipo , Polimorfismo de Nucleótido Simple
18.
Nature ; 608(7923): 569-577, 2022 08.
Artículo en Inglés | MEDLINE | ID: mdl-35922514

RESUMEN

A major challenge in human genetics is to identify the molecular mechanisms of trait-associated and disease-associated variants. To achieve this, quantitative trait locus (QTL) mapping of genetic variants with intermediate molecular phenotypes such as gene expression and splicing have been widely adopted1,2. However, despite successes, the molecular basis for a considerable fraction of trait-associated and disease-associated variants remains unclear3,4. Here we show that ADAR-mediated adenosine-to-inosine RNA editing, a post-transcriptional event vital for suppressing cellular double-stranded RNA (dsRNA)-mediated innate immune interferon responses5-11, is an important potential mechanism underlying genetic variants associated with common inflammatory diseases. We identified and characterized 30,319 cis-RNA editing QTLs (edQTLs) across 49 human tissues. These edQTLs were significantly enriched in genome-wide association study signals for autoimmune and immune-mediated diseases. Colocalization analysis of edQTLs with disease risk loci further pinpointed key, putatively immunogenic dsRNAs formed by expected inverted repeat Alu elements as well as unexpected, highly over-represented cis-natural antisense transcripts. Furthermore, inflammatory disease risk variants, in aggregate, were associated with reduced editing of nearby dsRNAs and induced interferon responses in inflammatory diseases. This unique directional effect agrees with the established mechanism that lack of RNA editing by ADAR1 leads to the specific activation of the dsRNA sensor MDA5 and subsequent interferon responses and inflammation7-9. Our findings implicate cellular dsRNA editing and sensing as a previously underappreciated mechanism of common inflammatory diseases.


Asunto(s)
Adenosina Desaminasa , Predisposición Genética a la Enfermedad , Enfermedades del Sistema Inmune , Inflamación , Edición de ARN , ARN Bicatenario , Adenosina/metabolismo , Adenosina Desaminasa/genética , Adenosina Desaminasa/metabolismo , Elementos Alu/genética , Enfermedades Autoinmunes/genética , Enfermedades Autoinmunes/inmunología , Enfermedades Autoinmunes/patología , Estudio de Asociación del Genoma Completo , Humanos , Enfermedades del Sistema Inmune/genética , Enfermedades del Sistema Inmune/inmunología , Enfermedades del Sistema Inmune/patología , Inmunidad Innata , Inflamación/genética , Inflamación/inmunología , Inflamación/patología , Inosina/metabolismo , Helicasa Inducida por Interferón IFIH1/metabolismo , Interferones/genética , Interferones/inmunología , Sitios de Carácter Cuantitativo/genética , Edición de ARN/genética , ARN Bicatenario/genética , Proteínas de Unión al ARN/metabolismo
19.
Nat Genet ; 54(8): 1133-1144, 2022 08.
Artículo en Inglés | MEDLINE | ID: mdl-35817986

RESUMEN

Gene regulatory networks ensure that important genes are expressed at precise levels. When gene expression is sufficiently perturbed, it can lead to disease. To understand how gene expression disruptions percolate through a network, we must first map connections between regulatory genes and their downstream targets. However, we lack comprehensive knowledge of the upstream regulators of most genes. Here, we developed an approach for systematic discovery of upstream regulators of critical immune factors-IL2RA, IL-2 and CTLA4-in primary human T cells. Then, we mapped the network of the target genes of these regulators and putative cis-regulatory elements using CRISPR perturbations, RNA-seq and ATAC-seq. These regulators form densely interconnected networks with extensive feedback loops. Furthermore, this network is enriched for immune-associated disease variants and genes. These results provide insight into how immune-associated disease genes are regulated in T cells and broader principles about the structure of human gene regulatory networks.


Asunto(s)
Redes Reguladoras de Genes , Genes Reguladores , Linfocitos T , Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas , Redes Reguladoras de Genes/genética , Humanos , Linfocitos T/inmunología
20.
Am J Hum Genet ; 109(7): 1286-1297, 2022 07 07.
Artículo en Inglés | MEDLINE | ID: mdl-35716666

RESUMEN

Despite the growing number of genome-wide association studies (GWASs), it remains unclear to what extent gene-by-gene and gene-by-environment interactions influence complex traits in humans. The magnitude of genetic interactions in complex traits has been difficult to quantify because GWASs are generally underpowered to detect individual interactions of small effect. Here, we develop a method to test for genetic interactions that aggregates information across all trait-associated loci. Specifically, we test whether SNPs in regions of European ancestry shared between European American and admixed African American individuals have the same causal effect sizes. We hypothesize that in African Americans, the presence of genetic interactions will drive the causal effect sizes of SNPs in regions of European ancestry to be more similar to those of SNPs in regions of African ancestry. We apply our method to two traits: gene expression in 296 African Americans and 482 European Americans in the Multi-Ethnic Study of Atherosclerosis (MESA) and low-density lipoprotein cholesterol (LDL-C) in 74K African Americans and 296K European Americans in the Million Veteran Program (MVP). We find significant evidence for genetic interactions in our analysis of gene expression; for LDL-C, we observe a similar point estimate, although this is not significant, most likely due to lower statistical power. These results suggest that gene-by-gene or gene-by-environment interactions modify the effect sizes of causal variants in human complex traits.


Asunto(s)
Estudio de Asociación del Genoma Completo , Herencia Multifactorial , LDL-Colesterol , Expresión Génica , Humanos , Herencia Multifactorial/genética , Polimorfismo de Nucleótido Simple/genética , Población Blanca/genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...