Results 1 - 20 of 25
1.
Nature ; 622(7982): 339-347, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37794183

ABSTRACT

Integrating human genomics and proteomics can help elucidate disease mechanisms, identify clinical biomarkers and discover drug targets [1-4]. Because previous proteogenomic studies have focused on common variation via genome-wide association studies, the contribution of rare variants to the plasma proteome remains largely unknown. Here we identify associations between rare protein-coding variants and 2,923 plasma protein abundances measured in 49,736 UK Biobank individuals. Our variant-level exome-wide association study identified 5,433 rare genotype-protein associations, of which 81% were undetected in a previous genome-wide association study of the same cohort [5]. We then looked at aggregate signals using gene-level collapsing analysis, which revealed 1,962 gene-protein associations. Of the 691 gene-level signals from protein-truncating variants, 99.4% were associated with decreased protein levels. STAB1 and STAB2, encoding scavenger receptors involved in plasma protein clearance, emerged as pleiotropic loci, with 77 and 41 protein associations, respectively. We demonstrate the utility of our publicly accessible resource through several applications. These include detailing an allelic series in NLRC4, identifying potential biomarkers for a fatty liver disease-associated variant in HSD17B13 and bolstering phenome-wide association studies by integrating protein quantitative trait loci with protein-truncating variants in collapsing analyses. Finally, we uncover distinct proteomic consequences of clonal haematopoiesis (CH), including an association between TET2-CH and increased FLT3 levels. Our results highlight a considerable role for rare variation in plasma protein abundance and the value of proteogenomics in therapeutic discovery.


Subjects
Biological Specimen Banks , Blood Proteins , Genetic Association Studies , Genomics , Proteomics , Humans , Alleles , Biomarkers/blood , Blood Proteins/analysis , Blood Proteins/genetics , Databases, Factual , Exome/genetics , Hematopoiesis , Mutation , Plasma/chemistry , United Kingdom
2.
Nature ; 597(7877): 527-532, 2021 09.
Article in English | MEDLINE | ID: mdl-34375979

ABSTRACT

Genome-wide association studies have uncovered thousands of common variants associated with human disease, but the contribution of rare variants to common disease remains relatively unexplored. The UK Biobank contains detailed phenotypic data linked to medical records for approximately 500,000 participants, offering an unprecedented opportunity to evaluate the effect of rare variation on a broad collection of traits [1,2]. Here we study the relationships between rare protein-coding variants and 17,361 binary and 1,419 quantitative phenotypes using exome sequencing data from 269,171 UK Biobank participants of European ancestry. Gene-based collapsing analyses revealed 1,703 statistically significant gene-phenotype associations for binary traits, with a median odds ratio of 12.4. Furthermore, 83% of these associations were undetectable via single-variant association tests, emphasizing the power of gene-based collapsing analysis in the setting of high allelic heterogeneity. Gene-phenotype associations were also significantly enriched for loss-of-function-mediated traits and approved drug targets. Finally, we performed ancestry-specific and pan-ancestry collapsing analyses using exome sequencing data from 11,933 UK Biobank participants of African, East Asian or South Asian ancestry. Our results highlight a significant contribution of rare variants to common disease. Summary statistics are publicly available through an interactive portal ( http://azphewas.com/ ).


Subjects
Biological Specimen Banks , Databases, Genetic , Disease/genetics , Exome/genetics , Genetic Variation/genetics , Adult , Aged , Female , Genome-Wide Association Study , Humans , Male , Middle Aged , Phenotype , Proteins/chemistry , Proteins/genetics , United Kingdom , Exome Sequencing
3.
Am J Hum Genet ; 110(3): 487-498, 2023 03 02.
Article in English | MEDLINE | ID: mdl-36809768

ABSTRACT

Genome-wide association studies (GWASs) have established the contribution of common and low-frequency variants to metabolic blood measurements in the UK Biobank (UKB). To complement existing GWAS findings, we assessed the contribution of rare protein-coding variants in relation to 355 metabolic blood measurements, including 325 predominantly lipid-related nuclear magnetic resonance (NMR)-derived blood metabolite measurements (Nightingale Health Plc) and 30 clinical blood biomarkers, using 412,393 exome sequences from four genetically diverse ancestries in the UKB. Gene-level collapsing analyses were conducted to evaluate a diverse range of rare-variant architectures for the metabolic blood measurements. Altogether, we identified significant associations (p < 1 × 10⁻⁸) for 205 distinct genes that involved 1,968 significant relationships for the Nightingale blood metabolite measurements and 331 for the clinical blood biomarkers. These include associations for rare non-synonymous variants in PLIN1 and CREB3L3 with lipid metabolite measurements and SYT7 with creatinine, among others, which may not only provide insights into novel biology but also deepen our understanding of established disease mechanisms. Of the study-wide significant clinical biomarker associations, 40% were not previously detected on analyzing coding variants in a GWAS in the same cohort, reinforcing the importance of studying rare variation to fully understand the genetic architecture of metabolic blood measurements.


Subjects
Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Biological Specimen Banks , Biomarkers , Lipids , United Kingdom , Polymorphism, Single Nucleotide
4.
Am J Hum Genet ; 109(12): 2105-2109, 2022 12 01.
Article in English | MEDLINE | ID: mdl-36459978

ABSTRACT

Synonymous mutations change the DNA sequence of a gene without affecting the amino acid sequence of the encoded protein. Although some synonymous mutations can affect RNA splicing, translational efficiency, and mRNA stability, studies in human genetics, mutagenesis screens, and other experiments and evolutionary analyses have repeatedly shown that most synonymous variants are neutral or only weakly deleterious, with some notable exceptions. Based on a recent study in yeast, there have been claims that synonymous mutations could be as important as nonsynonymous mutations in causing disease, assuming the yeast findings hold up and translate to humans. Here, we argue that there is insufficient evidence to overturn the large, coherent body of knowledge establishing the predominant neutrality of synonymous variants in the human genome.


Subjects
Biological Evolution , Saccharomyces cerevisiae , Humans , Mutation/genetics , Amino Acid Sequence , Genome, Human/genetics
5.
Nucleic Acids Res ; 50(8): 4289-4301, 2022 05 06.
Article in English | MEDLINE | ID: mdl-35474393

ABSTRACT

Large-scale phenome-wide association studies performed using densely-phenotyped cohorts such as the UK Biobank (UKB) reveal many statistically robust gene-phenotype relationships for both clinical and continuous traits. Here, we present Gene-SCOUT, a tool used to identify genes with similar continuous trait fingerprints to a gene of interest. A fingerprint reflects the continuous traits identified to be statistically associated with a gene of interest based on multiple underlying rare variant genetic architectures. Similarities between genes are evaluated by the cosine similarity measure, to capture concordant effect directionality, elucidating clusters of genes in a high dimensional space. The underlying gene-biomarker population-scale association statistics were obtained from a gene-level rare variant collapsing analysis performed on over 1500 continuous traits using 394 692 UKB participant exomes, with additional metabolomic trait associations provided through Nightingale Health's recent study of 121 394 of these participants. We demonstrate that gene similarity estimates from Gene-SCOUT provide stronger enrichments for clinical traits compared to existing methods. Furthermore, we provide a fully interactive web-resource (http://genescout.public.cgr.astrazeneca.com) to explore the pre-calculated exome-wide similarities. This resource enables a user to examine the biological relevance of the most similar genes for Gene Ontology (GO) enrichment and UKB clinical trait enrichment statistics, as well as a detailed breakdown of the traits underpinning a given fingerprint.


Subjects
Genome-Wide Association Study , Phenomics , Humans , Genome-Wide Association Study/methods , Phenotype , Exome Sequencing , Exome , Polymorphism, Single Nucleotide
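
The cosine similarity measure named in the Gene-SCOUT abstract above can be illustrated with a minimal sketch. It assumes that each gene's fingerprint is a vector of signed effect estimates over the same ordered set of continuous traits; the function name, the three example fingerprints and their values are invented for illustration and are not taken from the tool or its data.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("fingerprints must cover the same traits")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("an all-zero fingerprint has no direction")
    return dot / (norm_a * norm_b)


# Hypothetical fingerprints: signed effect sizes for three traits.
gene_a = [0.5, -0.25, 0.0]
gene_b = [1.0, -0.5, 0.0]    # same direction as gene_a, twice the magnitude
gene_c = [-0.5, 0.25, 0.0]   # opposite direction to gene_a

print(cosine_similarity(gene_a, gene_b))
print(cosine_similarity(gene_a, gene_c))
```

Because the measure depends on direction and not on magnitude, two genes whose effects are concordant across traits score close to 1, while genes with opposing effects score close to -1.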
6.
Am J Hum Genet ; 106(5): 659-678, 2020 05 07.
Article in English | MEDLINE | ID: mdl-32386536

ABSTRACT

Access to large-scale genomics datasets has increased the utility of hypothesis-free genome-wide analyses. However, gene signals are often insufficiently powered to reach experiment-wide significance, triggering a process of laborious triaging of genomic-association-study results. We introduce mantis-ml, a multi-dimensional, multi-step machine-learning framework that allows objective assessment of the biological relevance of genes to disease studies. Mantis-ml is an automated machine-learning framework that follows a multi-model approach of stochastic semi-supervised learning to rank disease-associated genes through iterative learning sessions on random balanced datasets across the protein-coding exome. When applied to a range of human diseases, including chronic kidney disease (CKD), epilepsy, and amyotrophic lateral sclerosis (ALS), mantis-ml achieved an average area under curve (AUC) prediction performance of 0.81-0.89. Critically, to prove its value as a tool that can be used to interpret exome-wide association studies, we overlapped mantis-ml predictions with data from published cohort-level association studies. We found a statistically significant enrichment of high mantis-ml predictions among the highest-ranked genes from hypothesis-free cohort-level statistics, indicating a substantial improvement over the performance of current state-of-the-art methods and pointing to the capture of true prioritization signals for disease-associated genes. Finally, we introduce a generic mantis-ml score (GMS) trained with over 1,200 features as a generic-disease-likelihood estimator, outperforming published gene-level scores. In addition to our tool, we provide a gene prioritization atlas that includes mantis-ml's predictions across ten disease areas and empowers researchers to interactively navigate through the gene-triaging framework. Mantis-ml is an intuitive tool that supports the objective triaging of large-scale genomic discovery studies and enhances our understanding of complex genotype-phenotype associations.


Subjects
Amyotrophic Lateral Sclerosis/genetics , Epilepsy/genetics , Genomics/methods , Renal Insufficiency, Chronic/genetics , Supervised Machine Learning , Animals , Area Under Curve , Deep Learning , Disease Models, Animal , Exome/genetics , Genetic Association Studies , Humans , Mice , Neural Networks, Computer , ROC Curve , Reproducibility of Results , Stochastic Processes
7.
Nature ; 548(7667): 347-351, 2017 08 17.
Article in English | MEDLINE | ID: mdl-28792939

ABSTRACT

A fundamental principle in biology is that the program for early development is established during oogenesis in the form of the maternal transcriptome. How the maternal transcriptome acquires the appropriate content and dosage of transcripts is not fully understood. Here we show that 3' terminal uridylation of mRNA mediated by TUT4 and TUT7 sculpts the mouse maternal transcriptome by eliminating transcripts during oocyte growth. Uridylation mediated by TUT4 and TUT7 is essential for both oocyte maturation and fertility. In comparison to somatic cells, the oocyte transcriptome has a shorter poly(A) tail and a higher relative proportion of terminal oligo-uridylation. Deletion of TUT4 and TUT7 leads to the accumulation of a cohort of transcripts with a high frequency of very short poly(A) tails, and a loss of 3' oligo-uridylation. By contrast, deficiency of TUT4 and TUT7 does not alter gene expression in a variety of somatic cells. In summary, we show that poly(A) tail length and 3' terminal uridylation have essential and specific functions in shaping a functional maternal transcriptome.


Subjects
Maternal Inheritance/genetics , Oocytes/metabolism , Poly A/metabolism , RNA, Messenger/genetics , RNA, Messenger/metabolism , Transcriptome , Uridine Monophosphate/metabolism , Animals , Cell Line , DNA-Binding Proteins/deficiency , DNA-Binding Proteins/genetics , Female , Infertility, Female/genetics , Male , Mice , Mice, Knockout , Mothers , Nucleotidyltransferases/deficiency , Nucleotidyltransferases/genetics , Oocytes/growth & development , Organ Specificity , Poly A/chemistry , RNA Stability
8.
J Am Soc Nephrol ; 30(6): 1109-1122, 2019 06.
Article in English | MEDLINE | ID: mdl-31085678

ABSTRACT

BACKGROUND: Studies have identified many common genetic associations that influence renal function and all-cause CKD, but these explain only a small fraction of variance in these traits. The contribution of rare variants has not been systematically examined. METHODS: We performed exome sequencing of 3150 individuals, who collectively encompassed diverse CKD subtypes, and 9563 controls. To detect causal genes and evaluate the contribution of rare variants we used collapsing analysis, in which we compared the proportion of cases and controls carrying rare variants per gene. RESULTS: The analyses captured five established monogenic causes of CKD: variants in PKD1, PKD2, and COL4A5 achieved study-wide significance, and we observed suggestive case enrichment for COL4A4 and COL4A3. Beyond known disease-associated genes, collapsing analyses incorporating regional variant intolerance identified suggestive dominant signals in CPT2 and several other candidate genes. Biallelic mutations in CPT2 cause carnitine palmitoyltransferase II deficiency, sometimes associated with rhabdomyolysis and acute renal injury. Genetic modifier analysis among cases with APOL1 risk genotypes identified a suggestive signal in AHDC1, implicated in Xia-Gibbs syndrome, which involves intellectual disability and other features. On the basis of the observed distribution of rare variants, we estimate that a two- to three-fold larger cohort would provide 80% power to implicate new genes for all-cause CKD. CONCLUSIONS: This study demonstrates that rare-variant collapsing analyses can validate known genes and identify candidate genes and modifiers for kidney disease. In so doing, these findings provide a motivation for larger-scale investigation of rare-variant risk contributions across major clinical CKD categories.


Subjects
Collagen Type IV/genetics , Exome Sequencing , Genetic Variation/genetics , Protein Kinases/genetics , Renal Insufficiency, Chronic/genetics , TRPP Cation Channels/genetics , Case-Control Studies , Female , Humans , Male , Prognosis , Protein Kinase D2 , Reference Values , Renal Insufficiency, Chronic/diagnosis
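
The per-gene comparison described in the METHODS of the abstract above (the proportion of cases versus controls carrying rare variants in a gene) can be sketched as a test on a 2x2 table. The abstract does not name the statistical test, so the one-sided Fisher's exact test used here is an assumption, as is the function name. The cohort sizes (3150 cases, 9563 controls) come from the abstract; the carrier counts are invented for illustration.

```python
from math import comb


def fisher_exact_greater(case_carriers, n_cases, control_carriers, n_controls):
    """One-sided Fisher's exact p-value for carrier enrichment among cases."""
    total = n_cases + n_controls
    carriers = case_carriers + control_carriers
    denominator = comb(total, carriers)
    upper = min(carriers, n_cases)
    # Sum hypergeometric probabilities over tables with at least as many
    # carrier cases as observed, holding the table margins fixed.
    numerator = sum(
        comb(n_cases, k) * comb(n_controls, carriers - k)
        for k in range(case_carriers, upper + 1)
    )
    return numerator / denominator


# Cohort sizes from the abstract; carrier counts are hypothetical.
p_value = fisher_exact_greater(
    case_carriers=12, n_cases=3150, control_carriers=8, n_controls=9563
)
print(f"one-sided p = {p_value:.2e}")
```

In an exome-wide setting this test would be repeated for every gene, so the resulting p-values need a multiple-testing threshold before any gene is called study-wide significant.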
9.
Nucleic Acids Res ; 45(3): 1079-1090, 2017 02 17.
Article in English | MEDLINE | ID: mdl-28180281

ABSTRACT

MicroRNAs are important genetic regulators in both animals and plants. They have a range of functions spanning development, differentiation, growth, metabolism and disease. The advent of next-generation sequencing technologies has made it a relatively straightforward task to detect these molecules and their relative expression via sequencing. There are a large number of published studies with deposited datasets. However, there are currently few resources that capitalize on these data to better understand the features, distribution and biogenesis of miRNAs. Herein, we focus on Human and Mouse for which the majority of data are available. We reanalyse sequencing data from 461 samples into a coordinated catalog of microRNA expression. We use this to perform large-scale analyses of miRNA function and biogenesis. These analyses include global expression comparison, co-expression of miRNA clusters and the prediction of miRNA strand-specificity and underlying constraints. Additionally, we report for the first time a global analysis of miRNA epi-transcriptomic modifications and assess their prevalence across tissues, samples and families. Finally, we report a list of potentially mis-annotated miRNAs in miRBase based on their aggregated modification profiles. The results have been collated into a comprehensive online repository of miRNA expression and features such as modifications and RNA editing events, which is available at: http://wwwdev.ebi.ac.uk/enright-dev/miratlas. We believe these findings will further contribute to our understanding of miRNA function in animals and benefit the miRNA community in general.


Subjects
MicroRNAs/genetics , MicroRNAs/metabolism , Animals , Databases, Nucleic Acid , Gene Expression , High-Throughput Nucleotide Sequencing , Humans , Mice , Molecular Sequence Annotation , Multigene Family , RNA Processing, Post-Transcriptional , Sequence Analysis, RNA , Transcriptome
10.
Nucleic Acids Res ; 45(21): e177, 2017 Dec 01.
Article in English | MEDLINE | ID: mdl-29036314

ABSTRACT

The discovery of microRNAs (miRNAs) remains an important problem, particularly given the growth of high-throughput sequencing, cell sorting and single cell biology. While a large number of miRNAs have already been annotated, there may well be large numbers of miRNAs that are expressed in very particular cell types and remain elusive. Sequencing allows us to quickly and accurately identify the expression of known miRNAs from small RNA-Seq data. The biogenesis of miRNAs leads to very specific characteristics observed in their sequences. In brief, miRNAs usually have a well-defined 5' end and a more flexible 3' end with the possibility of 3' tailing events, such as uridylation. Previous approaches to the prediction of novel miRNAs usually involve the analysis of structural features of miRNA precursor hairpin sequences obtained from genome sequence. We surmised that it may be possible to identify miRNAs by using these biogenesis features observed directly from sequenced reads, solely or in addition to structural analysis from genome data. To this end, we have developed mirnovo, a machine learning based algorithm, which is able to identify known and novel miRNAs in animals and plants directly from small RNA-Seq data, with or without a reference genome. This method performs comparably to existing tools but is simpler to use and has a reduced run time. Its performance and accuracy have been tested on multiple datasets, including species with poorly assembled genomes, RNaseIII (Drosha and/or Dicer) deficient samples and single cells (at both embryonic and adult stage).


Subjects
High-Throughput Nucleotide Sequencing/methods , Machine Learning , MicroRNAs/chemistry , Sequence Analysis, RNA/methods , Software , Algorithms , Animals , Gene Expression Profiling , Genomics , Humans , Mice , MicroRNAs/metabolism , RNA, Plant/chemistry , RNA, Small Untranslated/chemistry , Ribonuclease III/genetics , Single-Cell Analysis
11.
Bioinformatics ; 33(9): 1418-1420, 2017 05 01.
Article in English | MEDLINE | ID: mdl-28453679

ABSTRACT

Summary: BioPAXViz is a Cytoscape (version 3) application, providing a comprehensive framework for metabolic pathway visualization. Beyond the basic parsing, viewing and browsing roles, the main novel function that BioPAXViz provides is a visual comparative analysis of metabolic pathway topologies across pre-computed pathway phylogenomic profiles given a species phylogeny. Furthermore, BioPAXViz supports the display of hierarchical trees that allow efficient navigation through sets of variants of a single reference pathway. Thus, BioPAXViz can significantly facilitate, and contribute to, the study of metabolic pathway evolution and engineering. Availability and Implementation: BioPAXViz has been developed as a Cytoscape app and is available at: https://github.com/CGU-CERTH/BioPAX.Viz. The software is distributed under the MIT License and is accompanied by example files and data. Additional documentation is available at the aforementioned GitHub repository. Contact: ouzounis@certh.gr.


Subjects
Computational Biology/methods , Evolution, Molecular , Metabolic Networks and Pathways/genetics , Software , Phylogeny
12.
Bioinformatics ; 31(20): 3365-7, 2015 Oct 15.
Article in English | MEDLINE | ID: mdl-26093149

ABSTRACT

Chimira is a web-based system for microRNA (miRNA) analysis from small RNA-Seq data. Sequences are automatically cleaned, trimmed, size selected and mapped directly to miRNA hairpin sequences. This generates count-based miRNA expression data for subsequent statistical analysis. Moreover, it is capable of identifying epi-transcriptomic modifications in the input sequences. Supported modification types include multiple types of 3'-modifications (e.g. uridylation, adenylation), 5'-modifications and also internal modifications or variation (ADAR editing or single nucleotide polymorphisms). Besides cleaning and mapping of input sequences to miRNAs, Chimira provides a simple and intuitive set of tools for the analysis and interpretation of the results (see also Supplementary Material). These allow the visual study of the differential expression between two specific samples or sets of samples, the identification of the most highly expressed miRNAs within sample pairs (or sets of samples) and also the projection of the modification profile for specific miRNAs across all samples. Other tools have already been published for various types of small RNA-Seq analysis, such as UEA workbench, seqBuster, MAGI, OASIS and CAP-miRSeq, and CPSS for identifying modifications. A comprehensive comparison of Chimira with each of these tools is provided in the Supplementary Material. Chimira outperforms all of these tools in total execution speed and aims to facilitate simple, fast and reliable analysis of small RNA-Seq data, also allowing, for the first time, identification of global microRNA modification profiles in a simple intuitive interface. AVAILABILITY AND IMPLEMENTATION: Chimira has been developed as a web application and it is accessible here: http://www.ebi.ac.uk/research/enright/software/chimira. CONTACT: aje@ebi.ac.uk. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subjects
MicroRNAs/chemistry , MicroRNAs/metabolism , Sequence Analysis, RNA/methods , Software , Humans , RNA, Small Untranslated/chemistry
13.
Sci Adv ; 10(19): eadj1424, 2024 May 10.
Article in English | MEDLINE | ID: mdl-38718126

ABSTRACT

The ongoing expansion of human genomic datasets propels therapeutic target identification; however, extracting gene-disease associations from gene annotations remains challenging. Here, we introduce Mantis-ML 2.0, a framework integrating AstraZeneca's Biological Insights Knowledge Graph and numerous tabular datasets, to assess gene-disease probabilities throughout the phenome. We use graph neural networks, capturing the graph's holistic structure, and train them on hundreds of balanced datasets via a robust semi-supervised learning framework to provide gene-disease probabilities across the human exome. Mantis-ML 2.0 incorporates natural language processing to automate disease-relevant feature selection for thousands of diseases. The enhanced models demonstrate a 6.9% average classification power boost, achieving a median receiver operating characteristic (ROC) area under curve (AUC) score of 0.90 across 5220 diseases from Human Phenotype Ontology, OpenTargets, and Genomics England. Notably, Mantis-ML 2.0 prioritizes associations from an independent UK Biobank phenome-wide association study (PheWAS), providing a stronger form of triaging and mitigating against underpowered PheWAS associations. Results are exposed through an interactive web resource.


Subjects
Biological Specimen Banks , Neural Networks, Computer , Humans , Genome-Wide Association Study/methods , Phenotype , United Kingdom , Phenomics/methods , Genetic Predisposition to Disease , Genomics/methods , Databases, Genetic , Algorithms , Computational Biology/methods , UK Biobank
14.
Commun Biol ; 5(1): 1291, 2022 11 24.
Article in English | MEDLINE | ID: mdl-36434048

ABSTRACT

The druggability of targets is a crucial consideration in drug target selection. Here, we adopt a stochastic semi-supervised ML framework to develop DrugnomeAI, which estimates the druggability likelihood for every protein-coding gene in the human exome. DrugnomeAI integrates gene-level properties from 15 sources resulting in 324 features. The tool generates exome-wide predictions based on labelled sets of known drug targets (median AUC: 0.97), highlighting features from protein-protein interaction networks as top predictors. DrugnomeAI provides generic as well as specialised models stratified by disease type or drug therapeutic modality. The top-ranking DrugnomeAI genes were significantly enriched for genes previously selected for clinical development programs (p value < 1 × 10⁻³⁰⁸) and for genes achieving genome-wide significance in phenome-wide association studies of 450 K UK Biobank exomes for binary (p value = 1.7 × 10⁻⁵) and quantitative traits (p value = 1.6 × 10⁻⁷). We accompany our method with a web application ( http://drugnomeai.public.cgr.astrazeneca.com ) to visualise the druggability predictions and the key features that define gene druggability, per disease type and modality.


Subjects
Machine Learning , Software , Humans , Drug Delivery Systems
15.
Sci Adv ; 8(34): eabo6371, 2022 08 26.
Article in English | MEDLINE | ID: mdl-36026442

ABSTRACT

Large reference datasets of protein-coding variation in human populations have allowed us to determine which genes and genic subregions are intolerant to germline genetic variation. There is also a growing number of genes implicated in severe Mendelian diseases that overlap with genes implicated in cancer. We hypothesized that cancer-driving mutations might be enriched in genic subregions that are depleted of germline variation relative to somatic variation. We introduce a new metric, OncMTR (oncology missense tolerance ratio), which uses 125,748 exomes in the Genome Aggregation Database (gnomAD) to identify these genic subregions. We demonstrate that OncMTR can significantly predict driver mutations implicated in hematologic malignancies. Divergent OncMTR regions were enriched for cancer-relevant protein domains, and overlaying OncMTR scores on protein structures identified functionally important protein residues. Last, we performed a rare variant, gene-based collapsing analysis on an independent set of 394,694 exomes from the UK Biobank and find that OncMTR markedly improves genetic signals for hematologic malignancies.


Subjects
Germ-Line Mutation , Hematologic Neoplasms , Germ Cells , Hematologic Neoplasms/genetics , Humans
16.
Sci Adv ; 8(46): eadd5430, 2022 11 18.
Article in English | MEDLINE | ID: mdl-36383675

ABSTRACT

We performed collapsing analyses on 454,796 UK Biobank (UKB) exomes to detect gene-level associations with diabetes. Recessive carriers of nonsynonymous variants in MAP3K15 were 30% less likely to develop diabetes (P = 5.7 × 10⁻¹⁰) and had lower glycosylated hemoglobin (β = -0.14 SD units, P = 1.1 × 10⁻²⁴). These associations were independent of body mass index, suggesting protection against insulin resistance even in the setting of obesity. We replicated these findings in 96,811 Admixed Americans in the Mexico City Prospective Study (P < 0.05). Moreover, the protective effect of MAP3K15 variants was stronger in individuals who did not carry the Latino-enriched SLC16A11 risk haplotype (P = 6.0 × 10⁻⁴). Separately, we identified a Finnish-enriched MAP3K15 protein-truncating variant associated with decreased odds of both type 1 and type 2 diabetes (P < 0.05) in FinnGen. No adverse phenotypes were associated with protein-truncating MAP3K15 variants in the UKB, supporting this gene as a therapeutic target for diabetes.


Subjects
Diabetes Mellitus, Type 2 , MAP Kinase Kinase Kinases , Humans , Diabetes Mellitus, Type 2/genetics , Genetic Predisposition to Disease , Monocarboxylic Acid Transporters/genetics , Obesity/genetics , Prospective Studies , MAP Kinase Kinase Kinases/genetics
17.
Nat Commun ; 12(1): 1504, 2021 03 08.
Article in English | MEDLINE | ID: mdl-33686085

ABSTRACT

Elucidating functionality in non-coding regions is a key challenge in human genomics. It has been shown that intolerance to variation of coding and proximal non-coding sequence is a strong predictor of human disease relevance. Here, we integrate intolerance to variation, functional genomic annotations and primary genomic sequence to build JARVIS: a comprehensive deep learning model to prioritize non-coding regions, outperforming other human lineage-specific scores. Despite being agnostic to evolutionary conservation, JARVIS performs comparably or outperforms conservation-based scores in classifying pathogenic single-nucleotide and structural variants. In constructing JARVIS, we introduce the genome-wide residual variation intolerance score (gwRVIS), applying a sliding-window approach to whole genome sequencing data from 62,784 individuals. gwRVIS distinguishes Mendelian disease genes from more tolerant CCDS regions and highlights ultra-conserved non-coding elements as the most intolerant regions in the human genome. Both JARVIS and gwRVIS capture previously inaccessible human-lineage constraint information and will enhance our understanding of the non-coding genome.


Subjects
Deep Learning , Genome, Human , Genomics , DNA, Intergenic , Genetic Variation , Humans , Sequence Analysis, DNA , Whole Genome Sequencing
18.
Commun Biol ; 4(1): 392, 2021 03 23.
Article in English | MEDLINE | ID: mdl-33758299

ABSTRACT

Idiopathic pulmonary fibrosis (IPF) is a fatal disorder characterised by progressive, destructive lung scarring. Despite substantial progress, the genetic determinants of this disease remain incompletely defined. Using whole genome and whole exome sequencing data from 752 individuals with sporadic IPF and 119,055 UK Biobank controls, we performed a variant-level exome-wide association study (ExWAS) and gene-level collapsing analyses. Our variant-level analysis revealed a novel association between a rare missense variant in SPDL1 and IPF (NM_017785.5:g.169588475 G > A p.Arg20Gln; p = 2.4 × 10⁻⁷, odds ratio = 2.87, 95% confidence interval: 2.03-4.07). This signal was independently replicated in the FinnGen cohort, which contains 1028 cases and 196,986 controls (combined p = 2.2 × 10⁻²⁰), firmly establishing this variant as an IPF risk allele. SPDL1 encodes Spindly, a protein involved in mitotic checkpoint signalling during cell division that has not been previously described in fibrosis. To the best of our knowledge, these results highlight a novel mechanism underlying IPF, providing the potential for new therapeutic discoveries in a disease of great unmet need.


Subjects
Cell Cycle Proteins/genetics , Idiopathic Pulmonary Fibrosis/genetics , Mutation, Missense , Aged , Case-Control Studies , Female , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Idiopathic Pulmonary Fibrosis/diagnosis , Male , Phenotype , Exome Sequencing
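
The abstract above reports an odds ratio with a 95% confidence interval. As a hedged illustration of how such a pair of numbers can be derived from carrier counts, the sketch below uses the Wald interval on the log odds ratio. The abstract does not say which method the authors used, so this choice is an assumption, and the function name is hypothetical. The cohort sizes (752 cases, 119,055 controls) come from the abstract; the carrier counts are invented and are not meant to reproduce the published estimate.

```python
import math


def odds_ratio_wald_ci(a, b, c, d, z=1.96):
    """Odds ratio and Wald confidence interval for a 2x2 table.

    a: carrier cases, b: non-carrier cases,
    c: carrier controls, d: non-carrier controls.
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for the Wald interval")
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the normal approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Cohort sizes from the abstract; carrier counts are hypothetical.
n_cases, n_controls = 752, 119_055
case_carriers, control_carriers = 40, 2_500
estimate = odds_ratio_wald_ci(
    case_carriers,
    n_cases - case_carriers,
    control_carriers,
    n_controls - control_carriers,
)
print("OR = {:.2f}, 95% CI: {:.2f}-{:.2f}".format(*estimate))
```

The interval is symmetric on the log scale, which is why a published interval such as 2.03-4.07 looks lopsided around its point estimate on the natural scale.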
19.
Circ Genom Precis Med ; 13(6): e003030, 2020 12.
Article in English | MEDLINE | ID: mdl-33125268

ABSTRACT

BACKGROUND: Spontaneous coronary artery dissection (SCAD) occurs when an epicardial coronary artery is narrowed or occluded by an intramural hematoma. SCAD mainly affects women and is associated with pregnancy and systemic arteriopathies, particularly fibromuscular dysplasia. Variants in several genes, such as those causing connective tissue disorders, have been implicated; however, the genetic architecture is poorly understood. Here, we aim to better understand the diagnostic yield of rare variant genetic testing among a cohort of SCAD survivors and to identify genes or gene sets that have a significant enrichment of rare variants. METHODS: We sequenced a cohort of 384 SCAD survivors from the United Kingdom, alongside 13 722 UK Biobank controls and a validation cohort of 92 SCAD survivors. We performed a research diagnostic screen for pathogenic variants and exome-wide and gene-set rare variant collapsing analyses. RESULTS: The majority of patients within both cohorts are female; 29% of the study cohort and 14% of the validation cohort have a remote arteriopathy. Four cases across the 2 cohorts had a diagnosed connective tissue disorder. We identified pathogenic or likely pathogenic variants in 7 genes (PKD1, COL3A1, SMAD3, TGFB2, LOX, MYLK, and YY1AP1) in 14/384 cases in the study cohort and in 1/92 cases in the validation cohort. In our rare variant collapsing analysis, PKD1 was the highest-ranked gene, and several functionally plausible genes were enriched for rare variants, although no gene achieved study-wide statistical significance. Gene-set enrichment analysis suggested a role for additional genes involved in renal function. CONCLUSIONS: By studying the largest sequenced cohort of SCAD survivors, we demonstrate that, based on current knowledge, only a small proportion have a pathogenic variant that could explain their disease. Our findings strengthen the overlap between SCAD and renal and connective tissue disorders, and we highlight several new genes for future validation.


Subjects
Coronary Vessel Anomalies/genetics , Exome Sequencing , Genetic Variation , Human Genome , Vascular Diseases/congenital , Adult , Aged , Cohort Studies , Female , Humans , Machine Learning , Male , Middle Aged , Genetic Models , United Kingdom , Vascular Diseases/genetics , Young Adult
20.
Cell Res ; 29(3): 221-232, 2019 03.
Article in English | MEDLINE | ID: mdl-30617251

ABSTRACT

Several developmental stages of spermatogenesis are transcriptionally quiescent, which presents major challenges associated with the regulation of gene expression. Here we identify that the zygotene to pachytene transition is not only associated with the resumption of transcription but also a wave of programmed mRNA degradation that is essential for meiotic progression. We explored whether terminal uridylyl transferase 4- (TUT4-) or TUT7-mediated 3' mRNA uridylation contributes to this wave of mRNA degradation during pachynema. Indeed, both TUT4 and TUT7 are expressed throughout most of spermatogenesis; however, loss of either TUT4 or TUT7 does not have any major impact upon spermatogenesis. Combined TUT4 and TUT7 (TUT4/7) deficiency results in embryonic growth defects, while conditional gene targeting revealed an essential role for TUT4/7 in pachytene progression. Loss of TUT4/7 results in the reduction of miRNA, piRNA and mRNA 3' uridylation. Although this reduction does not greatly alter miRNA or piRNA expression, TUT4/7-mediated uridylation is required for the clearance of many zygotene-expressed transcripts in pachytene cells. We find that TUT4/7-regulated transcripts in pachytene spermatocytes are characterized by having long 3' UTRs with length-adjusted enrichment for AU-rich elements. We also observed these features in TUT4/7-regulated maternal transcripts whose dosage was recently shown to be essential for sculpting a functional maternal transcriptome and meiosis. Therefore, mRNA 3' uridylation is a critical determinant of both male and female germline transcriptomes. In conclusion, we have identified a novel requirement for 3' uridylation-programmed zygotene mRNA clearance in pachytene spermatocytes that is essential for male meiotic progression.


Subjects
Meiotic Prophase I/genetics , Pachytene Stage/genetics , Post-Transcriptional RNA Processing/physiology , Spermatogenesis/genetics , Animals , Female , Male , Mice , Inbred C57BL Mice , RNA Stability/genetics , Messenger RNA/genetics , UDPglucose-Hexose-1-Phosphate Uridylyltransferase/metabolism