Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 37
Filtrar
1.
Nature ; 622(7982): 339-347, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-37794183

RESUMO

Integrating human genomics and proteomics can help elucidate disease mechanisms, identify clinical biomarkers and discover drug targets1-4. Because previous proteogenomic studies have focused on common variation via genome-wide association studies, the contribution of rare variants to the plasma proteome remains largely unknown. Here we identify associations between rare protein-coding variants and 2,923 plasma protein abundances measured in 49,736 UK Biobank individuals. Our variant-level exome-wide association study identified 5,433 rare genotype-protein associations, of which 81% were undetected in a previous genome-wide association study of the same cohort5. We then looked at aggregate signals using gene-level collapsing analysis, which revealed 1,962 gene-protein associations. Of the 691 gene-level signals from protein-truncating variants, 99.4% were associated with decreased protein levels. STAB1 and STAB2, encoding scavenger receptors involved in plasma protein clearance, emerged as pleiotropic loci, with 77 and 41 protein associations, respectively. We demonstrate the utility of our publicly accessible resource through several applications. These include detailing an allelic series in NLRC4, identifying potential biomarkers for a fatty liver disease-associated variant in HSD17B13 and bolstering phenome-wide association studies by integrating protein quantitative trait loci with protein-truncating variants in collapsing analyses. Finally, we uncover distinct proteomic consequences of clonal haematopoiesis (CH), including an association between TET2-CH and increased FLT3 levels. Our results highlight a considerable role for rare variation in plasma protein abundance and the value of proteogenomics in therapeutic discovery.


Assuntos
Bancos de Espécimes Biológicos , Proteínas Sanguíneas , Estudos de Associação Genética , Genômica , Proteômica , Humanos , Alelos , Biomarcadores/sangue , Proteínas Sanguíneas/análise , Proteínas Sanguíneas/genética , Bases de Dados Factuais , Exoma/genética , Hematopoese , Mutação , Plasma/química , Reino Unido
2.
Nature ; 597(7877): 527-532, 2021 09.
Artigo em Inglês | MEDLINE | ID: mdl-34375979

RESUMO

Genome-wide association studies have uncovered thousands of common variants associated with human disease, but the contribution of rare variants to common disease remains relatively unexplored. The UK Biobank contains detailed phenotypic data linked to medical records for approximately 500,000 participants, offering an unprecedented opportunity to evaluate the effect of rare variation on a broad collection of traits1,2. Here we study the relationships between rare protein-coding variants and 17,361 binary and 1,419 quantitative phenotypes using exome sequencing data from 269,171 UK Biobank participants of European ancestry. Gene-based collapsing analyses revealed 1,703 statistically significant gene-phenotype associations for binary traits, with a median odds ratio of 12.4. Furthermore, 83% of these associations were undetectable via single-variant association tests, emphasizing the power of gene-based collapsing analysis in the setting of high allelic heterogeneity. Gene-phenotype associations were also significantly enriched for loss-of-function-mediated traits and approved drug targets. Finally, we performed ancestry-specific and pan-ancestry collapsing analyses using exome sequencing data from 11,933 UK Biobank participants of African, East Asian or South Asian ancestry. Our results highlight a significant contribution of rare variants to common disease. Summary statistics are publicly available through an interactive portal ( http://azphewas.com/ ).


Assuntos
Bancos de Espécimes Biológicos , Bases de Dados Genéticas , Doença/genética , Exoma/genética , Variação Genética/genética , Adulto , Idoso , Feminino , Estudo de Associação Genômica Ampla , Humanos , Masculino , Pessoa de Meia-Idade , Fenótipo , Proteínas/química , Proteínas/genética , Reino Unido , Sequenciamento do Exoma
3.
Am J Hum Genet ; 110(10): 1661-1672, 2023 10 05.
Artigo em Inglês | MEDLINE | ID: mdl-37741276

RESUMO

In the effort to treat Mendelian disorders, correcting the underlying molecular imbalance may be more effective than symptomatic treatment. Identifying treatments that might accomplish this goal requires extensive and up-to-date knowledge of molecular pathways-including drug-gene and gene-gene relationships. To address this challenge, we present "parsing modifiers via article annotations" (PARMESAN), a computational tool that searches PubMed and PubMed Central for information to assemble these relationships into a central knowledge base. PARMESAN then predicts putatively novel drug-gene relationships, assigning an evidence-based score to each prediction. We compare PARMESAN's drug-gene predictions to all of the drug-gene relationships displayed by the Drug-Gene Interaction Database (DGIdb) and show that higher-scoring relationship predictions are more likely to match the directionality (up- versus down-regulation) indicated by this database. PARMESAN had more than 200,000 drug predictions scoring above 8 (as one example cutoff), for more than 3,700 genes. Among these predicted relationships, 210 were registered in DGIdb and 201 (96%) had matching directionality. This publicly available tool provides an automated way to prioritize drug screens to target the most-promising drugs to test, thereby saving time and resources in the development of therapeutics for genetic disorders.


Assuntos
PubMed , Humanos , Bases de Dados Factuais
4.
Am J Hum Genet ; 110(3): 487-498, 2023 03 02.
Artigo em Inglês | MEDLINE | ID: mdl-36809768

RESUMO

Genome-wide association studies (GWASs) have established the contribution of common and low-frequency variants to metabolic blood measurements in the UK Biobank (UKB). To complement existing GWAS findings, we assessed the contribution of rare protein-coding variants in relation to 355 metabolic blood measurements-including 325 predominantly lipid-related nuclear magnetic resonance (NMR)-derived blood metabolite measurements (Nightingale Health Plc) and 30 clinical blood biomarkers-using 412,393 exome sequences from four genetically diverse ancestries in the UKB. Gene-level collapsing analyses were conducted to evaluate a diverse range of rare-variant architectures for the metabolic blood measurements. Altogether, we identified significant associations (p < 1 × 10-8) for 205 distinct genes that involved 1,968 significant relationships for the Nightingale blood metabolite measurements and 331 for the clinical blood biomarkers. These include associations for rare non-synonymous variants in PLIN1 and CREB3L3 with lipid metabolite measurements and SYT7 with creatinine, among others, which may not only provide insights into novel biology but also deepen our understanding of established disease mechanisms. Of the study-wide significant clinical biomarker associations, 40% were not previously detected on analyzing coding variants in a GWAS in the same cohort, reinforcing the importance of studying rare variation to fully understand the genetic architecture of metabolic blood measurements.


Assuntos
Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Bancos de Espécimes Biológicos , Biomarcadores , Lipídeos , Reino Unido , Polimorfismo de Nucleotídeo Único
5.
PLoS Genet ; 19(10): e1010952, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-37782669

RESUMO

Heterozygous de novo loss-of-function mutations in the gene expression regulator HNRNPU cause an early-onset developmental and epileptic encephalopathy. To gain insight into pathological mechanisms and lay the potential groundwork for developing targeted therapies, we characterized the neurophysiologic and cell-type-specific transcriptomic consequences of a mouse model of HNRNPU haploinsufficiency. Heterozygous mutants demonstrated global developmental delay, impaired ultrasonic vocalizations, cognitive dysfunction and increased seizure susceptibility, thus modeling aspects of the human disease. Single-cell RNA-sequencing of hippocampal and neocortical cells revealed widespread, yet modest, dysregulation of gene expression across mutant neuronal subtypes. We observed an increased burden of differentially-expressed genes in mutant excitatory neurons of the subiculum-a region of the hippocampus implicated in temporal lobe epilepsy. Evaluation of transcriptomic signature reversal as a therapeutic strategy highlights the potential importance of generating cell-type-specific signatures. Overall, this work provides insight into HNRNPU-mediated disease mechanisms and provides a framework for using single-cell RNA-sequencing to study transcriptional regulators implicated in disease.


Assuntos
Haploinsuficiência , Transcriptoma , Animais , Humanos , Camundongos , Haploinsuficiência/genética , Ribonucleoproteínas Nucleares Heterogêneas/metabolismo , Neurônios/metabolismo , RNA/metabolismo , Convulsões/genética , Transcriptoma/genética
6.
Am J Hum Genet ; 109(12): 2105-2109, 2022 12 01.
Artigo em Inglês | MEDLINE | ID: mdl-36459978

RESUMO

Synonymous mutations change the DNA sequence of a gene without affecting the amino acid sequence of the encoded protein. Although some synonymous mutations can affect RNA splicing, translational efficiency, and mRNA stability, studies in human genetics, mutagenesis screens, and other experiments and evolutionary analyses have repeatedly shown that most synonymous variants are neutral or only weakly deleterious, with some notable exceptions. Based on a recent study in yeast, there have been claims that synonymous mutations could be as important as nonsynonymous mutations in causing disease, assuming the yeast findings hold up and translate to humans. Here, we argue that there is insufficient evidence to overturn the large, coherent body of knowledge establishing the predominant neutrality of synonymous variants in the human genome.


Assuntos
Evolução Biológica , Saccharomyces cerevisiae , Humanos , Mutação/genética , Sequência de Aminoácidos , Genoma Humano/genética
8.
Proc Natl Acad Sci U S A ; 118(51)2021 12 21.
Artigo em Inglês | MEDLINE | ID: mdl-34903660

RESUMO

Extreme phenotype sequencing has led to the identification of high-impact rare genetic variants for many complex disorders but has not been applied to studies of severe schizophrenia. We sequenced 112 individuals with severe, extremely treatment-resistant schizophrenia, 218 individuals with typical schizophrenia, and 4,929 controls. We compared the burden of rare, damaging missense and loss-of-function variants between severe, extremely treatment-resistant schizophrenia, typical schizophrenia, and controls across mutation intolerant genes. Individuals with severe, extremely treatment-resistant schizophrenia had a high burden of rare loss-of-function (odds ratio, 1.91; 95% CI, 1.39 to 2.63; P = 7.8 × 10-5) and damaging missense variants in intolerant genes (odds ratio, 2.90; 95% CI, 2.02 to 4.15; P = 3.2 × 10-9). A total of 48.2% of individuals with severe, extremely treatment-resistant schizophrenia carried at least one rare, damaging missense or loss-of-function variant in intolerant genes compared to 29.8% of typical schizophrenia individuals (odds ratio, 2.18; 95% CI, 1.33 to 3.60; P = 1.6 × 10-3) and 25.4% of controls (odds ratio, 2.74; 95% CI, 1.85 to 4.06; P = 2.9 × 10-7). Restricting to genes previously associated with schizophrenia risk strengthened the enrichment with 8.9% of individuals with severe, extremely treatment-resistant schizophrenia carrying a damaging missense or loss-of-function variant compared to 2.3% of typical schizophrenia (odds ratio, 5.48; 95% CI, 1.52 to 19.74; P = 0.02) and 1.6% of controls (odds ratio, 5.82; 95% CI, 3.00 to 11.28; P = 2.6 × 10-8). These results demonstrate the power of extreme phenotype case selection in psychiatric genetics and an approach to augment schizophrenia gene discovery efforts.


Assuntos
Predisposição Genética para Doença/genética , Esquizofrenia/genética , Idoso , Transtorno do Espectro Autista/genética , Estudos de Casos e Controles , Deficiências do Desenvolvimento/genética , Feminino , Estudo de Associação Genômica Ampla , Humanos , Mutação com Perda de Função , Masculino , Análise da Randomização Mendeliana , Pessoa de Meia-Idade , Mutação de Sentido Incorreto , Risco , Esquizofrenia Resistente ao Tratamento/genética , Índice de Gravidade de Doença
9.
Am J Hum Genet ; 107(1): 83-95, 2020 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-32516569

RESUMO

Synonymous codon usage has been identified as a determinant of translational efficiency and mRNA stability in model organisms and human cell lines. However, whether natural selection shapes human codon content to optimize translation efficiency is unclear. Furthermore, aside from those that affect splicing, synonymous mutations are typically ignored as potential contributors to disease. Using genetic sequencing data from nearly 200,000 individuals, we uncover clear evidence that natural selection optimizes codon content in the human genome. In deriving intolerance metrics to quantify gene-level constraint on synonymous variation, we discover that dosage-sensitive genes, DNA-damage-response genes, and cell-cycle-regulated genes are particularly intolerant to synonymous variation. Notably, we illustrate that reductions in codon optimality in BRCA1 can attenuate its function. Our results reveal that synonymous mutations most likely play an underappreciated role in human variation.


Assuntos
Uso do Códon/genética , Genoma Humano/genética , Seleção Genética/genética , Códon/genética , Evolução Molecular , Humanos , Mutação/genética , Splicing de RNA/genética , Estabilidade de RNA/genética
10.
Ann Neurol ; 89(2): 199-211, 2021 02.
Artigo em Inglês | MEDLINE | ID: mdl-33159466

RESUMO

Advances in genetic discoveries have created substantial opportunities for precision medicine in neurodevelopmental disorders. Many of the genes implicated in these diseases encode proteins that regulate gene expression, such as chromatin-associated proteins, transcription factors, and RNA-binding proteins. The identification of targeted therapeutics for individuals carrying mutations in these genes remains a challenge, as the encoded proteins can theoretically regulate thousands of downstream targets in a considerable number of cell types. Here, we propose the application of a drug discovery approach originally developed for cancer called "transcriptome reversal" for these neurodevelopmental disorders. This approach attempts to identify compounds that reverse gene-expression signatures associated with disease states. ANN NEUROL 2021;89:199-211.


Assuntos
Regulação da Expressão Gênica/genética , Células-Tronco Neurais/efeitos dos fármacos , Transtornos do Neurodesenvolvimento/tratamento farmacológico , Neurônios/efeitos dos fármacos , Transcriptoma/efeitos dos fármacos , Animais , Anti-Inflamatórios não Esteroides/farmacologia , Anticonvulsivantes/farmacologia , Antidepressivos/farmacologia , Antipsicóticos/farmacologia , Carbamazepina/farmacologia , Simulação por Computador , Descoberta de Drogas , Epirizol/farmacologia , Perfilação da Expressão Gênica , Humanos , Células-Tronco Pluripotentes Induzidas , Células MCF-7 , Camundongos , Naloxona/farmacologia , Antagonistas de Entorpecentes/farmacologia , Células-Tronco Neurais/metabolismo , Transtornos do Neurodesenvolvimento/genética , Neurônios/metabolismo , Células PC-3 , Perfenazina/farmacologia , Cultura Primária de Células , RNA-Seq , Risperidona/farmacologia , Análise de Célula Única , Trazodona/farmacologia , Trimipramina/farmacologia
11.
Epilepsia ; 63(3): 723-735, 2022 03.
Artigo em Inglês | MEDLINE | ID: mdl-35032048

RESUMO

OBJECTIVE: We aimed to identify genes associated with genetic generalized epilepsy (GGE) by combining large cohorts enriched with individuals with a positive family history. Secondarily, we set out to compare the association of genes independently with familial and sporadic GGE. METHODS: We performed a case-control whole exome sequencing study in unrelated individuals of European descent diagnosed with GGE (previously recruited and sequenced through multiple international collaborations) and ancestry-matched controls. The association of ultra-rare variants (URVs; in 18 834 protein-coding genes) with epilepsy was examined in 1928 individuals with GGE (vs. 8578 controls), then separately in 945 individuals with familial GGE (vs. 8626 controls), and finally in 1005 individuals with sporadic GGE (vs. 8621 controls). We additionally examined the association of URVs with familial and sporadic GGE in two gene sets important for inhibitory signaling (19 genes encoding γ-aminobutyric acid type A [GABAA ] receptors, 113 genes representing the GABAergic pathway). RESULTS: GABRG2 was associated with GGE (p = 1.8 × 10-5 ), approaching study-wide significance in familial GGE (p = 3.0 × 10-6 ), whereas no gene approached a significant association with sporadic GGE. Deleterious URVs in the most intolerant subgenic regions in genes encoding GABAA receptors were associated with familial GGE (odds ratio [OR] = 3.9, 95% confidence interval [CI] = 1.9-7.8, false discovery rate [FDR]-adjusted p = .0024), whereas their association with sporadic GGE had marginally lower odds (OR = 3.1, 95% CI = 1.3-6.7, FDR-adjusted p = .022). URVs in GABAergic pathway genes were associated with familial GGE (OR = 1.8, 95% CI = 1.3-2.5, FDR-adjusted p = .0024) but not with sporadic GGE (OR = 1.3, 95% CI = .9-1.9, FDR-adjusted p = .19). SIGNIFICANCE: URVs in GABRG2 are likely an important risk factor for familial GGE. The association of gene sets of GABAergic signaling with familial GGE is more prominent than with sporadic GGE.


Assuntos
Epilepsia Generalizada , Predisposição Genética para Doença , Estudos de Casos e Controles , Epilepsia Generalizada/genética , Predisposição Genética para Doença/genética , Humanos , Receptores de GABA-A/genética , Sequenciamento do Exoma , Ácido gama-Aminobutírico
13.
PLoS Comput Biol ; 14(10): e1006506, 2018 10.
Artigo em Inglês | MEDLINE | ID: mdl-30273353

RESUMO

Here we present an open-source R package 'meaRtools' that provides a platform for analyzing neuronal networks recorded on Microelectrode Arrays (MEAs). Cultured neuronal networks monitored with MEAs are now being widely used to characterize in vitro models of neurological disorders and to evaluate pharmaceutical compounds. meaRtools provides core algorithms for MEA spike train analysis, feature extraction, statistical analysis and plotting of multiple MEA recordings with multiple genotypes and treatments. meaRtools functionality covers novel solutions for spike train analysis, including algorithms to assess electrode cross-correlation using the spike train tiling coefficient (STTC), mutual information, synchronized bursts and entropy within cultured wells. Also integrated is a solution to account for bursts variability originating from mixed-cell neuronal cultures. The package provides a statistical platform built specifically for MEA data that can combine multiple MEA recordings and compare extracted features between different genetic models or treatments. We demonstrate the utilization of meaRtools to successfully identify epilepsy-like phenotypes in neuronal networks from Celf4 knockout mice. The package is freely available under the GPL license (GPL> = 3) and is updated frequently on the CRAN web-server repository. The package, along with full documentation can be downloaded from: https://cran.r-project.org/web/packages/meaRtools/.


Assuntos
Potenciais de Ação/fisiologia , Biologia Computacional/métodos , Neurônios/fisiologia , Software , Algoritmos , Animais , Células Cultivadas , Eletrofisiologia , Camundongos , Camundongos Knockout , Microeletrodos
15.
Genet Med ; 17(10): 774-81, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-25590979

RESUMO

PURPOSE: Despite the recognized clinical value of exome-based diagnostics, methods for comprehensive genomic interpretation remain immature. Diagnoses are based on known or presumed pathogenic variants in genes already associated with a similar phenotype. Here, we extend this paradigm by evaluating novel bioinformatics approaches to aid identification of new gene-disease associations. METHODS: We analyzed 119 trios to identify both diagnostic genotypes in known genes and candidate genotypes in novel genes. We considered qualifying genotypes based on their population frequency and in silico predicted effects we also characterized the patterns of genotypes enriched among this collection of patients. RESULTS: We obtained a genetic diagnosis for 29 (24%) of our patients. We showed that patients carried an excess of damaging de novo mutations in intolerant genes, particularly those shown to be essential in mice (P = 3.4 × 10(-8)). This enrichment is only partially explained by mutations found in known disease-causing genes. CONCLUSION: This work indicates that the application of appropriate bioinformatics analyses to clinical sequence data can also help implicate novel disease genes and suggest expanded phenotypes for known disease genes. These analyses further suggest that some cases resolved by whole-exome sequencing will have direct therapeutic implications.


Assuntos
Exoma , Doenças Genéticas Inatas/diagnóstico , Doenças Genéticas Inatas/genética , Sequenciamento de Nucleotídeos em Larga Escala , Biologia Computacional/métodos , Feminino , Estudos de Associação Genética , Genômica/métodos , Genótipo , Humanos , Masculino , Mutação , Fenótipo
16.
Curr Neurol Neurosci Rep ; 15(10): 70, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26319171

RESUMO

Epilepsy is a serious neurological disease with substantial genetic contribution. We have recently made major advances in understanding the genetics and etiology of the epilepsies. However, current antiepileptic drugs are ineffective in nearly one third of patients. Most of these drugs were developed without knowledge of the underlying causes of the epilepsy to be treated; thus, it seems reasonable to assume that further improvements require a deeper understanding of epilepsy pathophysiology. Although once the rate-limiting step, gene discovery is now occurring at an unprecedented rapid rate, especially in the epileptic encephalopathies. However, to place these genetic findings in a biological context and discover treatment options for patients, we must focus on developing an efficient framework for functional evaluation of the mutations that cause epilepsy. In this review, we discuss guidelines for gene discovery, emerging functional assays and models, and novel therapeutics to highlight the developing framework of precision medicine in the epilepsies.


Assuntos
Epilepsia/genética , Animais , Anticonvulsivantes/uso terapêutico , Eletrodos Implantados , Epilepsia/fisiopatologia , Epilepsia/terapia , Predisposição Genética para Doença , Humanos , Mutação
17.
Dev Cell ; 59(16): 2171-2188.e7, 2024 Aug 19.
Artigo em Inglês | MEDLINE | ID: mdl-39106860

RESUMO

Proneural transcription factors establish molecular cascades to orchestrate neuronal diversity. One such transcription factor, Atonal homolog 1 (Atoh1), gives rise to cerebellar excitatory neurons and over 30 distinct nuclei in the brainstem critical for hearing, breathing, and balance. Although Atoh1 lineage neurons have been qualitatively described, the transcriptional programs that drive their fate decisions and the full extent of their diversity remain unknown. Here, we analyzed single-cell RNA sequencing and ATOH1 DNA binding in Atoh1 lineage neurons of the developing mouse hindbrain. This high-resolution dataset identified markers for specific brainstem nuclei and demonstrated that transcriptionally heterogeneous progenitors require ATOH1 for proper migration. Moreover, we identified a sizable population of proliferating unipolar brush cell progenitors in the mouse Atoh1 lineage, previously described in humans as the origin of one medulloblastoma subtype. Collectively, our data provide insights into the developing mouse hindbrain and markers for functional assessment of understudied neuronal populations.


Assuntos
Fatores de Transcrição Hélice-Alça-Hélice Básicos , Linhagem da Célula , Neurônios , Rombencéfalo , Análise de Célula Única , Transcriptoma , Animais , Rombencéfalo/metabolismo , Rombencéfalo/citologia , Fatores de Transcrição Hélice-Alça-Hélice Básicos/metabolismo , Fatores de Transcrição Hélice-Alça-Hélice Básicos/genética , Camundongos , Neurônios/metabolismo , Neurônios/citologia , Linhagem da Célula/genética , Análise de Célula Única/métodos , Transcriptoma/genética , Diferenciação Celular , Regulação da Expressão Gênica no Desenvolvimento , Neurogênese/genética , Movimento Celular
18.
Sci Adv ; 10(19): eadj1424, 2024 May 10.
Artigo em Inglês | MEDLINE | ID: mdl-38718126

RESUMO

The ongoing expansion of human genomic datasets propels therapeutic target identification; however, extracting gene-disease associations from gene annotations remains challenging. Here, we introduce Mantis-ML 2.0, a framework integrating AstraZeneca's Biological Insights Knowledge Graph and numerous tabular datasets, to assess gene-disease probabilities throughout the phenome. We use graph neural networks, capturing the graph's holistic structure, and train them on hundreds of balanced datasets via a robust semi-supervised learning framework to provide gene-disease probabilities across the human exome. Mantis-ML 2.0 incorporates natural language processing to automate disease-relevant feature selection for thousands of diseases. The enhanced models demonstrate a 6.9% average classification power boost, achieving a median receiver operating characteristic (ROC) area under curve (AUC) score of 0.90 across 5220 diseases from Human Phenotype Ontology, OpenTargets, and Genomics England. Notably, Mantis-ML 2.0 prioritizes associations from an independent UK Biobank phenome-wide association study (PheWAS), providing a stronger form of triaging and mitigating against underpowered PheWAS associations. Results are exposed through an interactive web resource.


Assuntos
Redes Neurais de Computação , Humanos , Algoritmos , Biologia Computacional/métodos , Bases de Dados Genéticas , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla/métodos , Genômica/métodos , Fenômica/métodos , Fenótipo , Biobanco do Reino Unido , Reino Unido
19.
Nat Genet ; 56(9): 1821-1831, 2024 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-39261665

RESUMO

The emergence of biobank-level datasets offers new opportunities to discover novel biomarkers and develop predictive algorithms for human disease. Here, we present an ensemble machine-learning framework (machine learning with phenotype associations, MILTON) utilizing a range of biomarkers to predict 3,213 diseases in the UK Biobank. Leveraging the UK Biobank's longitudinal health record data, MILTON predicts incident disease cases undiagnosed at time of recruitment, largely outperforming available polygenic risk scores. We further demonstrate the utility of MILTON in augmenting genetic association analyses in a phenome-wide association study of 484,230 genome-sequenced samples, along with 46,327 samples with matched plasma proteomics data. This resulted in improved signals for 88 known (P < 1 × 10-8) gene-disease relationships alongside 182 gene-disease relationships that did not achieve genome-wide significance in the nonaugmented baseline cohorts. We validated these discoveries in the FinnGen biobank alongside two orthogonal machine-learning methods built for gene-disease prioritization. All extracted gene-disease associations and incident disease predictive biomarkers are publicly available ( http://milton.public.cgr.astrazeneca.com ).


Assuntos
Bancos de Espécimes Biológicos , Biomarcadores , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Aprendizado de Máquina , Humanos , Reino Unido , Estudo de Associação Genômica Ampla/métodos , Estudos de Casos e Controles , Herança Multifatorial/genética , Proteômica/métodos , Fenótipo , Polimorfismo de Nucleotídeo Único , Algoritmos , Multiômica , Biobanco do Reino Unido
20.
Nat Genet ; 56(9): 1832-1840, 2024 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-39192095

RESUMO

Telomeres protect chromosome ends from damage and their length is linked with human disease and aging. We developed a joint telomere length metric, combining quantitative PCR and whole-genome sequencing measurements from 462,666 UK Biobank participants. This metric increased SNP heritability, suggesting that it better captures genetic regulation of telomere length. Exome-wide rare-variant and gene-level collapsing association studies identified 64 variants and 30 genes significantly associated with telomere length, including allelic series in ACD and RTEL1. Notably, 16% of these genes are known drivers of clonal hematopoiesis-an age-related somatic mosaicism associated with myeloid cancers and several nonmalignant diseases. Somatic variant analyses revealed gene-specific associations with telomere length, including lengthened telomeres in individuals with large SRSF2-mutant clones, compared with shortened telomeres in individuals with clonal expansions driven by other genes. Collectively, our findings demonstrate the impact of rare variants on telomere length, with larger effects observed among genes also associated with clonal hematopoiesis.


Assuntos
Bancos de Espécimes Biológicos , Polimorfismo de Nucleotídeo Único , Telômero , Sequenciamento Completo do Genoma , Humanos , Telômero/genética , Reino Unido , Sequenciamento Completo do Genoma/métodos , Homeostase do Telômero/genética , Masculino , Feminino , Hematopoiese Clonal/genética , Estudo de Associação Genômica Ampla/métodos , Idoso , DNA Helicases/genética , Pessoa de Meia-Idade , Biobanco do Reino Unido
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA