Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 32
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Mol Cell ; 80(6): 1078-1091.e6, 2020 12 17.
Artigo em Inglês | MEDLINE | ID: mdl-33290746

RESUMO

We report that the SARS-CoV-2 nucleocapsid protein (N-protein) undergoes liquid-liquid phase separation (LLPS) with viral RNA. N-protein condenses with specific RNA genomic elements under physiological buffer conditions and condensation is enhanced at human body temperatures (33°C and 37°C) and reduced at room temperature (22°C). RNA sequence and structure in specific genomic regions regulate N-protein condensation while other genomic regions promote condensate dissolution, potentially preventing aggregation of the large genome. At low concentrations, N-protein preferentially crosslinks to specific regions characterized by single-stranded RNA flanked by structured elements and these features specify the location, number, and strength of N-protein binding sites (valency). Liquid-like N-protein condensates form in mammalian cells in a concentration-dependent manner and can be altered by small molecules. Condensation of N-protein is RNA sequence and structure specific, sensitive to human body temperature, and manipulatable with small molecules, and therefore presents a screenable process for identifying antiviral compounds effective against SARS-CoV-2.


Assuntos
COVID-19/metabolismo , Proteínas do Nucleocapsídeo de Coronavírus/metabolismo , Genoma Viral , Nucleocapsídeo/metabolismo , RNA Viral/metabolismo , SARS-CoV-2/metabolismo , Animais , Antivirais/farmacologia , COVID-19/genética , Chlorocebus aethiops , Proteínas do Nucleocapsídeo de Coronavírus/genética , Avaliação Pré-Clínica de Medicamentos , Células HEK293 , Humanos , Nucleocapsídeo/genética , Fosfoproteínas/genética , Fosfoproteínas/metabolismo , SARS-CoV-2/genética , Células Vero , Tratamento Farmacológico da COVID-19
2.
Nat Rev Genet ; 22(12): 774-790, 2021 12.
Artigo em Inglês | MEDLINE | ID: mdl-34341555

RESUMO

Interpreting the effects of genetic variants is key to understanding individual susceptibility to disease and designing personalized therapeutic approaches. Modern experimental technologies are enabling the generation of massive compendia of human genome sequence data and associated molecular and phenotypic traits, together with genome-scale expression, epigenomics and other functional genomic data. Integrative computational models can leverage these data to understand variant impact, elucidate the effect of dysregulated genes on biological pathways in specific disease and tissue contexts, and interpret disease risk beyond what is feasible with experiments alone. In this Review, we discuss recent developments in machine learning algorithms for genome interpretation and for integrative molecular-level modelling of cells, tissues and organs relevant to disease. More specifically, we highlight existing methods and key challenges and opportunities in identifying specific disease-causing genetic variants and linking them to molecular pathways and, ultimately, to disease phenotypes.


Assuntos
Predisposição Genética para Doença , Variação Genética , Modelos Genéticos , Mutação , Epigenômica , Expressão Gênica , Redes Reguladoras de Genes , Genoma Humano , Humanos , Aprendizado de Máquina , Fenótipo
3.
Genome Res ; 31(6): 1097-1105, 2021 06.
Artigo em Inglês | MEDLINE | ID: mdl-33888512

RESUMO

To enable large-scale analyses of transcription regulation in model species, we developed DeepArk, a set of deep learning models of the cis-regulatory activities for four widely studied species: Caenorhabditis elegans, Danio rerio, Drosophila melanogaster, and Mus musculus DeepArk accurately predicts the presence of thousands of different context-specific regulatory features, including chromatin states, histone marks, and transcription factors. In vivo studies show that DeepArk can predict the regulatory impact of any genomic variant (including rare or not previously observed) and enables the regulatory annotation of understudied model species.


Assuntos
Aprendizado Profundo , Drosophila melanogaster , Animais , Caenorhabditis elegans/genética , Drosophila melanogaster/genética , Regulação da Expressão Gênica , Camundongos , Peixe-Zebra/genética
4.
Genome Res ; 31(2): 337-347, 2021 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-33361113

RESUMO

Understanding the changes in diverse molecular pathways underlying the development of breast tumors is critical for improving diagnosis, treatment, and drug development. Here, we used RNA-profiling of canine mammary tumors (CMTs) coupled with a robust analysis framework to model molecular changes in human breast cancer. Our study leveraged a key advantage of the canine model, the frequent presence of multiple naturally occurring tumors at diagnosis, thus providing samples spanning normal tissue and benign and malignant tumors from each patient. We showed human breast cancer signals, at both expression and mutation level, are evident in CMTs. Profiling multiple tumors per patient enabled by the CMT model allowed us to resolve statistically robust transcription patterns and biological pathways specific to malignant tumors versus those arising in benign tumors or shared with normal tissues. We showed that multiple histological samples per patient is necessary to effectively capture these progression-related signatures, and that carcinoma-specific signatures are predictive of survival for human breast cancer patients. To catalyze and support similar analyses and use of the CMT model by other biomedical researchers, we provide FREYA, a robust data processing pipeline and statistical analyses framework.

5.
PLoS Genet ; 15(9): e1008382, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31553718

RESUMO

Comprehensive information on the timing and location of gene expression is fundamental to our understanding of embryonic development and tissue formation. While high-throughput in situ hybridization projects provide invaluable information about developmental gene expression patterns for model organisms like Drosophila, the output of these experiments is primarily qualitative, and a high proportion of protein coding genes and most non-coding genes lack any annotation. Accurate data-centric predictions of spatio-temporal gene expression will therefore complement current in situ hybridization efforts. Here, we applied a machine learning approach by training models on all public gene expression and chromatin data, even from whole-organism experiments, to provide genome-wide, quantitative spatio-temporal predictions for all genes. We developed structured in silico nano-dissection, a computational approach that predicts gene expression in >200 tissue-developmental stages. The algorithm integrates expression signals from a compendium of 6,378 genome-wide expression and chromatin profiling experiments in a cell lineage-aware fashion. We systematically evaluated our performance via cross-validation and experimentally confirmed 22 new predictions for four different embryonic tissues. The model also predicts complex, multi-tissue expression and developmental regulation with high accuracy. We further show the potential of applying these genome-wide predictions to extract tissue specificity signals from non-tissue-dissected experiments, and to prioritize tissues and stages for disease modeling. This resource, together with the exploratory tools are freely available at our webserver http://find.princeton.edu, which provides a valuable tool for a range of applications, from predicting spatio-temporal expression patterns to recognizing tissue signatures from differential gene expression profiles.


Assuntos
Perfilação da Expressão Gênica/métodos , Regulação da Expressão Gênica no Desenvolvimento/genética , Estudo de Associação Genômica Ampla/métodos , Algoritmos , Animais , Biologia Computacional/métodos , Simulação por Computador , Drosophila/genética , Desenvolvimento Embrionário/genética , Previsões/métodos , Regulação da Expressão Gênica no Desenvolvimento/fisiologia , Genes Controladores do Desenvolvimento/genética , Aprendizado de Máquina , Análise Espaço-Temporal , Transcriptoma/genética
6.
Kidney Int ; 98(6): 1502-1518, 2020 12.
Artigo em Inglês | MEDLINE | ID: mdl-33038424

RESUMO

COVID-19 morbidity and mortality are increased via unknown mechanisms in patients with diabetes and kidney disease. SARS-CoV-2 uses angiotensin-converting enzyme 2 (ACE2) for entry into host cells. Because ACE2 is a susceptibility factor for infection, we investigated how diabetic kidney disease and medications alter ACE2 receptor expression in kidneys. Single cell RNA profiling of kidney biopsies from healthy living donors and patients with diabetic kidney disease revealed ACE2 expression primarily in proximal tubular epithelial cells. This cell-specific localization was confirmed by in situ hybridization. ACE2 expression levels were unaltered by exposures to renin-angiotensin-aldosterone system inhibitors in diabetic kidney disease. Bayesian integrative analysis of a large compendium of public -omics datasets identified molecular network modules induced in ACE2-expressing proximal tubular epithelial cells in diabetic kidney disease (searchable at hb.flatironinstitute.org/covid-kidney) that were linked to viral entry, immune activation, endomembrane reorganization, and RNA processing. The diabetic kidney disease ACE2-positive proximal tubular epithelial cell module overlapped with expression patterns seen in SARS-CoV-2-infected cells. Similar cellular programs were seen in ACE2-positive proximal tubular epithelial cells obtained from urine samples of 13 hospitalized patients with COVID-19, suggesting a consistent ACE2-coregulated proximal tubular epithelial cell expression program that may interact with the SARS-CoV-2 infection processes. Thus SARS-CoV-2 receptor networks can seed further research into risk stratification and therapeutic strategies for COVID-19-related kidney damage.


Assuntos
Enzima de Conversão de Angiotensina 2/metabolismo , COVID-19/metabolismo , Nefropatias Diabéticas/metabolismo , Túbulos Renais Proximais/metabolismo , SARS-CoV-2/metabolismo , Adulto , Idoso , Antagonistas de Receptores de Angiotensina/farmacologia , Antagonistas de Receptores de Angiotensina/uso terapêutico , Inibidores da Enzima Conversora de Angiotensina/farmacologia , Inibidores da Enzima Conversora de Angiotensina/uso terapêutico , COVID-19/complicações , COVID-19/virologia , Estudos de Casos e Controles , Nefropatias Diabéticas/tratamento farmacológico , Feminino , Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Interações Hospedeiro-Patógeno , Humanos , Túbulos Renais Proximais/efeitos dos fármacos , Masculino , Pessoa de Meia-Idade
8.
Genome Res ; 26(5): 670-80, 2016 05.
Artigo em Inglês | MEDLINE | ID: mdl-26975778

RESUMO

We can now routinely identify coding variants within individual human genomes. A pressing challenge is to determine which variants disrupt the function of disease-associated genes. Both experimental and computational methods exist to predict pathogenicity of human genetic variation. However, a systematic performance comparison between them has been lacking. Therefore, we developed and exploited a panel of 26 yeast-based functional complementation assays to measure the impact of 179 variants (101 disease- and 78 non-disease-associated variants) from 22 human disease genes. Using the resulting reference standard, we show that experimental functional assays in a 1-billion-year diverged model organism can identify pathogenic alleles with significantly higher precision and specificity than current computational methods.


Assuntos
Teste de Complementação Genética/métodos , Doenças Genéticas Inatas , Saccharomyces cerevisiae , Transcrição Gênica , Doenças Genéticas Inatas/genética , Doenças Genéticas Inatas/metabolismo , Humanos , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo
9.
J Biol Chem ; 288(12): 8519-8530, 2013 Mar 22.
Artigo em Inglês | MEDLINE | ID: mdl-23306196

RESUMO

Insulin-induced gene proteins (INSIGs) function in control of cellular cholesterol. Mammalian INSIGs exert control by directly interacting with proteins containing sterol-sensing domains (SSDs) when sterol levels are elevated. Mammalian 3-hydroxy-3-methylglutaryl (HMG)-CoA reductase (HMGR) undergoes sterol-dependent, endoplasmic-reticulum (ER)-associated degradation (ERAD) that is mediated by INSIG interaction with the HMGR SSD. The yeast HMGR isozyme Hmg2 also undergoes feedback-regulated ERAD in response to the early pathway-derived isoprene gernanylgeranyl pyrophosphate (GGPP). Hmg2 has an SSD, and its degradation is controlled by the INSIG homologue Nsg1. However, yeast Nsg1 promotes Hmg2 stabilization by inhibiting GGPP-stimulated ERAD. We have proposed that the seemingly disparate INSIG functions can be unified by viewing INSIGs as sterol-dependent chaperones of SSD clients. Accordingly, we tested the role of sterols in the Nsg1 regulation of Hmg2. We found that both Nsg1-mediated stabilization of Hmg2 and the Nsg1-Hmg2 interaction required the early sterol lanosterol. Lowering lanosterol in the cell allowed GGPP-stimulated Hmg2 ERAD. Thus, Hmg2-regulated degradation is controlled by a two-signal logic; GGPP promotes degradation, and lanosterol inhibits degradation. These data reveal that the sterol dependence of INSIG-client interaction has been preserved for over 1 billion years. We propose that the INSIGs are a class of sterol-dependent chaperones that bind to SSD clients, thus harnessing ER quality control in the homeostasis of sterols.


Assuntos
Degradação Associada com o Retículo Endoplasmático , Hidroximetilglutaril-CoA Redutases/metabolismo , Lanosterol/fisiologia , Chaperonas Moleculares/fisiologia , Proteínas de Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/fisiologia , Saccharomyces cerevisiae/metabolismo , Vias Biossintéticas , Inibidores Enzimáticos/farmacologia , Lanosterol/metabolismo , Ácido Mevalônico/metabolismo , Naftalenos/farmacologia , Fosfatos de Poli-Isoprenil/biossíntese , Ligação Proteica , Estabilidade Proteica , Proteólise , Esteróis/biossíntese , Terbinafina , Terpenos/metabolismo
10.
medRxiv ; 2024 Aug 16.
Artigo em Inglês | MEDLINE | ID: mdl-39185525

RESUMO

Unraveling the phenotypic and genetic complexity of autism is extremely challenging yet critical for understanding the biology, inheritance, trajectory, and clinical manifestations of the many forms of the condition. Here, we leveraged broad phenotypic data from a large cohort with matched genetics to characterize classes of autism and their patterns of core, associated, and co-occurring traits, ultimately demonstrating that phenotypic patterns are associated with distinct genetic and molecular programs. We used a generative mixture modeling approach to identify robust, clinically-relevant classes of autism which we validate and replicate in a large independent cohort. We link the phenotypic findings to distinct patterns of de novo and inherited variation which emerge from the deconvolution of these genetic signals, and demonstrate that class-specific common variant scores strongly align with clinical outcomes. We further provide insights into the distinct biological pathways and processes disrupted by the sets of mutations in each class. Remarkably, we discover class-specific differences in the developmental timing of genes that are dysregulated, and these temporal patterns correspond to clinical milestone and outcome differences between the classes. These analyses embrace the phenotypic complexity of children with autism, unraveling genetic and molecular programs underlying their heterogeneity and suggesting specific biological dysregulation patterns and mechanistic hypotheses.

11.
Cell Rep Methods ; 3(9): 100580, 2023 09 25.
Artigo em Inglês | MEDLINE | ID: mdl-37703883

RESUMO

Human biology is rooted in highly specialized cell types programmed by a common genome, 98% of which is outside of genes. Genetic variation in the enormous noncoding space is linked to the majority of disease risk. To address the problem of linking these variants to expression changes in primary human cells, we introduce ExPectoSC, an atlas of modular deep-learning-based models for predicting cell-type-specific gene expression directly from sequence. We provide models for 105 primary human cell types covering 7 organ systems, demonstrate their accuracy, and then apply them to prioritize relevant cell types for complex human diseases. The resulting atlas of sequence-based gene expression and variant effects is publicly available in a user-friendly interface and readily extensible to any primary cell types. We demonstrate the accuracy of our approach through systematic evaluations and apply the models to prioritize ClinVar clinical variants of uncertain significance, verifying our top predictions experimentally.


Assuntos
Ascomicetos , Humanos , Expressão Gênica/genética
12.
Nat Comput Sci ; 3(7): 644-657, 2023 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-37974651

RESUMO

Resolving chromatin-remodeling-linked gene expression changes at cell-type resolution is important for understanding disease states. Here we describe MAGICAL (Multiome Accessibility Gene Integration Calling and Looping), a hierarchical Bayesian approach that leverages paired single-cell RNA sequencing and single-cell transposase-accessible chromatin sequencing from different conditions to map disease-associated transcription factors, chromatin sites, and genes as regulatory circuits. By simultaneously modeling signal variation across cells and conditions in both omics data types, MAGICAL achieved high accuracy on circuit inference. We applied MAGICAL to study Staphylococcus aureus sepsis from peripheral blood mononuclear single-cell data that we generated from subjects with bloodstream infection and uninfected controls. MAGICAL identified sepsis-associated regulatory circuits predominantly in CD14 monocytes, known to be activated by bacterial sepsis. We addressed the challenging problem of distinguishing host regulatory circuit responses to methicillin-resistant and methicillin-susceptible S. aureus infections. Although differential expression analysis failed to show predictive value, MAGICAL identified epigenetic circuit biomarkers that distinguished methicillin-resistant from methicillin-susceptible S. aureus infections.

13.
J Biol Chem ; 286(30): 26298-307, 2011 Jul 29.
Artigo em Inglês | MEDLINE | ID: mdl-21628456

RESUMO

The sterol-sensing domain (SSD) is a conserved motif in membrane proteins responsible for sterol regulation. Mammalian proteins SREBP cleavage-activating protein (SCAP) and HMG-CoA reductase (HMGR) both possess SSDs required for feedback regulation of sterol-related genes and sterol synthetic rate. Although these two SSD proteins clearly sense sterols, the range of signals detected by this eukaryotic motif is not clear. The yeast HMG-CoA reductase isozyme Hmg2, like its mammalian counterpart, undergoes endoplasmic reticulum (ER)-associated degradation that is subject to feedback control by the sterol pathway. The primary degradation signal for yeast Hmg2 degradation is the 20-carbon isoprene geranylgeranyl pyrophosphate, rather than a sterol. Nevertheless, the Hmg2 protein possesses an SSD, leading us to test its role in feedback control of Hmg2 stability. We mutated highly conserved SSD residues of Hmg2 and evaluated regulated degradation. Our results indicated that the SSD was required for sterol pathway signals to stimulate Hmg2 ER-associated degradation and was employed for detection of both geranylgeranyl pyrophosphate and a secondary oxysterol signal. Our data further indicate that the SSD allows a signal-dependent structural change in Hmg2 that promotes entry into the ER degradation pathway. Thus, the eukaryotic SSD is capable of significant plasticity in signal recognition or response. We propose that the harnessing of cellular quality control pathways to bring about feedback regulation of normal proteins is a unifying theme for the action of all SSDs.


Assuntos
Retículo Endoplasmático/metabolismo , Hidroximetilglutaril-CoA Redutases/metabolismo , Proteínas de Saccharomyces cerevisiae/metabolismo , Saccharomyces cerevisiae/enzimologia , Retículo Endoplasmático/genética , Hidroximetilglutaril-CoA Redutases/genética , Isoenzimas/genética , Isoenzimas/metabolismo , Fosfatos de Poli-Isoprenil/metabolismo , Estrutura Terciária de Proteína , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/genética
14.
Cell Syst ; 12(4): 353-362.e6, 2021 04 21.
Artigo em Inglês | MEDLINE | ID: mdl-33689683

RESUMO

Systematic study of tissue-specific function of enhancers and their disease associations is a major challenge. We present an integrative machine-learning framework, FENRIR, that integrates thousands of disparate epigenetic and functional genomics datasets to infer tissue-specific functional relationships between enhancers for 140 diverse human tissues and cell types, providing a regulatory-region-centric approach to systematically identify disease-associated enhancers. We demonstrated its power to accurately prioritize enhancers associated with 25 complex diseases. In a case study on autism, FENRIR-prioritized enhancers showed a significant proband-specific de novo mutation enrichment in a large, sibling-controlled cohort, indicating pathogenic signal. We experimentally validated transcriptional regulatory activities of eight enhancers, including enhancers not previously reported with autism, and demonstrated their differential regulatory potential between proband and sibling alleles. Thus, FENRIR is an accurate and effective framework for the study of tissue-specific enhancers and their role in disease. FENRIR can be accessed at fenrir.flatironinstitute.org/.


Assuntos
Elementos Facilitadores Genéticos/genética , Redes Reguladoras de Genes/genética , Humanos
15.
Nat Genet ; 53(2): 166-173, 2021 02.
Artigo em Inglês | MEDLINE | ID: mdl-33462483

RESUMO

Despite the strong genetic basis of psychiatric disorders, the underlying molecular mechanisms are largely unmapped. RNA-binding proteins (RBPs) are responsible for most post-transcriptional regulation, from splicing to translation to localization. RBPs thus act as key gatekeepers of cellular homeostasis, especially in the brain. However, quantifying the pathogenic contribution of noncoding variants impacting RBP target sites is challenging. Here, we leverage a deep learning approach that can accurately predict the RBP target site dysregulation effects of mutations and discover that RBP dysregulation is a principal contributor to psychiatric disorder risk. RBP dysregulation explains a substantial amount of heritability not captured by large-scale molecular quantitative trait loci studies and has a stronger impact than common coding region variants. We share the genome-wide profiles of RBP dysregulation, which we use to identify DDHD2 as a candidate schizophrenia risk gene. This resource provides a new analytical framework to connect the full range of RNA regulation to complex disease.


Assuntos
Transtornos Mentais/genética , Fosfolipases/genética , Proteínas de Ligação a RNA/genética , Regiões 3' não Traduzidas , Aprendizado Profundo , Regulação da Expressão Gênica , Frequência do Gene , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Mutação , Proteínas do Fator Nuclear 90/genética , Fatores de Alongamento de Peptídeos/genética , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , RNA Helicases/genética , Processamento Pós-Transcricional do RNA , Ribonucleoproteína Nuclear Pequena U5/genética , Esquizofrenia/genética , Transativadores/genética
16.
medRxiv ; 2020 Aug 21.
Artigo em Inglês | MEDLINE | ID: mdl-32511461

RESUMO

COVID-19 morbidity and mortality is increased in patients with diabetes and kidney disease via unknown mechanisms. SARS-CoV-2 uses angiotensin-converting enzyme 2 (ACE2) for entry into host cells. Since ACE2 is a susceptibility factor for infection, we investigated how diabetic kidney disease (DKD) and medications alter ACE2 receptor expression in kidneys. Single cell RNA profiling of healthy living donor (LD) and DKD kidney biopsies revealed ACE2 expression primarily in proximal tubular epithelial cells (PTEC). This cell specific localization was confirmed by in situ hybridization. ACE2 expression levels were unaltered by exposures to renin angiotensin aldosterone system inhibitors in DKD. Bayesian integrative analysis of a large compendium of public -omics datasets identified molecular network modules induced in ACE2-expressing PTEC in DKD (searchable at hb.flatironinstitute.org/covid-kidney) that were linked to viral entry, immune activation, endomembrane reorganization, and RNA processing. The DKD ACE2-positive PTEC module overlapped with expression patterns seen in SARS-CoV-2 infected cells. Similar cellular programs were seen in ACE2-positive PTEC obtained from urine samples of 13 COVID-19 patients who were hospitalized, suggesting a consistent ACE2-coregulated PTEC expression program that may interact with the SARS-CoV-2 infection processes. Thus SARS-CoV-2 receptor networks can seed further research into risk stratification and therapeutic strategies for COVID-19 related kidney damage.

17.
Nucleic Acids Res ; 35(Database issue): D468-71, 2007 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-17142221

RESUMO

The recent explosion in protein data generated from both directed small-scale studies and large-scale proteomics efforts has greatly expanded the quantity of available protein information and has prompted the Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) to enhance the depth and accessibility of protein annotations. In particular, we have expanded ongoing efforts to improve the integration of experimental information and sequence-based predictions and have redesigned the protein information web pages. A key feature of this redesign is the development of a GBrowse-derived interactive Proteome Browser customized to improve the visualization of sequence-based protein information. This Proteome Browser has enabled SGD to unify the display of hidden Markov model (HMM) domains, protein family HMMs, motifs, transmembrane regions, signal peptides, hydropathy plots and profile hits using several popular prediction algorithms. In addition, a physico-chemical properties page has been introduced to provide easy access to basic protein information. Improvements to the layout of the Protein Information page and integration of the Proteome Browser will facilitate the ongoing expansion of sequence-specific experimental information captured in SGD, including post-translational modifications and other user-defined annotations. Finally, SGD continues to improve upon the availability of genetic and physical interaction data in an ongoing collaboration with BioGRID by providing direct access to more than 82,000 manually-curated interactions.


Assuntos
Bases de Dados de Proteínas , Proteômica , Proteínas de Saccharomyces cerevisiae/química , Gráficos por Computador , Genoma Fúngico , Internet , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/genética , Análise de Sequência de Proteína , Interface Usuário-Computador
18.
Cell Syst ; 8(2): 152-162.e6, 2019 02 27.
Artigo em Inglês | MEDLINE | ID: mdl-30685436

RESUMO

A key challenge for the diagnosis and treatment of complex human diseases is identifying their molecular basis. Here, we developed a unified computational framework, URSAHD (Unveiling RNA Sample Annotation for Human Diseases), that leverages machine learning and the hierarchy of anatomical relationships present among diseases to integrate thousands of clinical gene expression profiles and identify molecular characteristics specific to each of the hundreds of complex diseases. URSAHD can distinguish between closely related diseases more accurately than literature-validated genes or traditional differential-expression-based computational approaches and is applicable to any disease, including rare and understudied ones. We demonstrate the utility of URSAHD in classifying related nervous system cancers and experimentally verifying novel neuroblastoma-associated genes identified by URSAHD. We highlight the applications for potential targeted drug-repurposing and for quantitatively assessing the molecular response to clinical therapies. URSAHD is freely available for public use, including the use of underlying models, at ursahd.princeton.edu.


Assuntos
Perfilação da Expressão Gênica/métodos , Genômica/métodos , Aprendizado de Máquina/normas , Transcriptoma/genética , Humanos
19.
Nat Genet ; 51(6): 973-980, 2019 06.
Artigo em Inglês | MEDLINE | ID: mdl-31133750

RESUMO

We address the challenge of detecting the contribution of noncoding mutations to disease with a deep-learning-based framework that predicts the specific regulatory effects and the deleterious impact of genetic variants. Applying this framework to 1,790 autism spectrum disorder (ASD) simplex families reveals a role in disease for noncoding mutations-ASD probands harbor both transcriptional- and post-transcriptional-regulation-disrupting de novo mutations of significantly higher functional impact than those in unaffected siblings. Further analysis suggests involvement of noncoding mutations in synaptic transmission and neuronal development and, taken together with previous studies, reveals a convergent genetic landscape of coding and noncoding mutations in ASD. We demonstrate that sequences carrying prioritized mutations identified in probands possess allele-specific regulatory activity, and we highlight a link between noncoding mutations and heterogeneity in the IQ of ASD probands. Our predictive genomics framework illuminates the role of noncoding mutations in ASD and prioritizes mutations with high impact for further study, and is broadly applicable to complex human diseases.


Assuntos
Transtorno do Espectro Autista/genética , Aprendizado Profundo , Predisposição Genética para Doença , Genoma Humano , Genômica , Mutação , RNA não Traduzido , Algoritmos , Alelos , Transtorno do Espectro Autista/diagnóstico , Biologia Computacional/métodos , Expressão Gênica , Regulação da Expressão Gênica , Genes Reporter , Estudos de Associação Genética , Genômica/métodos , Humanos , Fenótipo , Processamento Pós-Transcricional do RNA , Transcrição Gênica
20.
Nucleic Acids Res ; 34(Database issue): D442-5, 2006 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-16381907

RESUMO

Sequencing and annotation of the entire Saccharomyces cerevisiae genome has made it possible to gain a genome-wide perspective on yeast genes and gene products. To make this information available on an ongoing basis, the Saccharomyces Genome Database (SGD) (http://www.yeastgenome.org/) has created the Genome Snapshot (http://db.yeastgenome.org/cgi-bin/genomeSnapShot.pl). The Genome Snapshot summarizes the current state of knowledge about the genes and chromosomal features of S.cerevisiae. The information is organized into two categories: (i) number of each type of chromosomal feature annotated in the genome and (ii) number and distribution of genes annotated to Gene Ontology terms. Detailed lists are accessible through SGD's Advanced Search tool (http://db.yeastgenome.org/cgi-bin/search/featureSearch), and all the data presented on this page are available from the SGD ftp site (ftp://ftp.yeastgenome.org/yeast/).


Assuntos
Bases de Dados Genéticas , Genoma Fúngico , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/genética , Cromossomos Fúngicos , Gráficos por Computador , Genômica , Internet , Proteínas de Saccharomyces cerevisiae/classificação , Proteínas de Saccharomyces cerevisiae/fisiologia , Interface Usuário-Computador
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA