Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 27
Filtrar
1.
Genome Res ; 32(3): 524-533, 2022 03.
Artigo em Inglês | MEDLINE | ID: mdl-35193937

RESUMO

Understanding how each person's unique genotype influences their individual patterns of gene regulation has the potential to improve our understanding of human health and development, and to refine genotype-specific disease risk assessments and treatments. However, the effects of genetic variants are not typically considered when constructing gene regulatory networks, despite the fact that many disease-associated genetic variants are thought to have regulatory effects, including the disruption of transcription factor (TF) binding. We developed EGRET (Estimating the Genetic Regulatory Effect on TFs), which infers a genotype-specific gene regulatory network for each individual in a study population. EGRET begins by constructing a genotype-informed TF-gene prior network derived using TF motif predictions, expression quantitative trait locus (eQTL) data, individual genotypes, and the predicted effects of genetic variants on TF binding. It then uses a technique known as message passing to integrate this prior network with gene expression and TF protein-protein interaction data to produce a refined, genotype-specific regulatory network. We used EGRET to infer gene regulatory networks for two blood-derived cell lines and identified genotype-associated, cell line-specific regulatory differences that we subsequently validated using allele-specific expression, chromatin accessibility QTLs, and differential ChIP-seq TF binding. We also inferred EGRET networks for three cell types from each of 119 individuals and identified cell type-specific regulatory differences associated with diseases related to those cell types. EGRET is, to our knowledge, the first method that infers networks reflective of individual genetic variation in a way that provides insight into the genetic regulatory associations driving complex phenotypes.


Assuntos
Redes Reguladoras de Genes , Fatores de Transcrição , Cromatina , Imunoprecipitação da Cromatina , Genótipo , Humanos , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo
2.
Nucleic Acids Res ; 50(D1): D610-D621, 2022 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-34508353

RESUMO

Gene regulation plays a fundamental role in shaping tissue identity, function, and response to perturbation. Regulatory processes are controlled by complex networks of interacting elements, including transcription factors, miRNAs and their target genes. The structure of these networks helps to determine phenotypes and can ultimately influence the development of disease or response to therapy. We developed GRAND (https://grand.networkmedicine.org) as a database for computationally-inferred, context-specific gene regulatory network models that can be compared between biological states, or used to predict which drugs produce changes in regulatory network structure. The database includes 12 468 genome-scale networks covering 36 human tissues, 28 cancers, 1378 unperturbed cell lines, as well as 173 013 TF and gene targeting scores for 2858 small molecule-induced cell line perturbation paired with phenotypic information. GRAND allows the networks to be queried using phenotypic information and visualized using a variety of interactive tools. In addition, it includes a web application that matches disease states to potentially therapeutic small molecule drugs using regulatory network properties.


Assuntos
Bases de Dados Genéticas , Bases de Dados de Produtos Farmacêuticos , Redes Reguladoras de Genes/genética , Software , Regulação da Expressão Gênica/genética , Genoma Humano/genética , Humanos , MicroRNAs/classificação , MicroRNAs/genética , Fatores de Transcrição/classificação , Fatores de Transcrição/genética
3.
PLoS Genet ; 17(11): e1009912, 2021 11.
Artigo em Inglês | MEDLINE | ID: mdl-34784346

RESUMO

α1-anti-trypsin (A1AT), encoded by SERPINA1, is a neutrophil elastase inhibitor that controls the inflammatory response in the lung. Severe A1AT deficiency increases risk for Chronic Obstructive Pulmonary Disease (COPD), however, the role of A1AT in COPD in non-deficient individuals is not well known. We identify a 2.1-fold increase (p = 2.5x10-6) in the use of a distal poly-adenylation site in primary lung tissue RNA-seq in 82 COPD cases when compared to 64 controls and replicate this in an independent study of 376 COPD and 267 controls. This alternative polyadenylation event involves two sites, a proximal and distal site, 61 and 1683 nucleotides downstream of the A1AT stop codon. To characterize this event, we measured the distal ratio in human primary tissue short read RNA-seq data and corroborated our results with long read RNA-seq data. Integrating these results with 3' end RNA-seq and nanoluciferase reporter assay experiments we show that use of the distal site yields mRNA transcripts with over 50-fold decreased translation efficiency and A1AT expression. We identified seven RNA binding proteins using enhanced CrossLinking and ImmunoPrecipitation precipitation (eCLIP) with one or more binding sites in the SERPINA1 3' UTR. We combined these data with measurements of the distal ratio in shRNA knockdown experiments, nuclear and cytoplasmic fractionation, and chemical RNA structure probing. We identify Quaking Homolog (QKI) as a modulator of SERPINA1 mRNA translation and confirm the role of QKI in SERPINA1 translation with luciferase reporter assays. Analysis of single-cell RNA-seq showed differences in the distribution of the SERPINA1 distal ratio among hepatocytes, macrophages, αß-Tcells and plasma cells in the liver. Alveolar Type 1,2, dendritic cells and macrophages also vary in their distal ratio in the lung. Our work reveals a complex post-transcriptional mechanism that regulates alternative polyadenylation and A1AT expression in COPD.


Assuntos
Pulmão/metabolismo , Doença Pulmonar Obstrutiva Crônica/genética , alfa 1-Antitripsina/genética , Linhagem Celular , Códon de Terminação/genética , Regulação da Expressão Gênica/genética , Hepatócitos/metabolismo , Humanos , Fígado/metabolismo , Pulmão/patologia , Macrófagos/metabolismo , Poliadenilação/genética , Proteínas Secretadas Inibidoras de Proteinases/genética , Proteínas Secretadas Inibidoras de Proteinases/metabolismo , Doença Pulmonar Obstrutiva Crônica/patologia , RNA-Seq , Análise de Célula Única , Linfócitos T/metabolismo
4.
PLoS Pathog ; 16(4): e1008452, 2020 04.
Artigo em Inglês | MEDLINE | ID: mdl-32255801

RESUMO

The Mycobacterium tuberculosis Ser/Thr protein kinases PknA and PknB are essential for growth and have been proposed as possible drug targets. We used a titratable conditional depletion system to investigate the functions of these kinases. Depletion of PknA or PknB or both kinases resulted in growth arrest, shortening of cells, and time-dependent loss of acid-fast staining with a concomitant decrease in mycolate synthesis and accumulation of trehalose monomycolate. Depletion of PknA and/or PknB resulted in markedly increased susceptibility to ß-lactam antibiotics, and to the key tuberculosis drug rifampin. Phosphoproteomic analysis showed extensive changes in protein phosphorylation in response to PknA depletion and comparatively fewer changes with PknB depletion. These results identify candidate substrates of each kinase and suggest specific and coordinate roles for PknA and PknB in regulating multiple essential physiologies. These findings support these kinases as targets for new antituberculosis drugs and provide a valuable resource for targeted investigation of mechanisms by which protein phosphorylation regulates pathways required for growth and virulence in M. tuberculosis.


Assuntos
Antituberculosos/farmacologia , Proteínas de Bactérias/metabolismo , Mycobacterium tuberculosis/enzimologia , Proteínas Serina-Treonina Quinases/metabolismo , Proteínas de Bactérias/genética , Fatores Corda/metabolismo , Regulação Bacteriana da Expressão Gênica/efeitos dos fármacos , Humanos , Mycobacterium tuberculosis/efeitos dos fármacos , Mycobacterium tuberculosis/genética , Mycobacterium tuberculosis/crescimento & desenvolvimento , Proteínas Serina-Treonina Quinases/genética , Tuberculose/microbiologia
5.
Genomics ; 113(6): 4184-4195, 2021 11.
Artigo em Inglês | MEDLINE | ID: mdl-34763026

RESUMO

Cigarette smoking induces a profound transcriptomic and systemic inflammatory response. Previous studies have focused on gene level differential expression of smoking, but the genome-wide effects of smoking on alternative isoform regulation have not yet been described. We conducted RNA sequencing in whole-blood samples of 454 current and 767 former smokers in the COPDGene Study, and we analyzed the effects of smoking on differential usage of isoforms and exons. At 10% FDR, we detected 3167 differentially expressed genes, 945 differentially used isoforms and 160 differentially used exons. Isoform switch analysis revealed widespread 3' UTR lengthening associated with cigarette smoking. The lengthening of these 3' UTRs was consistent with alternative usage of distal polyadenylation sites, and these extended 3' UTR regions were significantly enriched with functional sequence elements including microRNA and RNA-protein binding sites. These findings warrant further studies on alternative polyadenylation events as potential biomarkers and novel therapeutic targets for smoking-related diseases.


Assuntos
Fumar Cigarros , Poliadenilação , Regiões 3' não Traduzidas , Fumar Cigarros/efeitos adversos , Fumar Cigarros/genética , Isoformas de Proteínas/genética , Fumar/efeitos adversos , Fumar/genética
6.
Br J Cancer ; 122(4): 569-577, 2020 02.
Artigo em Inglês | MEDLINE | ID: mdl-31806877

RESUMO

BACKGROUND: Genome-wide association studies (GWASes) have identified many noncoding germline single-nucleotide polymorphisms (SNPs) that are associated with an increased risk of developing cancer. However, how these SNPs affect cancer risk is still largely unknown. METHODS: We used a systems biology approach to analyse the regulatory role of cancer-risk SNPs in thirteen tissues. By using data from the Genotype-Tissue Expression (GTEx) project, we performed an expression quantitative trait locus (eQTL) analysis. We represented both significant cis- and trans-eQTLs as edges in tissue-specific eQTL bipartite networks. RESULTS: Each tissue-specific eQTL network is organised into communities that group sets of SNPs and functionally related genes. When mapping cancer-risk SNPs to these networks, we find that in each tissue, these SNPs are significantly overrepresented in communities enriched for immune response processes, as well as tissue-specific functions. Moreover, cancer-risk SNPs are more likely to be 'cores' of their communities, influencing the expression of many genes within the same biological processes. Finally, cancer-risk SNPs preferentially target oncogenes and tumour-suppressor genes, suggesting that they may alter the expression of these key cancer genes. CONCLUSIONS: This approach provides a new way of understanding genetic effects on cancer risk and provides a biological context for interpreting the results of GWAS cancer studies.


Assuntos
Genes Supressores de Tumor , Predisposição Genética para Doença/genética , Neoplasias/genética , Neoplasias/imunologia , Oncogenes/genética , Polimorfismo de Nucleotídeo Único , Humanos , Locos de Características Quantitativas
8.
Proc Natl Acad Sci U S A ; 114(37): E7841-E7850, 2017 09 12.
Artigo em Inglês | MEDLINE | ID: mdl-28851834

RESUMO

Characterizing the collective regulatory impact of genetic variants on complex phenotypes is a major challenge in developing a genotype to phenotype map. Using expression quantitative trait locus (eQTL) analyses, we constructed bipartite networks in which edges represent significant associations between genetic variants and gene expression levels and found that the network structure informs regulatory function. We show, in 13 tissues, that these eQTL networks are organized into dense, highly modular communities grouping genes often involved in coherent biological processes. We find communities representing shared processes across tissues, as well as communities associated with tissue-specific processes that coalesce around variants in tissue-specific active chromatin regions. Node centrality is also highly informative, with the global and community hubs differing in regulatory potential and likelihood of being disease associated.


Assuntos
Estudo de Associação Genômica Ampla/métodos , Especificidade de Órgãos/genética , Locos de Características Quantitativas/genética , Expressão Gênica/genética , Regulação da Expressão Gênica/genética , Redes Reguladoras de Genes/genética , Predisposição Genética para Doença/genética , Variação Genética , Genótipo , Humanos , Fenótipo , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/fisiologia , Transcriptoma/genética
9.
Hum Genomics ; 12(1): 1, 2018 01 15.
Artigo em Inglês | MEDLINE | ID: mdl-29335020

RESUMO

BACKGROUND: Genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) significantly associated with chronic obstructive pulmonary disease (COPD). However, many genetic variants show suggestive evidence for association but do not meet the strict threshold for genome-wide significance. Integrative analysis of multiple omics datasets has the potential to identify novel genes involved in disease pathogenesis by leveraging these variants in a functional, regulatory context. RESULTS: We performed expression quantitative trait locus (eQTL) analysis using genome-wide SNP genotyping and gene expression profiling of lung tissue samples from 86 COPD cases and 31 controls, testing for SNPs associated with gene expression levels. These results were integrated with a prior COPD GWAS using an ensemble statistical and network methods approach to identify relevant genes and observe them in the context of overall genetic control of gene expression to highlight co-regulated genes and disease pathways. We identified 250,312 unique SNPs and 4997 genes in the cis(local)-eQTL analysis (5% false discovery rate). The top gene from the integrative analysis was MAPT, a gene recently identified in an independent GWAS of lung function. The genes HNRNPAB and PCBP2 with RNA binding activity and the gene ACVR1B were identified in network communities with validated disease relevance. CONCLUSIONS: The integration of lung tissue gene expression with genome-wide SNP genotyping and subsequent intersection with prior GWAS and omics studies highlighted candidate genes within COPD loci and in communities harboring known COPD genes. This integration also identified novel disease genes in sub-threshold regions that would otherwise have been missed through GWAS.


Assuntos
Predisposição Genética para Doença , Genoma Humano/genética , Estudo de Associação Genômica Ampla , Doença Pulmonar Obstrutiva Crônica/genética , Receptores de Ativinas Tipo I/genética , Adulto , Idoso , Feminino , Regulação da Expressão Gênica , Genômica , Ribonucleoproteínas Nucleares Heterogêneas Grupo A-B/genética , Humanos , Pulmão/metabolismo , Masculino , Pessoa de Meia-Idade , Polimorfismo de Nucleotídeo Único/genética , Doença Pulmonar Obstrutiva Crônica/patologia , Locos de Características Quantitativas/genética , Proteínas de Ligação a RNA/genética , Proteínas tau/genética
10.
BMC Bioinformatics ; 18(1): 437, 2017 Oct 03.
Artigo em Inglês | MEDLINE | ID: mdl-28974199

RESUMO

BACKGROUND: Although ultrahigh-throughput RNA-Sequencing has become the dominant technology for genome-wide transcriptional profiling, the vast majority of RNA-Seq studies typically profile only tens of samples, and most analytical pipelines are optimized for these smaller studies. However, projects are generating ever-larger data sets comprising RNA-Seq data from hundreds or thousands of samples, often collected at multiple centers and from diverse tissues. These complex data sets present significant analytical challenges due to batch and tissue effects, but provide the opportunity to revisit the assumptions and methods that we use to preprocess, normalize, and filter RNA-Seq data - critical first steps for any subsequent analysis. RESULTS: We find that analysis of large RNA-Seq data sets requires both careful quality control and the need to account for sparsity due to the heterogeneity intrinsic in multi-group studies. We developed Yet Another RNA Normalization software pipeline (YARN), that includes quality control and preprocessing, gene filtering, and normalization steps designed to facilitate downstream analysis of large, heterogeneous RNA-Seq data sets and we demonstrate its use with data from the Genotype-Tissue Expression (GTEx) project. CONCLUSIONS: An R package instantiating YARN is available at http://bioconductor.org/packages/yarn .


Assuntos
Bases de Dados Genéticas , Especificidade de Órgãos/genética , Análise de Sequência de RNA/métodos , Análise de Sequência de RNA/normas , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Humanos , Anotação de Sequência Molecular , Análise de Componente Principal , Controle de Qualidade , Padrões de Referência , Tamanho da Amostra , Software
11.
BMC Genomics ; 18(1): 723, 2017 Sep 12.
Artigo em Inglês | MEDLINE | ID: mdl-28899340

RESUMO

BACKGROUND: Cell lines are an indispensable tool in biomedical research and often used as surrogates for tissues. Although there are recognized important cellular and transcriptomic differences between cell lines and tissues, a systematic overview of the differences between the regulatory processes of a cell line and those of its tissue of origin has not been conducted. The RNA-Seq data generated by the GTEx project is the first available data resource in which it is possible to perform a large-scale transcriptional and regulatory network analysis comparing cell lines with their tissues of origin. RESULTS: We compared 127 paired Epstein-Barr virus transformed lymphoblastoid cell lines (LCLs) and whole blood samples, and 244 paired primary fibroblast cell lines and skin samples. While gene expression analysis confirms that these cell lines carry the expression signatures of their primary tissues, albeit at reduced levels, network analysis indicates that expression changes are the cumulative result of many previously unreported alterations in transcription factor (TF) regulation. More specifically, cell cycle genes are over-expressed in cell lines compared to primary tissues, and this alteration in expression is a result of less repressive TF targeting. We confirmed these regulatory changes for four TFs, including SMAD5, using independent ChIP-seq data from ENCODE. CONCLUSIONS: Our results provide novel insights into the regulatory mechanisms controlling the expression differences between cell lines and tissues. The strong changes in TF regulation that we observe suggest that network changes, in addition to transcriptional levels, should be considered when using cell lines as models for tissues.


Assuntos
Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Ciclo Celular/genética , Linhagem Celular , Humanos , Especificidade de Órgãos
12.
PLoS Comput Biol ; 12(9): e1005033, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-27618581

RESUMO

Genome Wide Association Studies (GWAS) and expression quantitative trait locus (eQTL) analyses have identified genetic associations with a wide range of human phenotypes. However, many of these variants have weak effects and understanding their combined effect remains a challenge. One hypothesis is that multiple SNPs interact in complex networks to influence functional processes that ultimately lead to complex phenotypes, including disease states. Here we present CONDOR, a method that represents both cis- and trans-acting SNPs and the genes with which they are associated as a bipartite graph and then uses the modular structure of that graph to place SNPs into a functional context. In applying CONDOR to eQTLs in chronic obstructive pulmonary disease (COPD), we found the global network "hub" SNPs were devoid of disease associations through GWAS. However, the network was organized into 52 communities of SNPs and genes, many of which were enriched for genes in specific functional classes. We identified local hubs within each community ("core SNPs") and these were enriched for GWAS SNPs for COPD and many other diseases. These results speak to our intuition: rather than single SNPs influencing single genes, we see groups of SNPs associated with the expression of families of functionally related genes and that disease SNPs are associated with the perturbation of those functions. These methods are not limited in their application to COPD and can be used in the analysis of a wide variety of disease processes and other phenotypic traits.


Assuntos
Estudo de Associação Genômica Ampla/métodos , Locos de Características Quantitativas/genética , Algoritmos , Humanos , Modelos Estatísticos , Polimorfismo de Nucleotídeo Único/genética , Doença Pulmonar Obstrutiva Crônica/genética
13.
J Biol Phys ; 42(3): 339-50, 2016 06.
Artigo em Inglês | MEDLINE | ID: mdl-27012959

RESUMO

Thermodynamics is an important driving factor for chemical processes and for life. Earlier work has shown that each cancer has its own molecular signaling network that supports its life cycle and that different cancers have different thermodynamic entropies characterizing their signaling networks. The respective thermodynamic entropies correlate with 5-year survival for each cancer. We now show that by overlaying mRNA transcription data from a specific tumor type onto a human protein-protein interaction network, we can derive the Gibbs free energy for the specific cancer. The Gibbs free energy correlates with 5-year survival (Pearson correlation of -0.7181, p value of 0.0294). Using an expression relating entropy and Gibbs free energy to enthalpy, we derive an empirical relation for cancer network enthalpy. Combining this with previously published results, we now show a complete set of extensive thermodynamic properties and cancer type with 5-year survival.


Assuntos
Entropia , Proteínas de Neoplasias/metabolismo , Mapas de Interação de Proteínas , Epigênese Genética , Probabilidade , Análise de Sobrevida
14.
bioRxiv ; 2024 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-38464142

RESUMO

Single Nucleotide Polymorphisms (SNPs) associated with traits typically explain a small part of the trait genetic heritability-with the remainder thought to be distributed throughout the genome. Such SNPs are likely to alter expression levels of biologically relevant genes. Expression Quantitative Trait Locus (eQTL) networks analysis has helped to functionally characterize such variants. We systematically analyze the distribution of SNP heritability for ten traits across 29 tissue-specific eQTL networks. We find that heritability is clustered in a small number or tissue-specific, functionally relevant SNP-gene modules and that the greatest occurs in local "hubs" that are both the cornerstone of the network's modules and tissue-specific regulatory elements. The network structure could thus both amplify the genotype-phenotype connection and buffer the deleterious effect of the genetic variations on other traits. Together, these results define a conceptual framework for understanding complex trait architecture and identifying key mutations carrying most of the heritability.

15.
medRxiv ; 2024 May 20.
Artigo em Inglês | MEDLINE | ID: mdl-38826461

RESUMO

Rationale: Genetic variants and gene expression predict risk of chronic obstructive pulmonary disease (COPD), but their effect on COPD heterogeneity is unclear. Objectives: Define high-risk COPD subtypes using both genetics (polygenic risk score, PRS) and blood gene expression (transcriptional risk score, TRS) and assess differences in clinical and molecular characteristics. Methods: We defined high-risk groups based on PRS and TRS quantiles by maximizing differences in protein biomarkers in a COPDGene training set and identified these groups in COPDGene and ECLIPSE test sets. We tested multivariable associations of subgroups with clinical outcomes and compared protein-protein interaction networks and drug repurposing analyses between high-risk groups. Measurements and Main Results: We examined two high-risk omics-defined groups in non-overlapping test sets (n=1,133 NHW COPDGene, n=299 African American (AA) COPDGene, n=468 ECLIPSE). We defined "High activity" (low PRS/high TRS) and "severe risk" (high PRS/high TRS) subgroups. Participants in both subgroups had lower body-mass index (BMI), lower lung function, and alterations in metabolic, growth, and immune signaling processes compared to a low-risk (low PRS, low TRS) reference subgroup. "High activity" but not "severe risk" participants had greater prospective FEV 1 decline (COPDGene: -51 mL/year; ECLIPSE: - 40 mL/year) and their proteomic profiles were enriched in gene sets perturbed by treatment with 5-lipoxygenase inhibitors and angiotensin-converting enzyme (ACE) inhibitors. Conclusions: Concomitant use of polygenic and transcriptional risk scores identified clinical and molecular heterogeneity amongst high-risk individuals. Proteomic and drug repurposing analysis identified subtype-specific enrichment for therapies and suggest prior drug repurposing failures may be explained by patient selection.

16.
Genome Biol ; 24(1): 45, 2023 03 09.
Artigo em Inglês | MEDLINE | ID: mdl-36894939

RESUMO

Inference and analysis of gene regulatory networks (GRNs) require software that integrates multi-omic data from various sources. The Network Zoo (netZoo; netzoo.github.io) is a collection of open-source methods to infer GRNs, conduct differential network analyses, estimate community structure, and explore the transitions between biological states. The netZoo builds on our ongoing development of network methods, harmonizing the implementations in various computing languages and between methods to allow better integration of these tools into analytical pipelines. We demonstrate the utility using multi-omic data from the Cancer Cell Line Encyclopedia. We will continue to expand the netZoo to incorporate additional methods.


Assuntos
Redes Reguladoras de Genes , Neoplasias , Humanos , Algoritmos , Software , Multiômica , Biologia Computacional/métodos
17.
Cell Rep Methods ; 2(5): 100218, 2022 05 23.
Artigo em Inglês | MEDLINE | ID: mdl-35637906

RESUMO

Expression quantitative trait locus (eQTL) analysis associates SNPs with gene expression; these relationships can be represented as a bipartite network with association strength as "edge weights" between SNPs and genes. However, most eQTL networks use binary edge weights based on thresholded FDR estimates: definitions that influence reproducibility and downstream analyses. We constructed twenty-nine tissue-specific eQTL networks using GTEx data and evaluated a comprehensive set of network specifications based on false discovery rates, test statistics, and p values, focusing on the degree centrality-a metric of an SNP or gene node's potential network influence. We found a thresholded Benjamini-Hochberg q value weighted by the Z-statistic balances metric reproducibility and computational efficiency. Our estimated gene degrees positively correlate with gene degrees in gene regulatory networks, demonstrating that these networks are complementary in understanding regulation. Gene degrees also correlate with genetic diversity, and heritability analyses show that highly connected nodes are enriched for tissue-relevant traits.


Assuntos
Redes Reguladoras de Genes , Locos de Características Quantitativas , Locos de Características Quantitativas/genética , Reprodutibilidade dos Testes , Redes Reguladoras de Genes/genética , Fenótipo , Genômica
18.
Front Genet ; 12: 649942, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33968133

RESUMO

Profiling of whole transcriptomes has become a cornerstone of molecular biology and an invaluable tool for the characterization of clinical phenotypes and the identification of disease subtypes. Analyses of these data are becoming ever more sophisticated as we move beyond simple comparisons to consider networks of higher-order interactions and associations. Gene regulatory networks (GRNs) model the regulatory relationships of transcription factors and genes and have allowed the identification of differentially regulated processes in disease systems. In this perspective, we discuss gene targeting scores, which measure changes in inferred regulatory network interactions, and their use in identifying disease-relevant processes. In addition, we present an example analysis for pancreatic ductal adenocarcinoma (PDAC), demonstrating the power of gene targeting scores to identify differential processes between complex phenotypes, processes that would have been missed by only performing differential expression analysis. This example demonstrates that gene targeting scores are an invaluable addition to gene expression analysis in the characterization of diseases and other complex phenotypes.

19.
Proc AAAI Conf Artif Intell ; 35(11): 10263-10272, 2021 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-34707916

RESUMO

Bipartite network inference is a ubiquitous problem across disciplines. One important example in the field molecular biology is gene regulatory network inference. Gene regulatory networks are an instrumental tool aiding in the discovery of the molecular mechanisms driving diverse diseases, including cancer. However, only noisy observations of the projections of these regulatory networks are typically assayed. In an effort to better estimate regulatory networks from their noisy projections, we formulate a non-convex but analytically tractable optimization problem called OTTER. This problem can be interpreted as relaxed graph matching between the two projections of the bipartite network. OTTER's solutions can be derived explicitly and inspire a spectral algorithm, for which we provide network recovery guarantees. We also provide an alternative approach based on gradient descent that is more robust to noise compared to the spectral algorithm. Interestingly, this gradient descent approach resembles the message passing equations of an established gene regulatory network inference method, PANDA. Using three cancer-related data sets, we show that OTTER outperforms state-of-the-art inference methods in predicting transcription factor binding to gene regulatory regions. To encourage new graph matching applications to this problem, we have made all networks and validation data publicly available.

20.
NPJ Syst Biol Appl ; 7(1): 8, 2021 01 29.
Artigo em Inglês | MEDLINE | ID: mdl-33514755

RESUMO

The ability of Mycobacterium tuberculosis (Mtb) to adapt to diverse stresses in its host environment is crucial for pathogenesis. Two essential Mtb serine/threonine protein kinases, PknA and PknB, regulate cell growth in response to environmental stimuli, but little is known about their downstream effects. By combining RNA-Seq data, following treatment with either an inhibitor of both PknA and PknB or an inactive control, with publicly available ChIP-Seq and protein-protein interaction data for transcription factors, we show that the Mtb transcription factor (TF) regulatory network propagates the effects of kinase inhibition and leads to widespread changes in regulatory programs involved in cell wall integrity, stress response, and energy production, among others. We also observe that changes in TF regulatory activity correlate with kinase-specific phosphorylation of those TFs. In addition to characterizing the downstream regulatory effects of PknA/PknB inhibition, this demonstrates the need for regulatory network approaches that can incorporate signal-driven transcription factor modifications.


Assuntos
Proteínas de Bactérias/metabolismo , Inibidores de Proteínas Quinases/farmacologia , Proteínas Serina-Treonina Quinases/metabolismo , Proteínas de Bactérias/antagonistas & inibidores , Proteínas de Bactérias/genética , Parede Celular/metabolismo , Expressão Gênica/genética , Regulação Bacteriana da Expressão Gênica/genética , Mycobacterium tuberculosis/efeitos dos fármacos , Mycobacterium tuberculosis/crescimento & desenvolvimento , Mycobacterium tuberculosis/metabolismo , Fosforilação/efeitos dos fármacos , Inibidores de Proteínas Quinases/metabolismo , Proteínas Serina-Treonina Quinases/antagonistas & inibidores , Proteínas Serina-Treonina Quinases/genética
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa