RESUMO
MOTIVATION: Gene regulatory network (GRN) inference reveals the influences genes have on one another in cellular regulatory systems. If the experimental data are inadequate for reliable inference of the network, informative priors have been shown to improve the accuracy of inferences. RESULTS: This study explores the potential of undirected, confidence-weighted networks, such as those in functional association databases, as a prior source for GRN inference. Such networks often erroneously indicate symmetric interaction between genes and may contain mostly correlation-based interaction information. Despite these drawbacks, our testing on synthetic datasets indicates that even noisy priors reflect some causal information that can improve GRN inference accuracy. Our analysis on yeast data indicates that using the functional association databases FunCoup and STRING as priors can give a small improvement in GRN inference accuracy with biological data.
Assuntos
Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Saccharomyces cerevisiae/genéticaRESUMO
The soybean aphid (Aphis glycines) is a major phloem-feeding pest of soybean (Glycine max). A. glycines feeding can cause the diversion of photosynthates and transmission of plant viruses, resulting in significant yield losses. In this study, we used oligonucleotide microarrays to characterize the long-term transcriptional response to soybean aphid colonization of two related soybean cultivars, one with the Rag1 aphid-resistance gene and one aphid-susceptible cultivar (without Rag1). Transcriptome profiles were determined after 1 and 7 days of aphid infestation. Our results revealed a susceptible response involving hundreds of transcripts, whereas only one transcript changed in the resistant response to aphids. This nonexistent resistance response might be explained by the fact that many defense-related transcripts are constitutively expressed in resistant plants, whereas these same genes are activated in susceptible plants only during aphid infestation. Analysis of phytohormone-related transcripts in the susceptible response showed different hormone profiles for the two time points, and suggest that aphids are able to suppress hormone signals in susceptible plants. A significant activation of abscissic acid, normally associated with abiotic stress responses, at day 7, might be a decoy strategy implemented by the aphid to suppress effective salicylic acid- and jasmonate-related defenses.
Assuntos
Afídeos/fisiologia , Glycine max/genética , Doenças das Plantas/imunologia , Reguladores de Crescimento de Plantas/genética , Proteínas de Plantas/genética , Animais , Resistência à Doença , Suscetibilidade a Doenças , Etilenos/farmacologia , Comportamento Alimentar , Anotação de Sequência Molecular , Análise de Sequência com Séries de Oligonucleotídeos , Floema/genética , Floema/imunologia , Floema/parasitologia , Floema/fisiologia , Doenças das Plantas/parasitologia , Reguladores de Crescimento de Plantas/farmacologia , Folhas de Planta/genética , Folhas de Planta/imunologia , Folhas de Planta/parasitologia , Folhas de Planta/fisiologia , RNA de Plantas/genética , Salicilatos/farmacologia , Glycine max/imunologia , Glycine max/parasitologia , Glycine max/fisiologia , TranscriptomaRESUMO
OBJECTIVE: To use cell-based gene signatures to identify patients with systemic lupus erythematous (SLE) in the phase II/III APRIL-SLE and phase IIb ADDRESS II trials most likely to respond to atacicept. METHODS: A published immune cell deconvolution algorithm based on Affymetrix gene array data was applied to whole blood gene expression from patients entering APRIL-SLE. Five distinct patient clusters were identified. Patient characteristics, biomarkers, and clinical response to atacicept were assessed per cluster. A modified immune cell deconvolution algorithm was developed based on RNA sequencing data and applied to ADDRESS II data to identify similar patient clusters and their responses. RESULTS: Patients in APRIL-SLE (N = 105) were segregated into the following five clusters (P1-5) characterized by dominant cell subset signatures: high neutrophils, T helper cells and natural killer (NK) cells (P1), high plasma cells and activated NK cells (P2), high B cells and neutrophils (P3), high B cells and low neutrophils (P4), or high activated dendritic cells, activated NK cells, and neutrophils (P5). Placebo- and atacicept-treated patients in clusters P2,4,5 had markedly higher British Isles Lupus Assessment Group (BILAG) A/B flare rates than those in clusters P1,3, with a greater treatment effect of atacicept on lowering flares in clusters P2,4,5. In ADDRESS II, placebo-treated patients from P2,4,5 were less likely to be SLE Responder Index (SRI)-4, SRI-6, and BILAG-Based Combined Lupus Assessment responders than those in P1,3; the response proportions again suggested lower placebo effect and a greater treatment differential for atacicept in P2,4,5. CONCLUSION: This exploratory analysis indicates larger differences between placebo- and atacicept-treated patients with SLE in a molecularly defined patient subset.
RESUMO
Systemic lupus erythematosus (SLE) is an autoimmune disease affecting multiple organ systems. Many investigational agents have failed or shown only modest effects when added to standard of care (SoC) therapy in placebo-controlled trials, and only two therapies have been approved for SLE in the last 60 years. Clinical trial outcomes have shown discordance in drug effects between clinical endpoints. Herein, we characterized longitudinal disease activity in the SLE population and the sources of variability by developing a latent disease trajectory model for SLE component endpoints (Systemic Lupus Erythematosus Disease Activity Index [SLEDAI], Physician's Global Assessment [PGA], British Isles Lupus Assessment Group Index [BILAG]) and composite endpoints (Systemic Lupus Erythematosus Responder Index [SRI], BILAG-based Composite Lupus Assessment [BICLA], and Lupus Low Disease Activity State [LLDAS]) using patient-level historical SoC data from nine phase II and III studies. Across all endpoints, in predictions up to 52 weeks from the final disease trajectory model, the following baseline covariates were associated with a greater decrease in SLE disease activity and higher response to placebo + SoC: Hispanic ethnicity from Central/South America, absence of hypocomplementemia, recent SLE diagnosis, and high baseline disease activity score using SLEDAI and BILAG separately. No discernible differences were observed in the trajectory of response to placebo + SoC across different SoC medications (antimalarial and immunosuppressant such as mycophenolate, methotrexate, and azathioprine). Across all endpoints, disease trajectory showed no difference in Asian versus non-Asian patients, supporting Asia-inclusive global SLE drug development. These results describe the first population approach to support a model-informed drug development framework in SLE.
Assuntos
Anticorpos Monoclonais Humanizados , Lúpus Eritematoso Sistêmico , Humanos , Anticorpos Monoclonais Humanizados/uso terapêutico , Índice de Gravidade de Doença , Resultado do Tratamento , Lúpus Eritematoso Sistêmico/tratamento farmacológico , Lúpus Eritematoso Sistêmico/diagnóstico , Imunossupressores/uso terapêutico , Gravidade do Paciente , ProbabilidadeRESUMO
The gene regulatory network (GRN) of human cells encodes mechanisms to ensure proper functioning. However, if this GRN is dysregulated, the cell may enter into a disease state such as cancer. Understanding the GRN as a system can therefore help identify novel mechanisms underlying disease, which can lead to new therapies. To deduce regulatory interactions relevant to cancer, we applied a recent computational inference framework to data from perturbation experiments in squamous carcinoma cell line A431. GRNs were inferred using several methods, and the false discovery rate was controlled by the NestBoot framework. We developed a novel approach to assess the predictiveness of inferred GRNs against validation data, despite the lack of a gold standard. The best GRN was significantly more predictive than the null model, both in cross-validated benchmarks and for an independent dataset of the same genes under a different perturbation design. The inferred GRN captures many known regulatory interactions central to cancer-relevant processes in addition to predicting many novel interactions, some of which were experimentally validated, thus providing mechanistic insights that are useful for future cancer research.
Assuntos
Carcinogênese/genética , Redes Reguladoras de Genes , Genes Neoplásicos , Neoplasias Encefálicas/genética , Neoplasias Encefálicas/patologia , Carcinoma de Células Escamosas/genética , Carcinoma de Células Escamosas/patologia , Linhagem Celular Tumoral , Meios de Cultura Livres de Soro , Técnicas de Silenciamento de Genes , Glioma/genética , Glioma/patologia , Humanos , Método de Monte Carlo , Interferência de RNA , RNA Interferente Pequeno/genéticaRESUMO
Soybean aphids (Aphis glycines Matsumura) are specialized insects that feed on soybean (Glycine max) phloem sap. Transcriptome analyses have shown that resistant soybean plants mount a fast response that limits aphid feeding and population growth. Conversely, defense responses in susceptible plants are slower and it is hypothesized that aphids block effective defenses in the compatible interaction. Unlike other pests, aphids can colonize plants for long periods of time; yet the effect on the plant transcriptome after long-term aphid feeding has not been analyzed for any plant-aphid interaction. We analyzed the susceptible and resistant (Rag1) transcriptome response to aphid feeding in soybean plants colonized by aphids (biotype 1) for 21 days. We found a reduced resistant response and a low level of aphid growth on Rag1 plants, while susceptible plants showed a strong response consistent with pattern-triggered immunity. GO-term analyses identified chitin regulation as one of the most overrepresented classes of genes, suggesting that chitin could be one of the hemipteran-associated molecular pattern that triggers this defense response. Transcriptome analyses also indicated the phenylpropanoid pathway, specifically isoflavonoid biosynthesis, was induced in susceptible plants in response to long-term aphid feeding. Metabolite analyses corroborated this finding. Aphid-treated susceptible plants accumulated daidzein, formononetin, and genistein, although glyceollins were present at low levels in these plants. Choice experiments indicated that daidzein may have a deterrent effect on aphid feeding. Mass spectrometry imaging showed these isoflavones accumulate likely in the mesophyll cells or epidermis and are absent from the vasculature, suggesting that isoflavones are part of a non-phloem defense response that can reduce aphid feeding. While it is likely that aphid can initially block defense responses in compatible interactions, it appears that susceptible soybean plants can eventually mount an effective defense in response to long-term soybean aphid colonization.
RESUMO
BACKGROUND: Pharmacodynamic biomarkers are becoming increasingly valuable for assessing drug activity and target modulation in clinical trials. However, identifying quality biomarkers is challenging due to the increasing volume and heterogeneity of relevant data describing the biological networks that underlie disease mechanisms. A biological pathway network typically includes entities (e.g. genes, proteins and chemicals/drugs) as well as the relationships between these and is typically curated or mined from structured databases and textual co-occurrence data. We propose a hybrid Natural Language Processing and directed relationships-based network analysis approach using IBM Watson for Drug Discovery to rank all human genes and identify potential candidate biomarkers, requiring only an initial determination of a specific target-disease relationship. METHODS: Through natural language processing of scientific literature, Watson for Drug Discovery creates a network of semantic relationships between biological concepts such as genes, drugs, and diseases. Using Bruton's tyrosine kinase as a case study, Watson for Drug Discovery's automatically extracted relationship network was compared with a prominent manually curated physical interaction network. Additionally, potential biomarkers for Bruton's tyrosine kinase inhibition were predicted using a matrix factorization approach and subsequently compared with expert-generated biomarkers. RESULTS: Watson's natural language processing generated a relationship network matching 55 (86%) genes upstream of BTK and 98 (95%) genes downstream of Bruton's tyrosine kinase in a prominent manually curated physical interaction network. Matrix factorization analysis predicted 11 of 13 genes identified by Merck subject matter experts in the top 20% of Watson for Drug Discovery's 13,595 ranked genes, with 7 in the top 5%. CONCLUSION: Taken together, these results suggest that Watson for Drug Discovery's automatic relationship network identifies the majority of upstream and downstream genes in biological pathway networks and can be used to help with the identification and prioritization of pharmacodynamic biomarker evaluation, accelerating the early phases of disease hypothesis generation.
Assuntos
Biomarcadores/análise , Descoberta de Drogas/métodos , Tirosina Quinase da Agamaglobulinemia/antagonistas & inibidores , Tirosina Quinase da Agamaglobulinemia/genética , Tirosina Quinase da Agamaglobulinemia/metabolismo , Área Sob a Curva , Bases de Dados Factuais , Humanos , Redes e Vias Metabólicas , Processamento de Linguagem Natural , Curva ROC , Bibliotecas de Moléculas Pequenas/farmacocinéticaRESUMO
The soybean aphid (Aphis glycines) is one of the main insect pests of soybean (Glycine max) worldwide. Genomics approaches have provided important data on transcriptome changes, both in the insect and in the plant, in response to the plant-aphid interaction. However, the difficulties to transform soybean and to rear soybean aphid on artificial media have hindered our ability to systematically test the function of genes identified by those analyses as mediators of plant resistance to the insect. An efficient approach to produce transgenic soybean material is the production of transformed hairy roots using Agrobacterium rhizogenes; however, soybean aphids colonize leaves or stems and thus this approach has not been utilized. Here, we developed a hairy root system that allowed effective aphid feeding. We show that this system supports aphid performance similar to that observed in leaves. The use of hairy roots to study plant resistance is validated by experiments showing that roots generated from cotyledons of resistant lines carrying the Rag1 or Rag2 resistance genes are also resistant to aphid feeding, while related susceptible lines are not. Our results demonstrate that hairy roots are a good system to study soybean aphid-soybean interactions, providing a quick and effective method that could be used for functional analysis of the resistance response to this insect.
Assuntos
Afídeos/patogenicidade , Glycine max/parasitologia , Agrobacterium/fisiologia , Animais , Controle Biológico de Vetores , Folhas de Planta/metabolismo , Folhas de Planta/parasitologia , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Raízes de Plantas/metabolismo , Raízes de Plantas/parasitologia , Glycine max/metabolismoRESUMO
A key question in network inference, that has not been properly answered, is what accuracy can be expected for a given biological dataset and inference method. We present GeneSPIDER - a Matlab package for tuning, running, and evaluating inference algorithms that allows independent control of network and data properties to enable data-driven benchmarking. GeneSPIDER is uniquely suited to address this question by first extracting salient properties from the experimental data and then generating simulated networks and data that closely match these properties. It enables data-driven algorithm selection, estimation of inference accuracy from biological data, and a more multifaceted benchmarking. Included are generic pipelines for the design of perturbation experiments, bootstrapping, analysis of linear dependence, sample selection, scaling of SNR, and performance evaluation. With GeneSPIDER we aim to move the goal of network inference benchmarks from simple performance measurement to a deeper understanding of how the accuracy of an algorithm is determined by different combinations of network and data properties.
Assuntos
Algoritmos , Redes Reguladoras de Genes , Animais , Benchmarking , Humanos , Modelos GenéticosRESUMO
The NUDIX enzymes are involved in cellular metabolism and homeostasis, as well as mRNA processing. Although highly conserved throughout all organisms, their biological roles and biochemical redundancies remain largely unclear. To address this, we globally resolve their individual properties and inter-relationships. We purify 18 of the human NUDIX proteins and screen 52 substrates, providing a substrate redundancy map. Using crystal structures, we generate sequence alignment analyses revealing four major structural classes. To a certain extent, their substrate preference redundancies correlate with structural classes, thus linking structure and activity relationships. To elucidate interdependence among the NUDIX hydrolases, we pairwise deplete them generating an epistatic interaction map, evaluate cell cycle perturbations upon knockdown in normal and cancer cells, and analyse their protein and mRNA expression in normal and cancer tissues. Using a novel FUSION algorithm, we integrate all data creating a comprehensive NUDIX enzyme profile map, which will prove fundamental to understanding their biological functionality.
Assuntos
Perfilação da Expressão Gênica/métodos , Redes Reguladoras de Genes , Família Multigênica , Pirofosfatases/genética , Células A549 , Linhagem Celular , Linhagem Celular Tumoral , Regulação Enzimológica da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Humanos , Células MCF-7 , Filogenia , Pirofosfatases/classificação , Pirofosfatases/metabolismo , Interferência de RNA , Especificidade por Substrato , Nudix HidrolasesRESUMO
Statistical regularisation methods such as LASSO and related L1 regularised regression methods are commonly used to construct models of gene regulatory networks. Although they can theoretically infer the correct network structure, they have been shown in practice to make errors, i.e. leave out existing links and include non-existing links. We show that L1 regularisation methods typically produce a poor network model when the analysed data are ill-conditioned, i.e. the gene expression data matrix has a high condition number, even if it contains enough information for correct network inference. However, the correct structure of network models can be obtained for informative data, data with such a signal to noise ratio that existing links can be proven to exist, when these methods fail, by using least-squares regression and setting small parameters to zero, or by using robust network inference, a recent method taking the intersection of all non-rejectable models. Since available experimental data sets are generally ill-conditioned, we recommend to check the condition number of the data matrix to avoid this pitfall of L1 regularised inference, and to also consider alternative methods.
Assuntos
Redes Reguladoras de Genes , Modelos Biológicos , Modelos Estatísticos , Algoritmos , Conjuntos de Dados como Assunto , Reprodutibilidade dos TestesRESUMO
Gene regulatory network inference (that is, determination of the regulatory interactions between a set of genes) provides mechanistic insights of central importance to research in systems biology. Most contemporary network inference methods rely on a sparsity/regularization coefficient, which we call ζ (zeta), to determine the degree of sparsity of the network estimates, that is, the total number of links between the nodes. However, they offer little or no advice on how to select this sparsity coefficient, in particular, for biological data with few samples. We show that an empty network is more accurate than estimates obtained for a poor choice of ζ. In order to avoid such poor choices, we propose a method for optimization of ζ, which maximizes the accuracy of the inferred network for any sparsity-dependent inference method and data set. Our procedure is based on leave-one-out cross-optimization and selection of the ζ value that minimizes the prediction error. We also illustrate the adverse effects of noise, few samples, and uninformative experiments on network inference as well as our method for optimization of ζ. We demonstrate that our ζ optimization method for two widely used inference algorithms--Glmnet and NIR--gives accurate and informative estimates of the network structure, given that the data is informative enough.
Assuntos
Algoritmos , Regulação da Expressão Gênica , Modelos BiológicosRESUMO
BACKGROUND: Phytohormones mediate plant defense responses to pests and pathogens. In particular, the hormones jasmonic acid, ethylene, salicylic acid, and abscisic acid have been shown to dictate and fine-tune defense responses, and identification of the phytohormone components of a particular defense response is commonly used to characterize it. Identification of phytohormone regulation is particularly important in transcriptome analyses. Currently there is no computational tool to determine the relative activity of these hormones that can be applied to transcriptome analyses in soybean. FINDINGS: We developed a pathway analysis method that provides a broad measure of the activation or suppression of individual phytohormone pathways based on changes in transcript expression of pathway-related genes. The magnitude and significance of these changes are used to determine a pathway score for a phytohormone for a given comparison in a microarray experiment. Scores for individual hormones can then be compared to determine the dominant phytohormone in a given defense response. To validate this method, it was applied to publicly available data from previous microarray experiments that studied the response of soybean plants to Asian soybean rust and soybean cyst nematode. The results of the analyses for these experiments agreed with our current understanding of the role of phytohormones in these defense responses. CONCLUSIONS: This method is useful in providing a broad measure of the relative induction and suppression of soybean phytohormones during a defense response. This method could be used as part of microarray studies that include individual transcript analysis, gene set analysis, and other methods for a comprehensive defense response characterization.