Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 65
Filtrar
1.
BMC Genomics ; 24(1): 387, 2023 Jul 10.
Artigo em Inglês | MEDLINE | ID: mdl-37430204

RESUMO

BACKGROUND: Accessory proteins have diverse roles in coronavirus pathobiology. One of them in SARS-CoV (the causative agent of the severe acute respiratory syndrome outbreak in 2002-2003) is encoded by the open reading frame 8 (ORF8). Among the most dramatic genomic changes observed in SARS-CoV isolated from patients during the peak of the pandemic in 2003 was the acquisition of a characteristic 29-nucleotide deletion in ORF8. This deletion cause splitting of ORF8 into two smaller ORFs, namely ORF8a and ORF8b. Functional consequences of this event are not entirely clear. RESULTS: Here, we performed evolutionary analyses of ORF8a and ORF8b genes and documented that in both cases the frequency of synonymous mutations was greater than that of nonsynonymous ones. These results suggest that ORF8a and ORF8b are under purifying selection, thus proteins translated from these ORFs are likely to be functionally important. Comparisons with several other SARS-CoV genes revealed that another accessory gene, ORF7a, has a similar ratio of nonsynonymous to synonymous mutations suggesting that ORF8a, ORF8b, and ORF7a are under similar selection pressure. CONCLUSIONS: Our results for SARS-CoV echo the known excess of deletions in the ORF7a-ORF7b-ORF8 complex of accessory genes in SARS-CoV-2. A high frequency of deletions in this gene complex might reflect recurrent searches in "functional space" of various accessory protein combinations that may eventually produce more advantageous configurations of accessory proteins similar to the fixed deletion in the SARS-CoV ORF8 gene.


Assuntos
COVID-19 , Humanos , Fases de Leitura Aberta , SARS-CoV-2/genética , Evolução Biológica , Nucleotídeos
2.
Brief Bioinform ; 19(3): 506-523, 2018 05 01.
Artigo em Inglês | MEDLINE | ID: mdl-28069634

RESUMO

Large-scale perturbation databases, such as Connectivity Map (CMap) or Library of Integrated Network-based Cellular Signatures (LINCS), provide enormous opportunities for computational pharmacogenomics and drug design. A reason for this is that in contrast to classical pharmacology focusing at one target at a time, the transcriptomics profiles provided by CMap and LINCS open the door for systems biology approaches on the pathway and network level. In this article, we provide a review of recent developments in computational pharmacogenomics with respect to CMap and LINCS and related applications.


Assuntos
Biologia Computacional/métodos , Perfilação da Expressão Gênica/métodos , Farmacogenética , Bibliotecas de Moléculas Pequenas/farmacologia , Transcriptoma , Bases de Dados Factuais , Redes Reguladoras de Genes , Humanos
3.
BMC Cancer ; 19(1): 1176, 2019 Dec 03.
Artigo em Inglês | MEDLINE | ID: mdl-31796020

RESUMO

BACKGROUND: Deciphering the meaning of the human DNA is an outstanding goal which would revolutionize medicine and our way for treating diseases. In recent years, non-coding RNAs have attracted much attention and shown to be functional in part. Yet the importance of these RNAs especially for higher biological functions remains under investigation. METHODS: In this paper, we analyze RNA-seq data, including non-coding and protein coding RNAs, from lung adenocarcinoma patients, a histologic subtype of non-small-cell lung cancer, with deep learning neural networks and other state-of-the-art classification methods. The purpose of our paper is three-fold. First, we compare the classification performance of different versions of deep belief networks with SVMs, decision trees and random forests. Second, we compare the classification capabilities of protein coding and non-coding RNAs. Third, we study the influence of feature selection on the classification performance. RESULTS: As a result, we find that deep belief networks perform at least competitively to other state-of-the-art classifiers. Second, data from non-coding RNAs perform better than coding RNAs across a number of different classification methods. This demonstrates the equivalence of predictive information as captured by non-coding RNAs compared to protein coding RNAs, conventionally used in computational diagnostics tasks. Third, we find that feature selection has in general a negative effect on the classification performance which means that unfiltered data with all features give the best classification results. CONCLUSIONS: Our study is the first to use ncRNAs beyond miRNAs for the computational classification of cancer and for performing a direct comparison of the classification capabilities of protein coding RNAs and non-coding RNAs.


Assuntos
Neoplasias Pulmonares/classificação , Neoplasias Pulmonares/genética , RNA Mensageiro/metabolismo , RNA não Traduzido/genética , Biologia Computacional/métodos , Árvores de Decisões , Humanos , Neoplasias Pulmonares/patologia , Aprendizado de Máquina , MicroRNAs/genética , Redes Neurais de Computação , RNA Mensageiro/genética , Análise de Sequência de RNA/métodos
4.
Brief Bioinform ; 17(3): 393-407, 2016 05.
Artigo em Inglês | MEDLINE | ID: mdl-26342128

RESUMO

Transcriptome sequencing (RNA-seq) is gradually replacing microarrays for high-throughput studies of gene expression. The main challenge of analyzing microarray data is not in finding differentially expressed genes, but in gaining insights into the biological processes underlying phenotypic differences. To interpret experimental results from microarrays, gene set analysis (GSA) has become the method of choice, in particular because it incorporates pre-existing biological knowledge (in a form of functionally related gene sets) into the analysis. Here we provide a brief review of several statistically different GSA approaches (competitive and self-contained) that can be adapted from microarrays practice as well as those specifically designed for RNA-seq. We evaluate their performance (in terms of Type I error rate, power, robustness to the sample size and heterogeneity, as well as the sensitivity to different types of selection biases) on simulated and real RNA-seq data. Not surprisingly, the performance of various GSA approaches depends only on the statistical hypothesis they test and does not depend on whether the test was developed for microarrays or RNA-seq data. Interestingly, we found that competitive methods have lower power as well as robustness to the samples heterogeneity than self-contained methods, leading to poor results reproducibility. We also found that the power of unsupervised competitive methods depends on the balance between up- and down-regulated genes in tested gene sets. These properties of competitive methods have been overlooked before. Our evaluation provides a concise guideline for selecting GSA approaches, best performing under particular experimental settings in the context of RNA-seq.


Assuntos
RNA/genética , Perfilação da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Reprodutibilidade dos Testes , Tamanho da Amostra , Análise de Sequência de RNA
5.
BMC Bioinformatics ; 18(1): 61, 2017 Jan 24.
Artigo em Inglês | MEDLINE | ID: mdl-28118818

RESUMO

BACKGROUND: Gene set analysis (in a form of functionally related genes or pathways) has become the method of choice for analyzing omics data in general and gene expression data in particular. There are many statistical methods that either summarize gene-level statistics for a gene set or apply a multivariate statistic that accounts for intergene correlations. Most available methods detect complex departures from the null hypothesis but lack the ability to identify the specific alternative hypothesis that rejects the null. RESULTS: GSAR (Gene Set Analysis in R) is an open-source R/Bioconductor software package for gene set analysis (GSA). It implements self-contained multivariate non-parametric statistical methods testing a complex null hypothesis against specific alternatives, such as differences in mean (shift), variance (scale), or net correlation structure. The package also provides a graphical visualization tool, based on the union of two minimum spanning trees, for correlation networks to examine the change in the correlation structures of a gene set between two conditions and highlight influential genes (hubs). CONCLUSIONS: Package GSAR provides a set of multivariate non-parametric statistical methods that test a complex null hypothesis against specific alternatives. The methods in package GSAR are applicable to any type of omics data that can be represented in a matrix format. The package, with detailed instructions and examples, is freely available under the GPL (> = 2) license from the Bioconductor web site.


Assuntos
Biologia Computacional/métodos , Bases de Dados Genéticas , Perfilação da Expressão Gênica , Software , Linhagem Celular Tumoral , Expressão Gênica , Humanos , Modelos Teóricos , Análise Multivariada , Fenótipo , Análise de Sequência de RNA , Proteína Supressora de Tumor p53/genética , Proteína Supressora de Tumor p53/metabolismo
6.
Bioinformatics ; 32(21): 3345-3347, 2016 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-27402900

RESUMO

MOTIVATION: Data from RNA-seq experiments provide us with many new possibilities to gain insights into biological and disease mechanisms of cellular functioning. However, the reproducibility and robustness of RNA-seq data analysis results is often unclear. This is in part attributed to the two counter acting goals of (i) a cost efficient and (ii) an optimal experimental design leading to a compromise, e.g. in the sequencing depth of experiments. RESULTS: We introduce an R package called samExploreR that allows the subsampling (m out of n bootstraping) of short-reads based on SAM files facilitating the investigation of sequencing depth related questions for the experimental design. Overall, this provides a systematic way for exploring the reproducibility and robustness of general RNA-seq studies. We exemplify the usage of samExploreR by studying the influence of the sequencing depth and the annotation on the identification of differentially expressed genes. AVAILABILITY AND IMPLEMENTATION: samExploreR is available as an R package from Bioconductor. CONTACT: v@bio-complexity.comSupplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
RNA/genética , Análise de Sequência de RNA , Reprodutibilidade dos Testes , Projetos de Pesquisa , Software
7.
Methods ; 108: 56-64, 2016 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-27090004

RESUMO

Helicases are enzymes involved in nucleic acid metabolism, playing major roles in replication, transcription, and repair. Defining helicases oligomerization state and transient and persistent protein interactions is essential for understanding of their function. In this article we review current methods for the protein-protein interaction analysis, and discuss examples of its application to the study of helicases: Pif1 and DDX3. Proteomics methods are our main focus - affinity pull-downs and chemical cross-linking followed by mass spectrometry. We review advantages and limitations of these methods and provide general guidelines for their implementation in the functional analysis of helicases.


Assuntos
DNA Helicases/genética , Replicação do DNA/genética , Mapeamento de Interação de Proteínas/métodos , Mapas de Interação de Proteínas/genética , DNA Helicases/química , Reparo do DNA/genética
8.
Bioinformatics ; 30(3): 360-8, 2014 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-24292935

RESUMO

MOTIVATION: To date, gene set analysis approaches primarily focus on identifying differentially expressed gene sets (pathways). Methods for identifying differentially coexpressed pathways also exist but are mostly based on aggregated pairwise correlations or other pairwise measures of coexpression. Instead, we propose Gene Sets Net Correlations Analysis (GSNCA), a multivariate differential coexpression test that accounts for the complete correlation structure between genes. RESULTS: In GSNCA, weight factors are assigned to genes in proportion to the genes' cross-correlations (intergene correlations). The problem of finding the weight vectors is formulated as an eigenvector problem with a unique solution. GSNCA tests the null hypothesis that for a gene set there is no difference in the weight vectors of the genes between two conditions. In simulation studies and the analyses of experimental data, we demonstrate that GSNCA captures changes in the structure of genes' cross-correlations rather than differences in the averaged pairwise correlations. Thus, GSNCA infers differences in coexpression networks, however, bypassing method-dependent steps of network inference. As an additional result from GSNCA, we define hub genes as genes with the largest weights and show that these genes correspond frequently to major and specific pathway regulators, as well as to genes that are most affected by the biological difference between two conditions. In summary, GSNCA is a new approach for the analysis of differentially coexpressed pathways that also evaluates the importance of the genes in the pathways, thus providing unique information that may result in the generation of novel biological hypotheses. AVAILABILITY AND IMPLEMENTATION: Implementation of the GSNCA test in R is available upon request from the authors.


Assuntos
Perfilação da Expressão Gênica/métodos , Redes Reguladoras de Genes , Linhagem Celular Tumoral , Genes p53 , Humanos , Análise Multivariada , Análise de Sequência com Séries de Oligonucleotídeos
9.
Nucleic Acids Res ; 41(7): e82, 2013 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-23389952

RESUMO

In this article, we focus on the analysis of competitive gene set methods for detecting the statistical significance of pathways from gene expression data. Our main result is to demonstrate that some of the most frequently used gene set methods, GSEA, GSEArot and GAGE, are severely influenced by the filtering of the data in a way that such an analysis is no longer reconcilable with the principles of statistical inference, rendering the obtained results in the worst case inexpressive. A possible consequence of this is that these methods can increase their power by the addition of unrelated data and noise. Our results are obtained within a bootstrapping framework that allows a rigorous assessment of the robustness of results and enables power estimates. Our results indicate that when using competitive gene set methods, it is imperative to apply a stringent gene filtering criterion. However, even when genes are filtered appropriately, for gene expression data from chips that do not provide a genome-scale coverage of the expression values of all mRNAs, this is not enough for GSEA, GSEArot and GAGE to ensure the statistical soundness of the applied procedure. For this reason, for biomedical and clinical studies, we strongly advice not to use GSEA, GSEArot and GAGE for such data sets.


Assuntos
Perfilação da Expressão Gênica/métodos , Neoplasias da Mama/genética , Neoplasias da Mama/metabolismo , Interpretação Estatística de Dados , Feminino , Regulação Neoplásica da Expressão Gênica , Genômica/métodos , Humanos , Masculino , Análise de Sequência com Séries de Oligonucleotídeos , Leucemia-Linfoma Linfoblástico de Células Precursoras/genética , Leucemia-Linfoma Linfoblástico de Células Precursoras/metabolismo , Neoplasias da Próstata/genética , Neoplasias da Próstata/metabolismo , Tamanho da Amostra
10.
Chin J Cancer ; 34(10): 427-38, 2015 Aug 08.
Artigo em Inglês | MEDLINE | ID: mdl-26253000

RESUMO

BACKGROUND: Data from RNA-seq experiments provide a wealth of information about the transcriptome of an organism. However, the analysis of such data is very demanding. In this study, we aimed to establish robust analysis procedures that can be used in clinical practice. METHODS: We studied RNA-seq data from triple-negative breast cancer patients. Specifically, we investigated the subsampling of RNA-seq data. RESULTS: The main results of our investigations are as follows: (1) the subsampling of RNA-seq data gave biologically realistic simulations of sequencing experiments with smaller sequencing depth but not direct scaling of count matrices; (2) the saturation of results required an average sequencing depth larger than 32 million reads and an individual sequencing depth larger than 46 million reads; and (3) for an abrogated feature selection, higher moments of the distribution of all expressed genes had a higher sensitivity for signal detection than the corresponding mean values. CONCLUSIONS: Our results reveal important characteristics of RNA-seq data that must be understood before one can apply such an approach to translational medicine.


Assuntos
Perfilação da Expressão Gênica , RNA , Neoplasias de Mama Triplo Negativas , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Transcriptoma
11.
BMC Bioinformatics ; 15: 397, 2014 Dec 05.
Artigo em Inglês | MEDLINE | ID: mdl-25475910

RESUMO

BACKGROUND: Over the last few years transcriptome sequencing (RNA-Seq) has almost completely taken over microarrays for high-throughput studies of gene expression. Currently, the most popular use of RNA-Seq is to identify genes which are differentially expressed between two or more conditions. Despite the importance of Gene Set Analysis (GSA) in the interpretation of the results from RNA-Seq experiments, the limitations of GSA methods developed for microarrays in the context of RNA-Seq data are not well understood. RESULTS: We provide a thorough evaluation of popular multivariate and gene-level self-contained GSA approaches on simulated and real RNA-Seq data. The multivariate approach employs multivariate non-parametric tests combined with popular normalizations for RNA-Seq data. The gene-level approach utilizes univariate tests designed for the analysis of RNA-Seq data to find gene-specific P-values and combines them into a pathway P-value using classical statistical techniques. Our results demonstrate that the Type I error rate and the power of multivariate tests depend only on the test statistics and are insensitive to the different normalizations. In general standard multivariate GSA tests detect pathways that do not have any bias in terms of pathways size, percentage of differentially expressed genes, or average gene length in a pathway. In contrast the Type I error rate and the power of gene-level GSA tests are heavily affected by the methods for combining P-values, and all aforementioned biases are present in detected pathways. CONCLUSIONS: Our result emphasizes the importance of using self-contained non-parametric multivariate tests for detecting differentially expressed pathways for RNA-Seq data and warns against applying gene-level GSA tests, especially because of their high level of Type I error rates for both, simulated and real data.


Assuntos
Perfilação da Expressão Gênica/métodos , Genes Ligados ao Cromossomo X , Sequenciamento de Nucleotídeos em Larga Escala/métodos , RNA/genética , Análise de Sequência de RNA/métodos , Transdução de Sinais , Simulação por Computador , Feminino , Humanos , Linfócitos/metabolismo , Masculino , Software
12.
BMC Bioinformatics ; 15 Suppl 6: S6, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25079297

RESUMO

Cancer is a complex disease that has proven to be difficult to understand on the single-gene level. For this reason a functional elucidation needs to take interactions among genes on a systems-level into account. In this study, we infer a colon cancer network from a large-scale gene expression data set by using the method BC3Net. We provide a structural and a functional analysis of this network and also connect its molecular interaction structure with the chromosomal locations of the genes enabling the definition of cis- and trans-interactions. Furthermore, we investigate the interaction of genes that can be found in close neighborhoods on the chromosomes to gain insight into regulatory mechanisms. To our knowledge this is the first study analyzing the genome-scale colon cancer network.


Assuntos
Neoplasias do Colo/genética , Redes Reguladoras de Genes , Neoplasias do Colo/metabolismo , Biologia Computacional , Perfilação da Expressão Gênica , Humanos , Proteínas/genética , Proteínas/metabolismo
14.
Circulation ; 126(20): 2418-27, 2012 Nov 13.
Artigo em Inglês | MEDLINE | ID: mdl-23065385

RESUMO

BACKGROUND: Carotid intima-media thickening is associated with increased cardiovascular risk in humans. We discovered that intima formation and cell proliferation in response to carotid injury is greater in SJL/J (SJL) in comparison with C3HeB/FeJ (C3H/F) mice. The purpose of this study was to identify candidate genes contributing to intima formation. METHODS AND RESULTS: We performed microarray and bioinformatic analyses of carotid arteries from C3H/F and SJL mice. Kyoto Encyclopedia of Genes and Genomes analysis showed that the ribosome pathway was significantly up-regulated in C3H/F in comparison with SJL mice. Expression of a ribosomal protein, RpL17, was >40-fold higher in C3H/F carotids in comparison with SJL. Aortic vascular smooth muscle cells from C3H/F grew slower in comparison to SJL. To determine the role of RpL17 in vascular smooth muscle cell growth regulation, we analyzed the relationship between RpL17 expression and cell cycle progression. Cultured vascular smooth muscle cells from mice, rats, and humans showed that RpL17 expression inversely correlated with growth as shown by decreased cells in S phase and increased cells in G(0)/G(1). To prove that RpL17 acted as a growth inhibitor in vivo, we used pluronic gel delivery of RpL17 small interfering RNA to C3H/F carotid arteries. This resulted in an 8-fold increase in the number of proliferating cells. Furthermore, following partial carotid ligation in SJL mice, RpL17 expression in the intima and media decreased, but the number of proliferating cells increased. CONCLUSIONS: RpL17 acts as a vascular smooth muscle cell growth inhibitor (akin to a tumor suppressor) and represents a potential therapeutic target to limit carotid intima-media thickening.


Assuntos
Artérias Carótidas/citologia , Artérias Carótidas/fisiologia , Proliferação de Células , Músculo Liso Vascular/citologia , Músculo Liso Vascular/fisiologia , Proteínas Ribossômicas/fisiologia , Túnica Íntima/citologia , Animais , Ciclo Celular/fisiologia , Células Cultivadas , Biologia Computacional , Fase G1/fisiologia , Humanos , Masculino , Camundongos , Camundongos Endogâmicos C3H , Camundongos Endogâmicos , Análise em Microsséries , Ratos , Fase de Repouso do Ciclo Celular/fisiologia , Fase S/fisiologia , Túnica Íntima/fisiologia , Túnica Média/citologia , Túnica Média/fisiologia
15.
Front Agron ; 52023.
Artigo em Inglês | MEDLINE | ID: mdl-38223701

RESUMO

Major food crops, such as rice and maize, display severe yield losses (30-50%) under salt stress. Furthermore, problems associated with soil salinity are anticipated to worsen due to climate change. Therefore, it is necessary to implement sustainable agricultural strategies, such as exploiting beneficial plant-microbe associations, for increased crop yields. Plants can develop associations with beneficial microbes, including arbuscular mycorrhiza and plant growth-promoting bacteria (PGPB). PGPB improve plant growth via multiple mechanisms, including protection against biotic and abiotic stresses. Azospirillum brasilense, one of the most studied PGPB, can mitigate salt stress in different crops. However, little is known about the molecular mechanisms by which A. brasilense mitigates salt stress. This study shows that total and root plant mass is improved in A. brasilense-inoculated rice plants compared to the uninoculated plants grown under high salt concentrations (100 mM and 200 mM NaCl). We observed this growth improvement at seven- and fourteen days post-treatment (dpt). Next, we used transcriptomic approaches and identified differentially expressed genes (DEGs) in rice roots when exposed to three treatments: 1) A. brasilense, 2) salt (200 mM NaCl), and 3) A. brasilense and salt (200 mM NaCl), at seven dpt. We identified 786 DEGs in the A. brasilense-treated plants, 4061 DEGs in the salt-stressed plants, and 1387 DEGs in the salt-stressed A. brasilense-treated plants. In the A. brasilense-treated plants, we identified DEGs involved in defense, hormone, and nutrient transport, among others. In the salt-stressed plants, we identified DEGs involved in abscisic acid and jasmonic acid signaling, antioxidant enzymes, sodium and potassium transport, and calcium signaling, among others. In the salt-stressed A. brasilense-treated plants, we identified some genes involved in salt stress response and tolerance (e.g., abscisic acid and jasmonic acid signaling, antioxidant enzymes, calcium signaling), and sodium and potassium transport differentially expressed, among others. We also identified some A. brasilense-specific plant DEGs, such as nitrate transporters and defense genes. Furthermore, our results suggest genes involved in auxin and ethylene signaling are likely to play an important role during these interactions. Overall, our transcriptomic data indicate that A. brasilense improves rice growth under salt stress by regulating the expression of key genes involved in defense and stress response, abscisic acid and jasmonic acid signaling, and ion and nutrient transport, among others. Our findings will provide essential insights into salt stress mitigation in rice by A. brasilense.

16.
Mol Ecol ; 21(17): 4287-99, 2012 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-22774776

RESUMO

Gene expression responds to the environment and can also evolve rapidly in response to altered selection regimes. Little is known, however, about the extent to which evolutionary adaptation to a particular type of stress involves changes in the within-generation ('plastic') responses of gene expression to the stress. We used microarrays to quantify gene expression plasticity in response to ethanol in laboratory populations of Drosophila melanogaster differing in their history of ethanol exposure. Two populations ('R' populations) were maintained on regular medium, two ('E') were maintained on medium supplemented with ethanol, and two ('M') were maintained in a mixed regime in which half of the population was reared on one medium type, and half on the other, each generation. After more than 300 generations, embryos from each population were collected and exposed to either ethanol or water as a control, and RNA was extracted from the larvae shortly after hatching. Nearly 2000 transcripts showed significant within-generation responses to ethanol exposure. Evolutionary history also affected gene expression: the E and M populations were largely indistinguishable in expression, but differed significantly in expression from the R populations for over 100 transcripts, the majority of which did not show plastic responses. Notably, in no case was the interaction between selection regime and ethanol exposure significant after controlling for multiple comparisons, indicating that adaptation to ethanol in the E and M populations did not involve substantial changes in gene expression plasticity. The results give evidence that expression plasticity evolves considerably more slowly than mean expression.


Assuntos
Adaptação Fisiológica/genética , Drosophila melanogaster/genética , Etanol , Evolução Molecular , Seleção Genética , Animais , Drosophila melanogaster/efeitos dos fármacos , Regulação da Expressão Gênica , Genes de Insetos , Larva/efeitos dos fármacos , Larva/genética , Análise de Sequência com Séries de Oligonucleotídeos
17.
Sci Rep ; 12(1): 8827, 2022 05 25.
Artigo em Inglês | MEDLINE | ID: mdl-35614083

RESUMO

Non-legume plants such as rice and maize can form beneficial associations with plant growth-promoting bacteria (PGPB) such as Herbaspirillum seropedicae and Azospirillum brasilense. Several studies have shown that these PGPB promote plant growth via multiple mechanisms. Our current understanding of the molecular aspects and signaling between plants like rice and PGPB like Herbaspirillum seropedicae is limited. In this study, we used an experimental system where H. seropedicae could colonize the plant roots and promote growth in wild-type rice. Using this experimental setup, we identified 1688 differentially expressed genes (DEGs) in rice roots, 1 day post-inoculation (dpi) with H. seropedicae. Several of these DEGs encode proteins involved in the flavonoid biosynthetic pathway, defense, hormone signaling pathways, and nitrate and sugar transport. We validated the expression pattern of some genes via RT-PCR. Next, we compared the DEGs identified in this study to those we previously identified in rice roots during associations with another PGPB, Azospirillum brasilense. We identified 628 genes that were differentially expressed during both associations. The expression pattern of these genes suggests that some of these are likely to play a significant role(s) during associations with both H. seropedicae and A. brasilense and are excellent targets for future studies.


Assuntos
Azospirillum brasilense , Herbaspirillum , Oryza , Azospirillum brasilense/genética , Expressão Gênica , Herbaspirillum/genética , Herbaspirillum/metabolismo , Oryza/genética , Oryza/microbiologia , Raízes de Plantas/metabolismo
18.
Bioinformatics ; 26(3): 348-54, 2010 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-19996162

RESUMO

MOTIVATION: Very little attention has been given to gene selection procedures based on intergene correlation structure, which is often neglected in the context of differential gene expression analysis. We propose a statistical procedure to select genes that have different associations with others across different phenotypes. This procedure is based on a new gene association score, called the covariance distance. RESULTS: We apply the proposed method, along with two alternative methods, to several simulated datasets and find out that our method is much more powerful than the other two. For biological data, we demonstrate that the analysis of differentially associated genes complements the analysis of differentially expressed genes. Combining both procedures provides a more comprehensive functional interpretation of the experimental results. AVAILABILITY: The code is downloadable from http://www.urmc.rochester.edu/biostat/people/faculty/hu.cfm. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional/métodos , Modelos Estatísticos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Bases de Dados Genéticas , Variação Genética , Humanos , Fenótipo
19.
J Cell Biol ; 174(5): 665-75, 2006 Aug 28.
Artigo em Inglês | MEDLINE | ID: mdl-16923827

RESUMO

The spindle pole body (SPB) is the sole site of microtubule nucleation in Saccharomyces cerevisiae; yet, details of its assembly are poorly understood. Integral membrane proteins including Mps2 anchor the soluble core SPB in the nuclear envelope. Adjacent to the core SPB is a membrane-associated SPB substructure known as the half-bridge, where SPB duplication and microtubule nucleation during G1 occurs. We found that the half-bridge component Mps3 is the budding yeast member of the SUN protein family (Sad1-UNC-84 homology) and provide evidence that it interacts with the Mps2 C terminus to tether the half-bridge to the core SPB. Mutants in the Mps3 SUN domain or Mps2 C terminus have SPB duplication and karyogamy defects that are consistent with the aberrant half-bridge structures we observe cytologically. The interaction between the Mps3 SUN domain and Mps2 C terminus is the first biochemical link known to connect the half-bridge with the core SPB. Association with Mps3 also defines a novel function for Mps2 during SPB duplication.


Assuntos
Proteínas de Ciclo Celular/metabolismo , Proteínas de Membrana/genética , Proteínas de Membrana/metabolismo , Membrana Nuclear/metabolismo , Proteínas Nucleares/metabolismo , Proteínas Serina-Treonina Quinases/metabolismo , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo , Saccharomyces cerevisiae/fisiologia , Fuso Acromático/metabolismo , Sequência de Aminoácidos , Ciclo Celular/genética , Proteínas de Ciclo Celular/genética , Centrossomo/metabolismo , Quinase do Ponto de Checagem 2 , Segregação de Cromossomos , Proteínas de Membrana/química , Dados de Sequência Molecular , Mutação , Proteínas Nucleares/química , Proteínas Nucleares/genética , Ligação Proteica , Proteínas Serina-Treonina Quinases/genética , Estrutura Terciária de Proteína , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/química , Alinhamento de Sequência , Homologia de Sequência de Aminoácidos
20.
Sci Rep ; 11(1): 156, 2021 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-33420139

RESUMO

The identification of prognostic biomarkers for predicting cancer progression is an important problem for two reasons. First, such biomarkers find practical application in a clinical context for the treatment of patients. Second, interrogation of the biomarkers themselves is assumed to lead to novel insights of disease mechanisms and the underlying molecular processes that cause the pathological behavior. For breast cancer, many signatures based on gene expression values have been reported to be associated with overall survival. Consequently, such signatures have been used for suggesting biological explanations of breast cancer and drug mechanisms. In this paper, we demonstrate for a large number of breast cancer signatures that such an implication is not justified. Our approach eliminates systematically all traces of biological meaning of signature genes and shows that among the remaining genes, surrogate gene sets can be formed with indistinguishable prognostic prediction capabilities and opposite biological meaning. Hence, our results demonstrate that none of the studied signatures has a sensible biological interpretation or meaning with respect to disease etiology. Overall, this shows that prognostic signatures are black-box models with sensible predictions of breast cancer outcome but no value for revealing causal connections. Furthermore, we show that the number of such surrogate gene sets is not small but very large.


Assuntos
Biomarcadores Tumorais/genética , Neoplasias da Mama/genética , Biomarcadores Tumorais/metabolismo , Neoplasias da Mama/diagnóstico , Neoplasias da Mama/metabolismo , Neoplasias da Mama/mortalidade , Feminino , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Humanos , Prognóstico , Transcriptoma
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa