RESUMEN
BACKGROUND: Toll-like receptors, which stimulated by pathogen-associated molecular patterns such as lipopolysaccharides (LPS), induces the releasing of many kinds of proinflammatory cytokines to activate subsequent immune responses. Plenty of studies have also indicated the importance of TLR-signalling on the avoidance of excessive inflammation, tissue repairing and the return to homeostasis after infection and tissue injury. The significance of TLR-signalling attracts many attentions on the regulatory mechanisms since several years ago. However, as newly discovered regulators, how and how many different microRNAs (miRNAs) regulate TLR-signalling pathway are still unclear. RESULTS: By integrating several microarray datasets and miRNA-target information datasets, we identified 431 miRNAs and 498 differentially expressed target genes in bone marrow-derived macrophages (BMDMs) with LPS-stimulation. Cooperative miRNA network were constructed by calculating targets overlap scores, and a sub-network finding algorithm was used to identify cooperative miRNA modules. Finally, 17 and 8 modules are identified in the cooperative miRNA networks composed of miRNAs up-regulate and down-regulate genes, respectively. CONCLUSIONS: We used gene expression data of mouse macrophage stimulated by LPS and miRNA-target information to infer the regulatory mechanism of miRNAs on LPS-induced signalling pathway. Also, our results suggest that miRNAs can be important regulators of LPS-induced innate immune response in BMDMs.
Asunto(s)
Perfilación de la Expresión Génica/métodos , Redes Reguladoras de Genes , Lipopolisacáridos/farmacología , Macrófagos/efectos de los fármacos , MicroARNs/genética , Animales , Células Cultivadas , Biología Computacional/métodos , Bases de Datos Genéticas , Regulación de la Expresión Génica/efectos de los fármacos , Redes Reguladoras de Genes/efectos de los fármacos , Macrófagos/citología , Ratones , Transducción de Señal/efectos de los fármacos , Receptores Toll-Like/genéticaRESUMEN
Background: Urothelial carcinoma (UC) patients often bear clinical and genetic heterogeneity, which may differ in management and prognosis. Especially, patients with advanced/metastatic UC generally have a poor prognosis and survive for only few months. The Wnt/ß-catenin signaling is found to be highly activated in several cancers, including UC. However, accumulated evidence has shown discordance between the Wnt/ß-catenin signaling and UC carcinogenesis. Accordingly, we aim to get a better understanding of the molecular characterization of UC, focusing on the Wnt signaling, which may add value to guiding management more precisely. Patients and Methods: Clinical data and pathological features were retrospectively surveyed. The correlations of secreted Frizzled-related protein 2 (SFRP2) immunoexpression with clinicopathological features were analyzed by Pearson's chi-square test. The Kaplan-Meier method with a log-rank test was employed to plot survival curves. All significant features from the univariate analysis were incorporated into the Cox regression model for multivariate analysis. Results: Following data mining on a transcriptome dataset (GSE31684), we identified that 8 transcripts in relation to the Wnt signaling pathway (GO: 0016055) were significantly upregulated in advanced/metastatic bladder tumors. Among these transcripts, the SFRP2 level showed the most significant upregulation. Additionally, as SFRP2 is a putative Wnt inhibitor and may be expressed by stroma, we were interested in examining the immunoexpression and clinical relevance of stromal and tumoral SFRP2 in our urothelial carcinoma cohorts containing 295 urinary bladder UC (UBUC) and 340 upper urinary tract UC (UTUC) patients. We observed that high SFRP2 expression in stroma but not in tumors is significantly linked to aggressive UC features, including high tumor stage and histological grade, positive nodal metastasis, the presence of vascular and perineural invasion, and high mitotic activity in UBUC and UTUC. Moreover, high stromal SFRP2 expression significantly and independently predicted worse clinical outcomes in UBUC and UTUC. Utilizing bioinformatic analysis, we further noticed that stromal SFRP2 may link epithelial-mesenchymal transition (EMT) to UC progression. Conclusion: Collectively, these results imply that stromal SFRP2 may exert oncogenic function beyond its Wnt antagonistic ability, and stromal SFRP2 expression can provide prognostic and therapeutic implications for UC patients.
RESUMEN
BACKGROUND: Transcriptional regulation of gene expression is usually accomplished by multiple interactive transcription factors (TFs). Therefore, it is crucial to understand the precise cooperative interactions among TFs. Various kinds of experimental data including ChIP-chip, TF binding site (TFBS), gene expression, TF knockout and protein-protein interaction data have been used to identify cooperative TF pairs in existing methods. The nucleosome occupancy data is not yet used for this research topic despite that several researches have revealed the association between nucleosomes and TFBSs. RESULTS: In this study, we developed a novel method to infer the cooperativity between two TFs by integrating the TF-gene documented regulation, TFBS and nucleosome occupancy data. TF-gene documented regulation and TFBS data were used to determine the target genes of a TF, and the genome-wide nucleosome occupancy data was used to assess the nucleosome occupancy on TFBSs. Our method identifies cooperative TF pairs based on two biologically plausible assumptions. If two TFs cooperate, then (i) they should have a significantly higher number of common target genes than random expectation and (ii) their binding sites (in the promoters of their common target genes) should tend to be co-depleted of nucleosomes in order to make these binding sites simultaneously accessible to TF binding. Each TF pair is given a cooperativity score by our method. The higher the score is, the more likely a TF pair has cooperativity. Finally, a list of 27 cooperative TF pairs has been predicted by our method. Among these 27 TF pairs, 19 pairs are also predicted by existing methods. The other 8 pairs are novel cooperative TF pairs predicted by our method. The biological relevance of these 8 novel cooperative TF pairs is justified by the existence of protein-protein interactions and co-annotation in the same MIPS functional categories. Moreover, we adopted three performance indices to compare our predictions with 11 existing methods' predictions. We show that our method performs better than these 11 existing methods in identifying cooperative TF pairs in yeast. Finally, the cooperative TF network constructed from the 27 predicted cooperative TF pairs shows that our method has the power to find cooperative TF pairs of different biological processes. CONCLUSION: Our method is effective in identifying cooperative TF pairs in yeast. Many of our predictions are validated by the literature, and our method outperforms 11 existing methods. We believe that our study will help biologists to understand the mechanisms of transcriptional regulation in eukaryotic cells.
Asunto(s)
Proteínas Fúngicas/metabolismo , Nucleosomas/metabolismo , Mapeo de Interacción de Proteínas/métodos , Proteínas de Saccharomyces cerevisiae/metabolismo , Saccharomyces cerevisiae/metabolismo , Factores de Transcripción/metabolismo , Algoritmos , Secuencia de Aminoácidos , Sitios de Unión , Bases de Datos de Proteínas , Almacenamiento y Recuperación de la Información , Datos de Secuencia Molecular , Unión Proteica , Análisis de Secuencia de Proteína/métodos , Transcripción Genética/fisiologíaRESUMEN
BACKGROUND: Missing values commonly occur in the microarray data, which usually contain more than 5% missing values with up to 90% of genes affected. Inaccurate missing value estimation results in reducing the power of downstream microarray data analyses. Many types of methods have been developed to estimate missing values. Among them, the regression-based methods are very popular and have been shown to perform better than the other types of methods in many testing microarray datasets. RESULTS: To further improve the performances of the regression-based methods, we propose shrinkage regression-based methods. Our methods take the advantage of the correlation structure in the microarray data and select similar genes for the target gene by Pearson correlation coefficients. Besides, our methods incorporate the least squares principle, utilize a shrinkage estimation approach to adjust the coefficients of the regression model, and then use the new coefficients to estimate missing values. Simulation results show that the proposed methods provide more accurate missing value estimation in six testing microarray datasets than the existing regression-based methods do. CONCLUSIONS: Imputation of missing values is a very important aspect of microarray data analyses because most of the downstream analyses require a complete dataset. Therefore, exploring accurate and efficient methods for estimating missing values has become an essential issue. Since our proposed shrinkage regression-based methods can provide accurate missing value estimation, they are competitive alternatives to the existing regression-based methods.
Asunto(s)
Bioestadística/métodos , Biología Computacional/métodos , Análisis de Secuencia por Matrices de Oligonucleótidos , Perfilación de la Expresión Génica , Análisis de los Mínimos Cuadrados , Análisis de RegresiónRESUMEN
BACKGROUND: Microarray data are usually peppered with missing values due to various reasons. However, most of the downstream analyses for microarray data require complete datasets. Therefore, accurate algorithms for missing value estimation are needed for improving the performance of microarray data analyses. Although many algorithms have been developed, there are many debates on the selection of the optimal algorithm. The studies about the performance comparison of different algorithms are still incomprehensive, especially in the number of benchmark datasets used, the number of algorithms compared, the rounds of simulation conducted, and the performance measures used. RESULTS: In this paper, we performed a comprehensive comparison by using (I) thirteen datasets, (II) nine algorithms, (III) 110 independent runs of simulation, and (IV) three types of measures to evaluate the performance of each imputation algorithm fairly. First, the effects of different types of microarray datasets on the performance of each imputation algorithm were evaluated. Second, we discussed whether the datasets from different species have different impact on the performance of different algorithms. To assess the performance of each algorithm fairly, all evaluations were performed using three types of measures. Our results indicate that the performance of an imputation algorithm mainly depends on the type of a dataset but not on the species where the samples come from. In addition to the statistical measure, two other measures with biological meanings are useful to reflect the impact of missing value imputation on the downstream data analyses. Our study suggests that local-least-squares-based methods are good choices to handle missing values for most of the microarray datasets. CONCLUSIONS: In this work, we carried out a comprehensive comparison of the algorithms for microarray missing value imputation. Based on such a comprehensive comparison, researchers could choose the optimal algorithm for their datasets easily. Moreover, new imputation algorithms could be compared with the existing algorithms using this comparison strategy as a standard protocol. In addition, to assist researchers in dealing with missing values easily, we built a web-based and easy-to-use imputation tool, MissVIA (http://cosbi.ee.ncku.edu.tw/MissVIA), which supports many imputation algorithms. Once users upload a real microarray dataset and choose the imputation algorithms, MissVIA will determine the optimal algorithm for the users' data through a series of simulations, and then the imputed results can be downloaded for the downstream data analyses.
Asunto(s)
Algoritmos , Bioestadística/métodos , Biología Computacional/métodos , Internet , Análisis de Secuencia por Matrices de Oligonucleótidos , Humanos , Modelos Estadísticos , Saccharomyces cerevisiae/genética , Factores de TiempoRESUMEN
Transcription factor binding site (TFBS) identification plays an important role in deciphering gene regulatory codes. With comprehensive knowledge of TFBSs, one can understand molecular mechanisms of gene regulation. In the recent decades, various computational approaches have been proposed to predict TFBSs in the genome. The TFBS dataset of a TF generated by each algorithm is a ranked list of predicted TFBSs of that TF, where top ranked TFBSs are statistically significant ones. However, whether these statistically significant TFBSs are functional (i.e. biologically relevant) is still unknown. Here we develop a post-processor, called the functional propensity calculator (FPC), to assign a functional propensity to each TFBS in the existing computationally predicted TFBS datasets. It is known that functional TFBSs reveal strong positional preference towards the transcriptional start site (TSS). This motivates us to take TFBS position relative to the TSS as the key idea in building our FPC. Based on our calculated functional propensities, the TFBSs of a TF in the original TFBS dataset could be reordered, where top ranked TFBSs are now the ones with high functional propensities. To validate the biological significance of our results, we perform three published statistical tests to assess the enrichment of Gene Ontology (GO) terms, the enrichment of physical protein-protein interactions, and the tendency of being co-expressed. The top ranked TFBSs in our reordered TFBS dataset outperform the top ranked TFBSs in the original TFBS dataset, justifying the effectiveness of our post-processor in extracting functional TFBSs from the original TFBS dataset. More importantly, assigning functional propensities to putative TFBSs enables biologists to easily identify which TFBSs in the promoter of interest are likely to be biologically relevant and are good candidates to do further detailed experimental investigation. The FPC is implemented as a web tool at http://santiago.ee.ncku.edu.tw/FPC/.