Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Nucleic Acids Res ; 47(9): e53, 2019 05 21.
Artigo em Inglês | MEDLINE | ID: mdl-30820547

RESUMO

We present a novel approach to identify human microRNA (miRNA) regulatory modules (mRNA targets and relevant cell conditions) by biclustering a large collection of mRNA fold-change data for sequence-specific targets. Bicluster targets were assessed using validated messenger RNA (mRNA) targets and exhibited on an average 17.0% (median 19.4%) improved gain in certainty (sensitivity + specificity). The net gain was further increased up to 32.0% (median 33.4%) by incorporating functional networks of targets. We analyzed cancer-specific biclusters and found that the PI3K/Akt signaling pathway is strongly enriched with targets of a few miRNAs in breast cancer and diffuse large B-cell lymphoma. Indeed, five independent prognostic miRNAs were identified, and repression of bicluster targets and pathway activity by miR-29 was experimentally validated. In total, 29 898 biclusters for 459 human miRNAs were collected in the BiMIR database where biclusters are searchable for miRNAs, tissues, diseases, keywords and target genes.


Assuntos
Big Data , Perfilação da Expressão Gênica/métodos , Redes Reguladoras de Genes/genética , MicroRNAs/genética , Neoplasias da Mama/genética , Neoplasias da Mama/patologia , Bases de Dados Genéticas , Feminino , Regulação Neoplásica da Expressão Gênica/genética , Humanos , Linfoma Difuso de Grandes Células B/genética , Linfoma Difuso de Grandes Células B/patologia , Fosfatidilinositol 3-Quinases/genética , Prognóstico , Proteínas Proto-Oncogênicas c-akt/genética , Transdução de Sinais/genética , Transcriptoma/genética
2.
BMC Genomics ; 20(1): 352, 2019 May 09.
Artigo em Inglês | MEDLINE | ID: mdl-31072324

RESUMO

BACKGROUND: Gene-set analysis (GSA) has been commonly used to identify significantly altered pathways or functions from omics data. However, GSA often yields a long list of gene-sets, necessitating efficient post-processing for improved interpretation. Existing methods cluster the gene-sets based on the extent of their overlap to summarize the GSA results without considering interactions between gene-sets. RESULTS: Here, we presented a novel network-weighted gene-set clustering that incorporates both the gene-set overlap and protein-protein interaction (PPI) networks. Three examples were demonstrated for microarray gene expression, GWAS summary, and RNA-sequencing data to which different GSA methods were applied. These examples as well as a global analysis show that the proposed method increases PPI densities and functional relevance of the resulting clusters. Additionally, distinct properties of gene-set distance measures were compared. The methods are implemented as an R/Shiny package GScluster that provides gene-set clustering and diverse functions for visualization of gene-sets and PPI networks. CONCLUSIONS: Network-weighted gene-set clustering provides functionally more relevant gene-set clusters and related network analysis.


Assuntos
Perfilação da Expressão Gênica/métodos , Redes Reguladoras de Genes , Mapeamento de Interação de Proteínas/métodos , Software , Algoritmos , Animais , Diabetes Mellitus Tipo 2/genética , Regulação da Expressão Gênica , Humanos , Neoplasias/genética
3.
BMC Genomics ; 15: 450, 2014 Jun 09.
Artigo em Inglês | MEDLINE | ID: mdl-24912499

RESUMO

BACKGROUND: Genome-wide expression profiles reflect the transcriptional networks specific to the given cell context. However, most statistical models try to estimate the average connectivity of the networks from a collection of gene expression data, and are unable to characterize the context-specific transcriptional regulations. We propose an approach for mining context-specific transcription networks from a large collection of gene expression fold-change profiles and composite gene-set information. RESULTS: Using a composite gene-set analysis method, we combine the information of transcription factor binding sites, Gene Ontology or pathway gene sets and gene expression fold-change profiles for a variety of cell conditions. We then collected all the significant patterns and constructed a database of context-specific transcription networks for human (REGNET). As a result, context-specific roles of transcription factors as well as their functional targets are readily explored. To validate the approach, nine predicted targets of E2F1 in HeLa cells were tested using chromatin immunoprecipitation assay. Among them, five (Gadd45b, Dusp6, Mll5, Bmp2 and E2f3) were successfully bound by E2F1. c-JUN and the EMT transcription networks were also validated from literature. CONCLUSIONS: REGNET is a useful tool for exploring the ternary relationships among the transcription factors, their functional targets and the corresponding cell conditions. It is able to provide useful clues for novel cell-specific transcriptional regulations. The REGNET database is available at http://mgrc.kribb.re.kr/regnet.


Assuntos
Biologia Computacional/métodos , Redes Reguladoras de Genes , Fatores de Transcrição/metabolismo , Sítios de Ligação , Bases de Dados Genéticas , Expressão Gênica , Ontologia Genética , Genoma Humano , Células HeLa , Humanos , Reprodutibilidade dos Testes , Software
4.
Bioinformatics ; 28(7): 1028-30, 2012 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-22296788

RESUMO

SUMMARY: We present an accurate and fast web server, WegoLoc for predicting subcellular localization of proteins based on sequence similarity and weighted Gene Ontology (GO) information. A term weighting method in the text categorization process is applied to GO terms for a support vector machine classifier. As a result, WegoLoc surpasses the state-of-the-art methods for previously used test datasets. WegoLoc supports three eukaryotic kingdoms (animals, fungi and plants) and provides human-specific analysis, and covers several sets of cellular locations. In addition, WegoLoc provides (i) multiple possible localizations of input protein(s) as well as their corresponding probability scores, (ii) weights of GO terms representing the contribution of each GO term in the prediction, and (iii) a BLAST E-value for the best hit with GO terms. If the similarity score does not meet a given threshold, an amino acid composition-based prediction is applied as a backup method. AVAILABILITY: WegoLoc and User's guide are freely available at the website http://www.btool.org/WegoLoc CONTACT: smchiks@ks.ac.kr; dougnam@unist.ac.kr SUPPLEMENTARY INFORMATION: Supplementary data is available at http://www.btool.org/WegoLoc.


Assuntos
Biologia Computacional/métodos , Bases de Dados de Proteínas , Proteínas/metabolismo , Software , Vocabulário Controlado , Algoritmos , Aminoácidos , Animais , Humanos , Internet , Máquina de Vetores de Suporte
5.
Nucleic Acids Res ; 39(Web Server issue): W302-6, 2011 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-21624890

RESUMO

ADGO 2.0 is a web-based tool that provides composite interpretations for microarray data comparing two sample groups as well as lists of genes from diverse sources of biological information. Some other tools also incorporate composite annotations solely for interpreting lists of genes but usually provide highly redundant information. This new version has the following additional features: first, it provides multiple gene set analysis methods for microarray inputs as well as enrichment analyses for lists of genes. Second, it screens redundant composite annotations when generating and prioritizing them. Third, it incorporates union and subtracted sets as well as intersection sets. Lastly, users can upload their own gene sets (e.g. predicted miRNA targets) to generate and analyze new composite sets. The first two features are unique to ADGO 2.0. Using our tool, we demonstrate analyses of a microarray dataset and a list of genes for T-cell differentiation. The new ADGO is available at http://www.btool.org/ADGO2.


Assuntos
Perfilação da Expressão Gênica/métodos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Software , Animais , Diferenciação Celular , Genes , Humanos , Internet , Camundongos , Anotação de Sequência Molecular , Ratos , Linfócitos T/citologia , Linfócitos T/metabolismo
6.
Biochem Biophys Res Commun ; 399(3): 402-5, 2010 Aug 27.
Artigo em Inglês | MEDLINE | ID: mdl-20678488

RESUMO

We develop a new weighting approach of gene ontology (GO) terms for predicting protein subcellular localization. The weights of individual GO terms, corresponding to their contribution to the prediction algorithm, are determined by the term-weighting methods used in text categorization. We evaluate several term-weighting methods, which are based on inverse document frequency, information gain, gain ratio, odds ratio, and chi-square and its variants. Additionally, we propose a new term-weighting method based on the logarithmic transformation of chi-square. The proposed term-weighting method performs better than other term-weighting methods, and also outperforms state-of-the-art subcellular prediction methods. Our proposed method achieves 98.1%, 99.3%, 98.1%, 98.1%, and 95.9% overall accuracies for the animal BaCelLo independent dataset (IDS), fungal BaCelLo IDS, animal Höglund IDS, fungal Höglund IDS, and PLOC dataset, respectively. Furthermore, the close correlation between high-weighted GO terms and subcellular localizations suggests that our proposed method appropriately weights GO terms according to their relevance to the localizations.


Assuntos
Algoritmos , Espaço Intracelular/metabolismo , Computação Matemática , Proteínas/metabolismo , Proteínas/genética
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa