Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
1.
Nucleic Acids Res ; 50(W1): W124-W131, 2022 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-35536253

RESUMO

BioUML (https://www.biouml.org)-is a web-based integrated platform for systems biology and data analysis. It supports visual modelling and construction of hierarchical biological models that allow us to construct the most complex modular models of blood pressure regulation, skeletal muscle metabolism, COVID-19 epidemiology. BioUML has been integrated with git repositories where users can store their models and other data. We have also expanded the capabilities of BioUML for data analysis and visualization of biomedical data: (i) any programs and Jupyter kernels can be plugged into the BioUML platform using Docker technology; (ii) BioUML is integrated with the Galaxy and Galaxy Tool Shed; (iii) BioUML provides two-way integration with R and Python (Jupyter notebooks): scripts can be executed on the BioUML web pages, and BioUML functions can be called from scripts; (iv) using plug-in architecture, specialized viewers and editors can be added. For example, powerful genome browsers as well as viewers for molecular 3D structure are integrated in this way; (v) BioUML supports data analyses using workflows (own format, Galaxy, CWL, BPMN, nextFlow). Using these capabilities, we have initiated a new branch of the BioUML development-u-science-a universal scientific platform that can be configured for specific research requirements.


Assuntos
Modelos Biológicos , Software , Humanos , Biologia Computacional , COVID-19/epidemiologia , Biologia de Sistemas
2.
Nucleic Acids Res ; 49(D1): D104-D111, 2021 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-33231677

RESUMO

The Gene Transcription Regulation Database (GTRD; http://gtrd.biouml.org/) contains uniformly annotated and processed NGS data related to gene transcription regulation: ChIP-seq, ChIP-exo, DNase-seq, MNase-seq, ATAC-seq and RNA-seq. With the latest release, the database has reached a new level of data integration. All cell types (cell lines and tissues) presented in the GTRD were arranged into a dictionary and linked with different ontologies (BRENDA, Cell Ontology, Uberon, Cellosaurus and Experimental Factor Ontology) and with related experiments in specialized databases on transcription regulation (FANTOM5, ENCODE and GTEx). The updated version of the GTRD provides an integrated view of transcription regulation through a dedicated web interface with advanced browsing and search capabilities, an integrated genome browser, and table reports by cell types, transcription factors, and genes of interest.


Assuntos
Bases de Dados Genéticas , Regulação da Expressão Gênica , Genoma , Fatores de Transcrição/genética , Transcrição Gênica , Animais , Linhagem Celular , Drosophila melanogaster/genética , Drosophila melanogaster/metabolismo , Ontologia Genética , Humanos , Internet , Camundongos , Anotação de Sequência Molecular , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Software , Fatores de Transcrição/classificação , Fatores de Transcrição/metabolismo
3.
Nucleic Acids Res ; 47(D1): D100-D105, 2019 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-30445619

RESUMO

The current version of the Gene Transcription Regulation Database (GTRD; http://gtrd.biouml.org) contains information about: (i) transcription factor binding sites (TFBSs) and transcription coactivators identified by ChIP-seq experiments for Homo sapiens, Mus musculus, Rattus norvegicus, Danio rerio, Caenorhabditis elegans, Drosophila melanogaster, Saccharomyces cerevisiae, Schizosaccharomyces pombe and Arabidopsis thaliana; (ii) regions of open chromatin and TFBSs (DNase footprints) identified by DNase-seq; (iii) unmappable regions where TFBSs cannot be identified due to repeats; (iv) potential TFBSs for both human and mouse using position weight matrices from the HOCOMOCO database. Raw ChIP-seq and DNase-seq data were obtained from ENCODE and SRA, and uniformly processed. ChIP-seq peaks were called using four different methods: MACS, SISSRs, GEM and PICS. Moreover, peaks for the same factor and peak calling method, albeit using different experiment conditions (cell line, treatment, etc.), were merged into clusters. To reduce noise, such clusters for different peak calling methods were merged into meta-clusters; these were considered to be non-redundant TFBS sets. Moreover, extended quality control was applied to all ChIP-seq data. Web interface to access GTRD was developed using the BioUML platform. It provides browsing and displaying information, advanced search possibilities and an integrated genome browser.


Assuntos
Bases de Dados Genéticas , Regulação da Expressão Gênica , Transcrição Gênica , Sequenciamento de Cromatina por Imunoprecipitação , Biologia Computacional/métodos , Bases de Dados Genéticas/tendências , Software , Fatores de Transcrição/metabolismo , Interface Usuário-Computador , Navegador
4.
Nucleic Acids Res ; 47(W1): W225-W233, 2019 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-31131402

RESUMO

BioUML (homepage: http://www.biouml.org, main public server: https://ict.biouml.org) is a web-based integrated environment (platform) for systems biology and the analysis of biomedical data generated by omics technologies. The BioUML vision is to provide a computational platform to build virtual cell, virtual physiological human and virtual patient. BioUML spans a comprehensive range of capabilities, including access to biological databases, powerful tools for systems biology (visual modelling, simulation, parameters fitting and analyses), a genome browser, scripting (R, JavaScript) and a workflow engine. Due to integration with the Galaxy platform and R/Bioconductor, BioUML provides powerful possibilities for the analyses of omics data. The plug-in-based architecture allows the user to add new functionalities using plug-ins. To facilitate a user focus on a particular task or database, we have developed several predefined perspectives that display only those web interface elements that are needed for a specific task. To support collaborative work on scientific projects, there is a central authentication and authorization system (https://bio-store.org). The diagram editor enables several remote users to simultaneously edit diagrams.


Assuntos
Bases de Dados Factuais , Internet , Modelos Biológicos , Software , Biologia de Sistemas , Animais , Humanos
5.
PLoS One ; 15(12): e0243332, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-33347457

RESUMO

Creating a complete picture of the regulation of transcription seems to be an urgent task of modern biology. Regulation of transcription is a complex process carried out by transcription factors (TFs) and auxiliary proteins. Over the past decade, ChIP-Seq has become the most common experimental technology studying genome-wide interactions between TFs and DNA. We assessed the transcriptional significance of cell line-specific features using regression analysis of ChIP-Seq datasets from the GTRD database and transcriptional start site (TSS) activities from the FANTOM5 expression atlas. For this purpose, we initially generated a large number of features that were defined as the presence or absence of TFs in different promoter regions around TSSs. Using feature selection and regression analysis, we identified sets of the most important TFs that affect expression activity of TSSs in human cell lines such as HepG2, K562 and HEK293. We demonstrated that some TFs can be classified as repressors and activators depending on their location relative to TSS.


Assuntos
Bases de Dados de Ácidos Nucleicos , Perfilação da Expressão Gênica , Fatores de Transcrição , Transcriptoma , Células HEK293 , Células Hep G2 , Humanos , Células K562 , Fatores de Transcrição/classificação , Fatores de Transcrição/metabolismo
6.
Nucleic Acids Res ; 35(Database issue): D550-6, 2007 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-17202170

RESUMO

Computational modelling of mammalian cell cycle regulation is a challenging task, which requires comprehensive knowledge on many interrelated processes in the cell. We have developed a web-based integrated database on cell cycle regulation in mammals in normal and pathological states (Cyclonet database). It integrates data obtained by 'omics' sciences and chemoinformatics on the basis of systems biology approach. Cyclonet is a specialized resource, which enables researchers working in the field of anticancer drug discovery to analyze the wealth of currently available information in a systematic way. Cyclonet contains information on relevant genes and molecules; diagrams and models of cell cycle regulation and results of their simulation; microarray data on cell cycle and on various types of cancer, information on drug targets and their ligands, as well as extensive bibliography on modelling of cell cycle and cancer-related gene expression data. The Cyclonet database is also accessible through the BioUML workbench, which allows flexible querying, analyzing and editing the data by means of visual modelling. Cyclonet aims to predict promising anticancer targets and their agents by application of Prediction of Activity Spectra for Substances. The Cyclonet database is available at http://cyclonet.biouml.org.


Assuntos
Antineoplásicos/farmacologia , Ciclo Celular , Bases de Dados Factuais , Neoplasias/genética , Neoplasias/metabolismo , Animais , Ciclo Celular/genética , Desenho de Fármacos , Perfilação da Expressão Gênica , Genômica , Humanos , Internet , Mamíferos/genética , Mamíferos/metabolismo , Proteômica , Biologia de Sistemas , Integração de Sistemas , Interface Usuário-Computador
7.
PLoS One ; 14(8): e0221760, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31465497

RESUMO

Chromatin immunoprecipitation followed by sequencing, i.e. ChIP-Seq, is a widely used experimental technology for the identification of functional protein-DNA interactions. Nowadays, such databases as ENCODE, GTRD, ChIP-Atlas and ReMap systematically collect and annotate a large number of ChIP-Seq datasets. Comprehensive control of dataset quality is currently indispensable to select the most reliable data for further analysis. In addition to existing quality control metrics, we have developed two novel metrics that allow to control false positives and false negatives in ChIP-Seq datasets. For this purpose, we have adapted well-known population size estimate for determination of unknown number of genuine transcription factor binding regions. Determination of the proposed metrics was based on overlapping distinct binding sites derived from processing one ChIP-Seq experiment by different peak callers. Moreover, the metrics also can be useful for assessing quality of datasets obtained from processing distinct ChIP-Seq experiments by a given peak caller. We also have shown that these metrics appear to be useful not only for dataset selection but also for comparison of peak callers and identification of site motifs based on ChIP-Seq datasets. The developed algorithm for determination of the false positive control metric and false negative control metric for ChIP-Seq datasets was implemented as a plugin for a BioUML platform: https://ict.biouml.org/bioumlweb/chipseq_analysis.html.


Assuntos
Sequenciamento de Cromatina por Imunoprecipitação , Bases de Dados de Ácidos Nucleicos , Análise de Sequência de DNA , Algoritmos , Área Sob a Curva , Sítios de Ligação , Controle de Qualidade , Curva ROC , Fatores de Transcrição/metabolismo
8.
J Bioinform Comput Biol ; 16(2): 1840013, 2018 04.
Artigo em Inglês | MEDLINE | ID: mdl-29739305

RESUMO

RNA plays an important role in the intracellular cell life and in the organism in general. Besides the well-established protein coding RNAs (messenger RNAs, mRNAs), long non-coding RNAs (lncRNAs) have gained the attention of recent researchers. Although lncRNAs have been classified as non-coding, some authors reported the presence of corresponding sequences in ribosome profiling data (Ribo-seq). Ribo-seq technology is a powerful experimental tool utilized to characterize RNA translation in cell with focus on initiation (harringtonine, lactimidomycin) and elongation (cycloheximide). By exploiting translation starts obtained from the Ribo-seq experiment, we developed a novel position weight matrix model for the prediction of translation starts. This model allowed us to achieve 96% accuracy of discrimination between human mRNAs and lncRNAs. When the same model was used for the prediction of putative ORFs in RNAs, we discovered that the majority of lncRNAs contained only small ORFs ([Formula: see text][Formula: see text]nt) in contrast to mRNAs.


Assuntos
Biologia Computacional/métodos , Proteínas/genética , RNA Longo não Codificante , Regiões 3' não Traduzidas , Regiões 5' não Traduzidas , Algoritmos , Fases de Leitura Aberta , Biossíntese de Proteínas , RNA Mensageiro/genética , Ribossomos/genética , Análise de Sequência de RNA
9.
BMC Bioinformatics ; 8: 56, 2007 Feb 19.
Artigo em Inglês | MEDLINE | ID: mdl-17309789

RESUMO

BACKGROUND: Computational analysis of gene regulatory regions is important for prediction of functions of many uncharacterized genes. With this in mind, search of the target genes for interferon (IFN) induction appears of interest. IFNs are multi-functional cytokines. Their effects are immunomodulatory, antiviral, antibacterial, and antitumor. The interaction of the IFNs with their cell surface receptors produces an activation of several transcription factors. Four regulatory factors, ISGF3, STAT1, IRF1, and NF-kappaB, are essential for the function of the IFN system. The aim of this work is the development of computational approaches for the recognition of DNA binding sites for these factors and computer programs for the prediction of the IFN-inducible regions. RESULTS: We developed computational approaches to the recognition of the binding sites for ISGF3, STAT1, IRF1, and NF-kappaB. Analysis of the distribution of these binding sites demonstrated that the regions -500 upstream of the transcription start site in IFN-inducible genes are enriched in putative binding sites for these transcription factors. Based on selected combinations of the sites whose frequencies were significantly higher than in the other functional gene groups, we developed methods for the prediction of the IFN-inducible promoters and enhancers. We analyzed 1004 sequences of the IFN-inducible genes compiled using microarray data analyses and also about 10,000 human gene sequences from the EPD and RefSeq databases; 74 of 1,664 human genes annotated in EPD were significantly IFN-inducible. CONCLUSION: Analyses of several control datasets demonstrated that the developed methods have a high accuracy of prediction of the IFN-inducible genes. Application of these methods to several datasets suggested that the number of the IFN-inducible genes is approximately 1500-2000 in the human genome.


Assuntos
Mapeamento Cromossômico/métodos , Elementos Facilitadores Genéticos/genética , Interferons/genética , Elementos Reguladores de Transcrição/genética , Análise de Sequência de DNA/métodos , Fatores de Transcrição/genética , Ativação Transcricional/genética , Sequência de Bases , Sítios de Ligação , Dados de Sequência Molecular , Regiões Promotoras Genéticas , Ligação Proteica , Alinhamento de Sequência
10.
J Bioinform Comput Biol ; 14(2): 1641006, 2016 04.
Artigo em Inglês | MEDLINE | ID: mdl-27122318

RESUMO

Ribosome profiling technology (Ribo-Seq) allowed to highlight more details of mRNA translation in cell and get additional information on importance of mRNA sequence features for this process. Application of translation inhibitors like harringtonine and cycloheximide along with mRNA-Seq technique helped to assess such important characteristic as translation efficiency. We assessed the translational importance of features of mRNA sequences with the help of statistical analysis of Ribo-Seq and mRNA-Seq data. Translationally important features known from literature as well as proposed by the authors were used in analysis. Such comparisons as protein coding versus non-coding RNAs and high- versus low-translated mRNAs were performed. We revealed a set of features that allowed to discriminate the compared categories of RNA. Significant relationships between mRNA features and efficiency of translation were also established.


Assuntos
Mamíferos/genética , RNA Mensageiro/genética , Análise de Sequência de RNA/métodos , Regiões 3' não Traduzidas , Regiões 5' não Traduzidas , Animais , Códon de Iniciação , Humanos , Camundongos , Biossíntese de Proteínas , Proteínas/genética , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo , Ribossomos/genética
11.
In Silico Biol ; 8(5-6): 383-411, 2008.
Artigo em Inglês | MEDLINE | ID: mdl-19374127

RESUMO

Albeit the great number of microarray data available on breast cancer, reliable identification of genes associated with breast cancer development remains a challenge. The aim of this work was to develop a novel method of meta-analysis for the identification of differentially expressed genes integrating results of several independent microarray experiments. We developed a statistical method for identification of up- and down-regulated genes to perform meta-analysis. The method takes advantage of hypergeometric and binomial distributions. Using our method we performed meta-analysis of five data sets from independent cDNA-microarray experiments on breast cancer. The meta-analysis revealed that 3.2% and 2.8% of the 24,726 analyzed genes are significantly (P-value < 0.01) down- and up-regulated, respectively. We also show that properly applied meta-analysis is a good tool for comparison of different breast cancer subtypes. Our meta-analysis showed that the expression of the majority of genes does not show significant differences in different subtypes of breast cancer. Here, we report the rationale, development and application of meta-analysis that enable us to identify biologically meaningful features of breast cancer. The algorithm we propose for the meta-analysis can reveal the features specific to the breast cancer subtypes and those common to breast cancer. The results allow us to revise the previously generated lists of genes associated with breast cancer and also identify most promising anticancer drug-target genes.


Assuntos
Neoplasias da Mama/genética , Regulação Neoplásica da Expressão Gênica/genética , Análise de Sequência com Séries de Oligonucleotídeos , Algoritmos , Neoplasias da Mama/classificação , Heterogeneidade Genética , Humanos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA