Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
1.
BMC Genomics ; 22(1): 592, 2021 Aug 04.
Artigo em Inglês | MEDLINE | ID: mdl-34348664

RESUMO

BACKGROUND: Genetic aberrations in hepatocellular carcinoma (HCC) are well known, but the functional consequences of such aberrations remain poorly understood. RESULTS: Here, we explored the effect of defined genetic changes on the transcriptome, proteome and phosphoproteome in twelve tumors from an mTOR-driven hepatocellular carcinoma mouse model. Using Network-based Integration of multi-omiCS data (NetICS), we detected 74 'mediators' that relay via molecular interactions the effects of genetic and miRNA expression changes. The detected mediators account for the effects of oncogenic mTOR signaling on the transcriptome, proteome and phosphoproteome. We confirmed the dysregulation of the mediators YAP1, GRB2, SIRT1, HDAC4 and LIS1 in human HCC. CONCLUSIONS: This study suggests that targeting pathways such as YAP1 or GRB2 signaling and pathways regulating global histone acetylation could be beneficial in treating HCC with hyperactive mTOR signaling.


Assuntos
Carcinoma Hepatocelular , Neoplasias Hepáticas , MicroRNAs , Preparações Farmacêuticas , Carcinoma Hepatocelular/tratamento farmacológico , Carcinoma Hepatocelular/genética , Humanos , Neoplasias Hepáticas/tratamento farmacológico , Neoplasias Hepáticas/genética , Transcriptoma
2.
Bioinformatics ; 34(14): 2441-2448, 2018 07 15.
Artigo em Inglês | MEDLINE | ID: mdl-29547932

RESUMO

Motivation: Several molecular events are known to be cancer-related, including genomic aberrations, hypermethylation of gene promoter regions and differential expression of microRNAs. These aberration events are very heterogeneous across tumors and it is poorly understood how they affect the molecular makeup of the cell, including the transcriptome and proteome. Protein interaction networks can help decode the functional relationship between aberration events and changes in gene and protein expression. Results: We developed NetICS (Network-based Integration of Multi-omics Data), a new graph diffusion-based method for prioritizing cancer genes by integrating diverse molecular data types on a directed functional interaction network. NetICS prioritizes genes by their mediator effect, defined as the proximity of the gene to upstream aberration events and to downstream differentially expressed genes and proteins in an interaction network. Genes are prioritized for individual samples separately and integrated using a robust rank aggregation technique. NetICS provides a comprehensive computational framework that can aid in explaining the heterogeneity of aberration events by their functional convergence to common differentially expressed genes and proteins. We demonstrate NetICS' competitive performance in predicting known cancer genes and in generating robust gene lists using TCGA data from five cancer types. Availability and implementation: NetICS is available at https://github.com/cbg-ethz/netics. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional/métodos , Genes Neoplásicos , Neoplasias/genética , Software , Aberrações Cromossômicas , Metilação de DNA , Regulação da Expressão Gênica , Genômica/métodos , Humanos , MicroRNAs/genética , Mutação , Neoplasias/metabolismo , Proteoma , Transcriptoma
3.
Plant Cell ; 28(11): 2715-2734, 2016 11.
Artigo em Inglês | MEDLINE | ID: mdl-27803310

RESUMO

Plants use light as source of energy and information to detect diurnal rhythms and seasonal changes. Sensing changing light conditions is critical to adjust plant metabolism and to initiate developmental transitions. Here, we analyzed transcriptome-wide alterations in gene expression and alternative splicing (AS) of etiolated seedlings undergoing photomorphogenesis upon exposure to blue, red, or white light. Our analysis revealed massive transcriptome reprogramming as reflected by differential expression of ∼20% of all genes and changes in several hundred AS events. For more than 60% of all regulated AS events, light promoted the production of a presumably protein-coding variant at the expense of an mRNA with nonsense-mediated decay-triggering features. Accordingly, AS of the putative splicing factor REDUCED RED-LIGHT RESPONSES IN CRY1CRY2 BACKGROUND1, previously identified as a red light signaling component, was shifted to the functional variant under light. Downstream analyses of candidate AS events pointed at a role of photoreceptor signaling only in monochromatic but not in white light. Furthermore, we demonstrated similar AS changes upon light exposure and exogenous sugar supply, with a critical involvement of kinase signaling. We propose that AS is an integration point of signaling pathways that sense and transmit information regarding the energy availability in plants.


Assuntos
Processamento Alternativo/fisiologia , Proteínas de Arabidopsis/metabolismo , Arabidopsis/genética , Transcriptoma/genética , Processamento Alternativo/genética , Arabidopsis/fisiologia , Proteínas de Arabidopsis/genética , Regulação da Expressão Gênica de Plantas/genética , Regulação da Expressão Gênica de Plantas/fisiologia , Transdução de Sinais/genética , Transdução de Sinais/fisiologia
4.
BMC Bioinformatics ; 18(1): 8, 2017 Jan 03.
Artigo em Inglês | MEDLINE | ID: mdl-28049408

RESUMO

BACKGROUND: Next-generation sequencing of matched tumor and normal biopsy pairs has become a technology of paramount importance for precision cancer treatment. Sequencing costs have dropped tremendously, allowing the sequencing of the whole exome of tumors for just a fraction of the total treatment costs. However, clinicians and scientists cannot take full advantage of the generated data because the accuracy of analysis pipelines is limited. This particularly concerns the reliable identification of subclonal mutations in a cancer tissue sample with very low frequencies, which may be clinically relevant. RESULTS: Using simulations based on kidney tumor data, we compared the performance of nine state-of-the-art variant callers, namely deepSNV, GATK HaplotypeCaller, GATK UnifiedGenotyper, JointSNVMix2, MuTect, SAMtools, SiNVICT, SomaticSniper, and VarScan2. The comparison was done as a function of variant allele frequencies and coverage. Our analysis revealed that deepSNV and JointSNVMix2 perform very well, especially in the low-frequency range. We attributed false positive and false negative calls of the nine tools to specific error sources and assigned them to processing steps of the pipeline. All of these errors can be expected to occur in real data sets. We found that modifying certain steps of the pipeline or parameters of the tools can lead to substantial improvements in performance. Furthermore, a novel integration strategy that combines the ranks of the variants yielded the best performance. More precisely, the rank-combination of deepSNV, JointSNVMix2, MuTect, SiNVICT and VarScan2 reached a sensitivity of 78% when fixing the precision at 90%, and outperformed all individual tools, where the maximum sensitivity was 71% with the same precision. CONCLUSIONS: The choice of well-performing tools for alignment and variant calling is crucial for the correct interpretation of exome sequencing data obtained from mixed samples, and common pipelines are suboptimal. We were able to relate observed substantial differences in performance to the underlying statistical models of the tools, and to pinpoint the error sources of false positive and false negative calls. These findings might inspire new software developments that improve exome sequencing pipelines and further the field of precision cancer treatment.


Assuntos
Exoma/genética , Neoplasias Renais/genética , Algoritmos , DNA de Neoplasias/química , DNA de Neoplasias/metabolismo , Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Neoplasias Renais/patologia , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNA
5.
Bioinformatics ; 32(5): 770-2, 2016 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-26519503

RESUMO

MOTIVATION: Mapping high-throughput sequencing data to a reference genome is an essential step for most analysis pipelines aiming at the computational analysis of genome and transcriptome sequencing data. Breaking ties between equally well mapping locations poses a severe problem not only during the alignment phase but also has significant impact on the results of downstream analyses. We present the multi-mapper resolution (MMR) tool that infers optimal mapping locations from the coverage density of other mapped reads. RESULTS: Filtering alignments with MMR can significantly improve the performance of downstream analyses like transcript quantitation and differential testing. We illustrate that the accuracy (Spearman correlation) of transcript quantification increases by 15% when using reads of length 51. In addition, MMR decreases the alignment file sizes by more than 50%, and this leads to a reduced running time of the quantification tool. Our efficient implementation of the MMR algorithm is easily applicable as a post-processing step to existing alignment files in BAM format. Its complexity scales linearly with the number of alignments and requires no further inputs. AVAILABILITY AND IMPLEMENTATION: Open source code and documentation are available for download at http://github.com/ratschlab/mmr Comprehensive testing results and further information can be found at http://bioweb.me/mmr. CONTACT: andre.kahles@ratschlab.org or gunnar.ratsch@ratschlab.org SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Software , Algoritmos , Genoma , Sequenciamento de Nucleotídeos em Larga Escala , Alinhamento de Sequência
6.
Nature ; 477(7365): 419-23, 2011 Aug 28.
Artigo em Inglês | MEDLINE | ID: mdl-21874022

RESUMO

Genetic differences between Arabidopsis thaliana accessions underlie the plant's extensive phenotypic variation, and until now these have been interpreted largely in the context of the annotated reference accession Col-0. Here we report the sequencing, assembly and annotation of the genomes of 18 natural A. thaliana accessions, and their transcriptomes. When assessed on the basis of the reference annotation, one-third of protein-coding genes are predicted to be disrupted in at least one accession. However, re-annotation of each genome revealed that alternative gene models often restore coding potential. Gene expression in seedlings differed for nearly half of expressed genes and was frequently associated with cis variants within 5 kilobases, as were intron retention alternative splicing events. Sequence and expression variation is most pronounced in genes that respond to the biotic environment. Our data further promote evolutionary and functional studies in A. thaliana, especially the MAGIC genetic reference population descended from these accessions.


Assuntos
Arabidopsis/genética , Perfilação da Expressão Gênica , Regulação da Expressão Gênica de Plantas/genética , Genoma de Planta/genética , Transcrição Gênica/genética , Arabidopsis/classificação , Proteínas de Arabidopsis/genética , Sequência de Bases , Genes de Plantas/genética , Genômica , Haplótipos/genética , Mutação INDEL/genética , Anotação de Sequência Molecular , Filogenia , Polimorfismo de Nucleotídeo Único/genética , Proteoma/genética , Plântula/genética , Análise de Sequência de DNA
7.
Plant Cell ; 25(10): 3726-42, 2013 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-24163313

RESUMO

The nonsense-mediated decay (NMD) surveillance pathway can recognize erroneous transcripts and physiological mRNAs, such as precursor mRNA alternative splicing (AS) variants. Currently, information on the global extent of coupled AS and NMD remains scarce and even absent for any plant species. To address this, we conducted transcriptome-wide splicing studies using Arabidopsis thaliana mutants in the NMD factor homologs UP FRAMESHIFT1 (UPF1) and UPF3 as well as wild-type samples treated with the translation inhibitor cycloheximide. Our analyses revealed that at least 17.4% of all multi-exon, protein-coding genes produce splicing variants that are targeted by NMD. Moreover, we provide evidence that UPF1 and UPF3 act in a translation-independent mRNA decay pathway. Importantly, 92.3% of the NMD-responsive mRNAs exhibit classical NMD-eliciting features, supporting their authenticity as direct targets. Genes generating NMD-sensitive AS variants function in diverse biological processes, including signaling and protein modification, for which NaCl stress-modulated AS-NMD was found. Besides mRNAs, numerous noncoding RNAs and transcripts derived from intergenic regions were shown to be NMD responsive. In summary, we provide evidence for a major function of AS-coupled NMD in shaping the Arabidopsis transcriptome, having fundamental implications in gene regulation and quality control of transcript processing.


Assuntos
Processamento Alternativo , Arabidopsis/genética , Degradação do RNAm Mediada por Códon sem Sentido , Transcriptoma , Proteínas de Arabidopsis/genética , Regulação da Expressão Gênica de Plantas , Genótipo , Mutação , RNA Helicases/genética , RNA de Plantas/genética , Análise de Sequência de RNA
8.
Bioinformatics ; 29(20): 2529-38, 2013 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-23980025

RESUMO

MOTIVATION: High-throughput sequencing of mRNA (RNA-Seq) has led to tremendous improvements in the detection of expressed genes and reconstruction of RNA transcripts. However, the extensive dynamic range of gene expression, technical limitations and biases, as well as the observed complexity of the transcriptional landscape, pose profound computational challenges for transcriptome reconstruction. RESULTS: We present the novel framework MITIE (Mixed Integer Transcript IdEntification) for simultaneous transcript reconstruction and quantification. We define a likelihood function based on the negative binomial distribution, use a regularization approach to select a few transcripts collectively explaining the observed read data and show how to find the optimal solution using Mixed Integer Programming. MITIE can (i) take advantage of known transcripts, (ii) reconstruct and quantify transcripts simultaneously in multiple samples, and (iii) resolve the location of multi-mapping reads. It is designed for genome- and assembly-based transcriptome reconstruction. We present an extensive study based on realistic simulated RNA-Seq data. When compared with state-of-the-art approaches, MITIE proves to be significantly more sensitive and overall more accurate. Moreover, MITIE yields substantial performance gains when used with multiple samples. We applied our system to 38 Drosophila melanogaster modENCODE RNA-Seq libraries and estimated the sensitivity of reconstructing omitted transcript annotations and the specificity with respect to annotated transcripts. Our results corroborate that a well-motivated objective paired with appropriate optimization techniques lead to significant improvements over the state-of-the-art in transcriptome reconstruction. AVAILABILITY: MITIE is implemented in C++ and is available from http://bioweb.me/mitie under the GPL license.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , RNA/análise , Análise de Sequência de RNA/métodos , Software , Transcrição Gênica , Animais , Drosophila melanogaster , Humanos , Internet , RNA/genética
9.
Genome Res ; 19(11): 2133-43, 2009 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-19564452

RESUMO

We present a highly accurate gene-prediction system for eukaryotic genomes, called mGene. It combines in an unprecedented manner the flexibility of generalized hidden Markov models (gHMMs) with the predictive power of modern machine learning methods, such as Support Vector Machines (SVMs). Its excellent performance was proved in an objective competition based on the genome of the nematode Caenorhabditis elegans. Considering the average of sensitivity and specificity, the developmental version of mGene exhibited the best prediction performance on nucleotide, exon, and transcript level for ab initio and multiple-genome gene-prediction tasks. The fully developed version shows superior performance in 10 out of 12 evaluation criteria compared with the other participating gene finders, including Fgenesh++ and Augustus. An in-depth analysis of mGene's genome-wide predictions revealed that approximately 2200 predicted genes were not contained in the current genome annotation. Testing a subset of 57 of these genes by RT-PCR and sequencing, we confirmed expression for 24 (42%) of them. mGene missed 300 annotated genes, out of which 205 were unconfirmed. RT-PCR testing of 24 of these genes resulted in a success rate of merely 8%. These findings suggest that even the gene catalog of a well-studied organism such as C. elegans can be substantially improved by mGene's predictions. We also provide gene predictions for the four nematodes C. briggsae, C. brenneri, C. japonica, and C. remanei. Comparing the resulting proteomes among these organisms and to the known protein universe, we identified many species-specific gene inventions. In a quality assessment of several available annotations for these genomes, we find that mGene's predictions are most accurate.


Assuntos
Algoritmos , Caenorhabditis elegans/genética , Biologia Computacional/métodos , Genoma Helmíntico/genética , Animais , Inteligência Artificial , Caenorhabditis/classificação , Caenorhabditis/genética , Genes de Helmintos/genética , Genômica/métodos , Sítios de Splice de RNA , Reprodutibilidade dos Testes , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Análise de Sequência de DNA , Sítio de Iniciação de Transcrição
10.
Nucleic Acids Res ; 37(Web Server issue): W312-6, 2009 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-19494180

RESUMO

We describe mGene.web, a web service for the genome-wide prediction of protein coding genes from eukaryotic DNA sequences. It offers pre-trained models for the recognition of gene structures including untranslated regions in an increasing number of organisms. With mGene.web, users have the additional possibility to train the system with their own data for other organisms on the push of a button, a functionality that will greatly accelerate the annotation of newly sequenced genomes. The system is built in a highly modular way, such that individual components of the framework, like the promoter prediction tool or the splice site predictor, can be used autonomously. The underlying gene finding system mGene is based on discriminative machine learning techniques and its high accuracy has been demonstrated in an international competition on nematode genomes. mGene.web is available at http://www.mgene.org/web, it is free of charge and can be used for eukaryotic genomes of small to moderate size (several hundred Mbp).


Assuntos
Genes , Genômica , Proteínas/genética , Software , Internet , Sítios de Splice de RNA , Análise de Sequência de DNA , Sítio de Iniciação de Transcrição
11.
Nat Commun ; 9(1): 4353, 2018 10 19.
Artigo em Inglês | MEDLINE | ID: mdl-30341300

RESUMO

Large-scale genomic data highlight the complexity and diversity of the molecular changes that drive cancer progression. Statistical analysis of cancer data from different tissues can guide drug repositioning as well as the design of targeted treatments. Here, we develop an improved Bayesian network model for tumour mutational profiles and apply it to 8198 patient samples across 22 cancer types from TCGA. For each cancer type, we identify the interactions between mutated genes, capturing signatures beyond mere mutational frequencies. When comparing mutation networks, we find genes which interact both within and across cancer types. To detach cancer classification from the tissue type we perform de novo clustering of the pancancer mutational profiles based on the Bayesian network models. We find 22 novel clusters which significantly improve survival prediction beyond clinical information. The models highlight key gene interactions for each cluster potentially allowing genomic stratification for clinical trials and identifying drug targets.


Assuntos
Mutação , Neoplasias/genética , Teorema de Bayes , Análise por Conglomerados , Análise Mutacional de DNA , Humanos , Modelos Genéticos
12.
BMC Bioinformatics ; 8 Suppl 10: S7, 2007.
Artigo em Inglês | MEDLINE | ID: mdl-18269701

RESUMO

BACKGROUND: For splice site recognition, one has to solve two classification problems: discriminating true from decoy splice sites for both acceptor and donor sites. Gene finding systems typically rely on Markov Chains to solve these tasks. RESULTS: In this work we consider Support Vector Machines for splice site recognition. We employ the so-called weighted degree kernel which turns out well suited for this task, as we will illustrate in several experiments where we compare its prediction accuracy with that of recently proposed systems. We apply our method to the genome-wide recognition of splice sites in Caenorhabditis elegans, Drosophila melanogaster, Arabidopsis thaliana, Danio rerio, and Homo sapiens. Our performance estimates indicate that splice sites can be recognized very accurately in these genomes and that our method outperforms many other methods including Markov Chains, GeneSplicer and SpliceMachine. We provide genome-wide predictions of splice sites and a stand-alone prediction tool ready to be used for incorporation in a gene finder. AVAILABILITY: Data, splits, additional information on the model selection, the whole genome predictions, as well as the stand-alone prediction tool are available for download at http://www.fml.mpg.de/raetsch/projects/splice.


Assuntos
Sítios de Splice de RNA/genética , Algoritmos , Animais , Arabidopsis/genética , Brassicaceae/genética , Caenorhabditis elegans/genética , Drosophila melanogaster/genética , Previsões/métodos , Genômica/métodos , Humanos , Cadeias de Markov , Peixe-Zebra/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA