Results 1 - 9 of 9
1.
Nat Biotechnol ; 35(4): 350-353, 2017 04.
Article in English | MEDLINE | ID: mdl-28263295

ABSTRACT

We present SplashRNA, a sequential classifier to predict potent microRNA-based short hairpin RNAs (shRNAs). Trained on published and novel data sets, SplashRNA outperforms previous algorithms and reliably predicts the most efficient shRNAs for a given gene. Combined with an optimized miR-E backbone, >90% of high-scoring SplashRNA predictions trigger >85% protein knockdown when expressed from a single genomic integration. SplashRNA can significantly improve the accuracy of loss-of-function genetics studies and facilitates the generation of compact shRNA libraries.


Subject(s)
Algorithms; Clustered Regularly Interspaced Short Palindromic Repeats/genetics; Gene Silencing; Machine Learning; RNA, Small Interfering/genetics; Software; CRISPR-Cas Systems/genetics; Chromosome Mapping/methods; Sequence Analysis, RNA/methods
2.
Bioinformatics ; 33(1): 139-141, 2017 01 01.
Article in English | MEDLINE | ID: mdl-27634950

ABSTRACT

MOTIVATION: Deep-sequencing-based ribosome footprint profiling can provide novel insights into the regulatory mechanisms of protein translation. However, the observed ribosome profile is fundamentally confounded by transcriptional activity. In order to decipher principles of translation regulation, tools that can reliably detect changes in translation efficiency in case-control studies are needed. RESULTS: We present a statistical framework and an analysis tool, RiboDiff, to detect genes with changes in translation efficiency across experimental treatments. RiboDiff uses generalized linear models to estimate the over-dispersion of RNA-Seq and ribosome profiling measurements separately, and performs a statistical test for differential translation efficiency using both mRNA abundance and ribosome occupancy. AVAILABILITY AND IMPLEMENTATION: RiboDiff webpage: http://bioweb.me/ribodiff. Source code, including scripts for preprocessing the FASTQ data, is available at http://github.com/ratschlab/ribodiff. CONTACTS: zhongy@cbio.mskcc.org or raetsch@inf.ethz.ch. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
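
The quantity tested in the abstract above, translation efficiency, is commonly summarized as the ratio of ribosome-footprint signal to mRNA signal for a gene. The following is a minimal sketch of that ratio and its change between two conditions, not RiboDiff's generalized linear model or its statistical test. The function names, the pseudocount, and the assumption that counts are already normalized for library size are all illustrative choices, not taken from the source.

```python
import math


def translation_efficiency(ribo_count, rna_count, pseudocount=1.0):
    """Ratio of ribosome-footprint signal to mRNA signal for one gene.

    Assumes both counts are already normalized for library size.
    The pseudocount (an illustrative choice) avoids division by zero.
    """
    return (ribo_count + pseudocount) / (rna_count + pseudocount)


def log2_te_change(ribo_ctrl, rna_ctrl, ribo_treat, rna_treat, pseudocount=1.0):
    """log2 fold change in translation efficiency, treatment versus control."""
    te_ctrl = translation_efficiency(ribo_ctrl, rna_ctrl, pseudocount)
    te_treat = translation_efficiency(ribo_treat, rna_treat, pseudocount)
    return math.log2(te_treat / te_ctrl)
```

If footprints and mRNA both double, the ratio is unchanged and the log2 change is zero, which is why a raw increase in ribosome occupancy alone does not indicate a change in translation efficiency.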


Subject(s)
Protein Biosynthesis; RNA, Messenger/metabolism; Ribosomes/metabolism; Sequence Analysis, RNA/methods; Software; Gene Expression Regulation; High-Throughput Nucleotide Sequencing/methods
3.
Bioinformatics ; 30(9): 1300-1, 2014 May 01.
Article in English | MEDLINE | ID: mdl-24413671

ABSTRACT

We present Oqtans, an open-source workbench for quantitative transcriptome analysis, that is integrated in Galaxy. Its distinguishing features include customizable computational workflows and a modular pipeline architecture that facilitates comparative assessment of tool and data quality. Oqtans integrates an assortment of machine learning-powered tools into Galaxy, which show superior or equal performance to state-of-the-art tools. Implemented tools comprise a complete transcriptome analysis workflow: short-read alignment, transcript identification/quantification and differential expression analysis. Oqtans and Galaxy facilitate persistent storage, data exchange and documentation of intermediate results and analysis workflows. We illustrate how Oqtans aids the interpretation of data from different experiments in easy to understand use cases. Users can easily create their own workflows and extend Oqtans by integrating specific tools. Oqtans is available as (i) a cloud machine image with a demo instance at cloud.oqtans.org, (ii) a public Galaxy instance at galaxy.cbio.mskcc.org, (iii) a git repository containing all installed software (oqtans.org/git); most of which is also available from (iv) the Galaxy Toolshed and (v) a share string to use along with Galaxy CloudMan.


Subject(s)
RNA/genetics; Sequence Analysis, RNA/methods; Transcriptome; Base Sequence; Internet; Software
4.
Bioinformatics ; 29(20): 2529-38, 2013 Oct 15.
Article in English | MEDLINE | ID: mdl-23980025

ABSTRACT

MOTIVATION: High-throughput sequencing of mRNA (RNA-Seq) has led to tremendous improvements in the detection of expressed genes and reconstruction of RNA transcripts. However, the extensive dynamic range of gene expression, technical limitations and biases, as well as the observed complexity of the transcriptional landscape, pose profound computational challenges for transcriptome reconstruction. RESULTS: We present the novel framework MITIE (Mixed Integer Transcript IdEntification) for simultaneous transcript reconstruction and quantification. We define a likelihood function based on the negative binomial distribution, use a regularization approach to select a few transcripts collectively explaining the observed read data and show how to find the optimal solution using Mixed Integer Programming. MITIE can (i) take advantage of known transcripts, (ii) reconstruct and quantify transcripts simultaneously in multiple samples, and (iii) resolve the location of multi-mapping reads. It is designed for genome- and assembly-based transcriptome reconstruction. We present an extensive study based on realistic simulated RNA-Seq data. When compared with state-of-the-art approaches, MITIE proves to be significantly more sensitive and overall more accurate. Moreover, MITIE yields substantial performance gains when used with multiple samples. We applied our system to 38 Drosophila melanogaster modENCODE RNA-Seq libraries and estimated the sensitivity of reconstructing omitted transcript annotations and the specificity with respect to annotated transcripts. Our results corroborate that a well-motivated objective paired with appropriate optimization techniques leads to significant improvements over the state-of-the-art in transcriptome reconstruction. AVAILABILITY: MITIE is implemented in C++ and is available from http://bioweb.me/mitie under the GPL license.
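
The abstract above combines a negative binomial likelihood with a regularizer that favors few transcripts. The following is a minimal sketch of what evaluating such a penalized objective could look like for one candidate solution. It is not MITIE's actual formulation and does not perform the mixed integer optimization. The segment-by-transcript `membership` matrix, the fixed size parameter `r`, the penalty weight `lam`, and the function names are all hypothetical, chosen only for illustration.

```python
import math


def nb_log_pmf(k, mu, r):
    """Log-probability of count k under a negative binomial with mean mu and size r."""
    return (math.lgamma(k + r) - math.lgamma(k + 1) - math.lgamma(r)
            + r * math.log(r / (r + mu)) + k * math.log(mu / (r + mu)))


def penalized_objective(counts, membership, abundances, r=2.0, lam=1.0, eps=1e-9):
    """Negative log-likelihood of segment counts plus a penalty per transcript used.

    counts[s]        observed read count on segment s
    membership[s][t] 1 if segment s is part of transcript t, else 0 (hypothetical layout)
    abundances[t]    candidate abundance of transcript t
    """
    neg_log_lik = 0.0
    for s, k in enumerate(counts):
        # Expected count on a segment: summed abundance of transcripts covering it.
        mu = sum(m * a for m, a in zip(membership[s], abundances))
        neg_log_lik -= nb_log_pmf(k, max(mu, eps), r)
    # Sparsity penalty: number of transcripts with nonzero abundance.
    n_used = sum(1 for a in abundances if a > 0)
    return neg_log_lik + lam * n_used
```

A solver would search over `abundances` to minimize this value; raising `lam` makes each additional transcript more expensive, so fewer transcripts are selected to explain the same reads.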


Subject(s)
High-Throughput Nucleotide Sequencing/methods; RNA/analysis; Sequence Analysis, RNA/methods; Software; Transcription, Genetic; Animals; Drosophila melanogaster; Humans; Internet; RNA/genetics
5.
PLoS Genet ; 8(8): e1002856, 2012.
Article in English | MEDLINE | ID: mdl-22912589

ABSTRACT

Cohesin is a protein complex that forms a ring around sister chromatids thus holding them together. The ring is composed of three proteins: Smc1, Smc3 and Scc1. The roles of three additional proteins that associate with the ring, Scc3, Pds5 and Wpl1, are not well understood. It has been proposed that these three factors form a complex that stabilizes the ring and prevents it from opening. This activity promotes sister chromatid cohesion but at the same time poses an obstacle for the initial entrapment of sister DNAs. This hindrance to cohesion establishment is overcome during DNA replication via acetylation of the Smc3 subunit by the Eco1 acetyltransferase. However, the full mechanistic consequences of Smc3 acetylation remain unknown. In the current work, we test the requirement of Scc3 and Pds5 for the stable association of cohesin with DNA. We investigated the consequences of Scc3 and Pds5 depletion in vivo using degron tagging in budding yeast. The previously described DHFR-based N-terminal degron as well as a novel Eco1-derived C-terminal degron were employed in our study. Scc3 and Pds5 associate with cohesin complexes independently of each other and require the Scc1 "core" subunit for their association with chromosomes. Contrary to previous data for Scc1 downregulation, depletion of either Scc3 or Pds5 had a strong effect on sister chromatid cohesion but not on cohesin binding to DNA. Quantity, stability and genome-wide distribution of cohesin complexes remained mostly unchanged after the depletion of Scc3 and Pds5. Our findings are inconsistent with a previously proposed model that Scc3 and Pds5 are cohesin maintenance factors required for cohesin ring stability or for maintaining its association with DNA. We propose that Scc3 and Pds5 specifically function during cohesion establishment in S phase.


Subject(s)
Cell Cycle Proteins/genetics; Chromosomal Proteins, Non-Histone/genetics; Chromosomes, Fungal; DNA, Fungal/metabolism; Saccharomyces cerevisiae Proteins/genetics; Saccharomyces cerevisiae/genetics; Acetyltransferases/genetics; Acetyltransferases/metabolism; Cell Cycle Proteins/deficiency; Cell Cycle Proteins/metabolism; Chromatids/genetics; Chromatids/metabolism; Chromosomal Proteins, Non-Histone/metabolism; Chromosome Segregation/genetics; DNA, Fungal/genetics; Nuclear Proteins/genetics; Nuclear Proteins/metabolism; S Phase/genetics; Saccharomyces cerevisiae/metabolism; Saccharomyces cerevisiae Proteins/metabolism; Cohesins
6.
Nature ; 477(7365): 419-23, 2011 Aug 28.
Article in English | MEDLINE | ID: mdl-21874022

ABSTRACT

Genetic differences between Arabidopsis thaliana accessions underlie the plant's extensive phenotypic variation, and until now these have been interpreted largely in the context of the annotated reference accession Col-0. Here we report the sequencing, assembly and annotation of the genomes of 18 natural A. thaliana accessions, and their transcriptomes. When assessed on the basis of the reference annotation, one-third of protein-coding genes are predicted to be disrupted in at least one accession. However, re-annotation of each genome revealed that alternative gene models often restore coding potential. Gene expression in seedlings differed for nearly half of expressed genes and was frequently associated with cis variants within 5 kilobases, as were intron retention alternative splicing events. Sequence and expression variation is most pronounced in genes that respond to the biotic environment. Our data further promote evolutionary and functional studies in A. thaliana, especially the MAGIC genetic reference population descended from these accessions.


Subject(s)
Arabidopsis/genetics; Gene Expression Profiling; Gene Expression Regulation, Plant/genetics; Genome, Plant/genetics; Transcription, Genetic/genetics; Arabidopsis/classification; Arabidopsis Proteins/genetics; Base Sequence; Genes, Plant/genetics; Genomics; Haplotypes/genetics; INDEL Mutation/genetics; Molecular Sequence Annotation; Phylogeny; Polymorphism, Single Nucleotide/genetics; Proteome/genetics; Seedlings/genetics; Sequence Analysis, DNA
7.
Genome Res ; 21(2): 325-41, 2011 Feb.
Article in English | MEDLINE | ID: mdl-21177967

ABSTRACT

The C. elegans genome has been completely sequenced, and the developmental anatomy of this model organism is described at single-cell resolution. Here we utilize strategies that exploit this precisely defined architecture to link gene expression to cell type. We obtained RNAs from specific cells and from each developmental stage using tissue-specific promoters to mark cells for isolation by FACS or for mRNA extraction by the mRNA-tagging method. We then generated gene expression profiles of more than 30 different cells and developmental stages using tiling arrays. Machine-learning-based analysis detected transcripts corresponding to established gene models and revealed novel transcriptionally active regions (TARs) in noncoding domains that comprise at least 10% of the total C. elegans genome. Our results show that about 75% of transcripts with detectable expression are differentially expressed among developmental stages and across cell types. Examination of known tissue- and cell-specific transcripts validates these data sets and suggests that newly identified TARs may exercise cell-specific functions. Additionally, we used self-organizing maps to define groups of coregulated transcripts and applied regulatory element analysis to identify known transcription factor- and miRNA-binding sites, as well as novel motifs that likely function to control subsets of these genes. By using cell-specific, whole-genome profiling strategies, we have detected a large number of novel transcripts and produced high-resolution gene expression maps that provide a basis for establishing the roles of individual genes in cellular differentiation.


Subject(s)
Caenorhabditis elegans/genetics; Gene Expression Regulation, Developmental; Animals; Computational Biology; Databases, Genetic; Gene Expression Profiling; Gene Expression Regulation, Developmental/genetics; Male; Meiosis/genetics; Molecular Sequence Data; Oogenesis/genetics; Open Reading Frames/genetics; Transcription, Genetic; Untranslated Regions/genetics; X Chromosome Inactivation/genetics
8.
Curr Protoc Bioinformatics ; Chapter 11: Unit 11.6, 2010 Dec.
Article in English | MEDLINE | ID: mdl-21154708

ABSTRACT

Next-generation sequencing technologies have revolutionized genome and transcriptome sequencing. RNA-Seq experiments are able to generate huge amounts of transcriptome sequence reads at a fraction of the cost of Sanger sequencing. Reads produced by these technologies are relatively short and error prone. To utilize such reads for transcriptome reconstruction and gene-structure identification, one needs to be able to accurately align the sequence reads over intron boundaries. In this unit, we describe PALMapper, a fast and easy-to-use tool that is designed to accurately compute both unspliced and spliced alignments for millions of RNA-Seq reads. It combines the efficient read mapper GenomeMapper with the spliced aligner QPALMA, which exploits read-quality information and predictions of splice sites to improve the alignment accuracy. The PALMapper package is available as a command-line tool running on Unix or Mac OS X systems or through a Web interface based on Galaxy tools.


Subject(s)
Genomics/methods; RNA/chemistry; Sequence Alignment/methods; Sequence Analysis, RNA/methods; Software; Base Sequence; Gene Expression Profiling; Genome; RNA Splicing
9.
Genomics ; 94(1): 48-54, 2009 Jul.
Article in English | MEDLINE | ID: mdl-19285128

ABSTRACT

We have developed AspAlt, a web-based comparative analytical platform for exploring the variations in alternative transcription (AT) events and alternative splicing (AS) events in eukaryotes. AspAlt provides integrated access to 2.1 million AT-AS annotations from 158,876 multi-isoform genes and has the following user-friendly analytical features: (1) advanced graphical display to visualize and analyze AT-AS events in 46 eukaryotic genomes; (2) compare and identify the differences in AT-AS patterns among a group of genes specified by the user or among homologous gene groups; (3) inter-database comparative viewer to analyze the differences in the AT-AS annotations for the same gene among Ensembl, RefSeq and AceView databases; (4) dynamically classify and generate graphical plots of AT-AS events from mRNA annotations submitted by the user; and (5) download genomic AT-AS annotations of 46 eukaryotes in XML and tab-delimited formats. The AspAlt resource is available at http://66.170.16.154/AspAlt.


Subject(s)
Alternative Splicing/genetics; Computational Biology/methods; Databases, Nucleic Acid; Software; Transcription, Genetic; Computer Graphics; Eukaryotic Cells; Internet; RNA, Messenger/genetics; Sequence Alignment