Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
1.
Nat Methods ; 14(6): 584-586, 2017 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-28418000

RESUMEN

The normalization of RNA-seq data is essential for accurate downstream inference, but the assumptions upon which most normalization methods are based are not applicable in the single-cell setting. Consequently, applying existing normalization methods to single-cell RNA-seq data introduces artifacts that bias downstream analyses. To address this, we introduce SCnorm for accurate and efficient normalization of single-cell RNA-seq data.


Asunto(s)
Algoritmos , Secuenciación de Nucleótidos de Alto Rendimiento/normas , ARN/genética , Análisis de Secuencia de ARN/normas , Análisis de la Célula Individual/normas , Transcriptoma/genética , Interpretación Estadística de Datos , Valores de Referencia , Programas Informáticos
2.
Nat Methods ; 12(10): 947-950, 2015 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-26301841

RESUMEN

Oscillatory gene expression is fundamental to development, but technologies for monitoring expression oscillations are limited. We have developed a statistical approach called Oscope to identify and characterize the transcriptional dynamics of oscillating genes in single-cell RNA-seq data from an unsynchronized cell population. Applying Oscope to a number of data sets, we demonstrated its utility and also identified a potential artifact in the Fluidigm C1 platform.


Asunto(s)
Interpretación Estadística de Datos , Modelos Genéticos , Análisis de Secuencia de ARN/métodos , Análisis de la Célula Individual/métodos , Algoritmos , Análisis de Varianza , Células Madre Embrionarias/fisiología , Perfilación de la Expresión Génica/métodos , Perfilación de la Expresión Génica/estadística & datos numéricos , Humanos , Reacción en Cadena en Tiempo Real de la Polimerasa/métodos , Análisis de Secuencia de ARN/estadística & datos numéricos , Análisis de la Célula Individual/estadística & datos numéricos , Programas Informáticos
3.
Bioinformatics ; 29(8): 1035-43, 2013 Apr 15.
Artículo en Inglés | MEDLINE | ID: mdl-23428641

RESUMEN

MOTIVATION: Messenger RNA expression is important in normal development and differentiation, as well as in manifestation of disease. RNA-seq experiments allow for the identification of differentially expressed (DE) genes and their corresponding isoforms on a genome-wide scale. However, statistical methods are required to ensure that accurate identifications are made. A number of methods exist for identifying DE genes, but far fewer are available for identifying DE isoforms. When isoform DE is of interest, investigators often apply gene-level (count-based) methods directly to estimates of isoform counts. Doing so is not recommended. In short, estimating isoform expression is relatively straightforward for some groups of isoforms, but more challenging for others. This results in estimation uncertainty that varies across isoform groups. Count-based methods were not designed to accommodate this varying uncertainty, and consequently, application of them for isoform inference results in reduced power for some classes of isoforms and increased false discoveries for others. RESULTS: Taking advantage of the merits of empirical Bayesian methods, we have developed EBSeq for identifying DE isoforms in an RNA-seq experiment comparing two or more biological conditions. Results demonstrate substantially improved power and performance of EBSeq for identifying DE isoforms. EBSeq also proves to be a robust approach for identifying DE genes. AVAILABILITY AND IMPLEMENTATION: An R package containing examples and sample datasets is available at http://www.biostat.wisc.edu/kendzior/EBSEQ/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Perfilación de la Expresión Génica/métodos , Isoformas de ARN/metabolismo , Análisis de Secuencia de ARN/métodos , Teorema de Bayes , Línea Celular , Células Madre Embrionarias/metabolismo , Genoma , Modelos Estadísticos , ARN Mensajero/metabolismo , Programas Informáticos
4.
Bioinformatics ; 26(4): 493-500, 2010 Feb 15.
Artículo en Inglés | MEDLINE | ID: mdl-20022975

RESUMEN

MOTIVATION: RNA-Seq is a promising new technology for accurately measuring gene expression levels. Expression estimation with RNA-Seq requires the mapping of relatively short sequencing reads to a reference genome or transcript set. Because reads are generally shorter than transcripts from which they are derived, a single read may map to multiple genes and isoforms, complicating expression analyses. Previous computational methods either discard reads that map to multiple locations or allocate them to genes heuristically. RESULTS: We present a generative statistical model and associated inference methods that handle read mapping uncertainty in a principled manner. Through simulations parameterized by real RNA-Seq data, we show that our method is more accurate than previous methods. Our improved accuracy is the result of handling read mapping uncertainty with a statistical model and the estimation of gene expression levels as the sum of isoform expression levels. Unlike previous methods, our method is capable of modeling non-uniform read distributions. Simulations with our method indicate that a read length of 20-25 bases is optimal for gene-level expression estimation from mouse and maize RNA-Seq data when sequencing throughput is fixed.


Asunto(s)
Expresión Génica , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Algoritmos , Animales , Secuencia de Bases , Biología Computacional/métodos , Bases de Datos Genéticas , Perfilación de la Expresión Génica , Genoma , Ratones , Zea mays/genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA