Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 4 de 4
Filter
1.
Nat Methods ; 14(6): 584-586, 2017 Jun.
Article in English | MEDLINE | ID: mdl-28418000

ABSTRACT

The normalization of RNA-seq data is essential for accurate downstream inference, but the assumptions upon which most normalization methods are based are not applicable in the single-cell setting. Consequently, applying existing normalization methods to single-cell RNA-seq data introduces artifacts that bias downstream analyses. To address this, we introduce SCnorm for accurate and efficient normalization of single-cell RNA-seq data.


Subject(s)
Algorithms , High-Throughput Nucleotide Sequencing/standards , RNA/genetics , Sequence Analysis, RNA/standards , Single-Cell Analysis/standards , Transcriptome/genetics , Data Interpretation, Statistical , Reference Values , Software
2.
Nat Methods ; 12(10): 947-950, 2015 Oct.
Article in English | MEDLINE | ID: mdl-26301841

ABSTRACT

Oscillatory gene expression is fundamental to development, but technologies for monitoring expression oscillations are limited. We have developed a statistical approach called Oscope to identify and characterize the transcriptional dynamics of oscillating genes in single-cell RNA-seq data from an unsynchronized cell population. Applying Oscope to a number of data sets, we demonstrated its utility and also identified a potential artifact in the Fluidigm C1 platform.


Subject(s)
Data Interpretation, Statistical , Models, Genetic , Sequence Analysis, RNA/methods , Single-Cell Analysis/methods , Algorithms , Analysis of Variance , Embryonic Stem Cells/physiology , Gene Expression Profiling/methods , Gene Expression Profiling/statistics & numerical data , Humans , Real-Time Polymerase Chain Reaction/methods , Sequence Analysis, RNA/statistics & numerical data , Single-Cell Analysis/statistics & numerical data , Software
3.
Bioinformatics ; 29(8): 1035-43, 2013 Apr 15.
Article in English | MEDLINE | ID: mdl-23428641

ABSTRACT

MOTIVATION: Messenger RNA expression is important in normal development and differentiation, as well as in manifestation of disease. RNA-seq experiments allow for the identification of differentially expressed (DE) genes and their corresponding isoforms on a genome-wide scale. However, statistical methods are required to ensure that accurate identifications are made. A number of methods exist for identifying DE genes, but far fewer are available for identifying DE isoforms. When isoform DE is of interest, investigators often apply gene-level (count-based) methods directly to estimates of isoform counts. Doing so is not recommended. In short, estimating isoform expression is relatively straightforward for some groups of isoforms, but more challenging for others. This results in estimation uncertainty that varies across isoform groups. Count-based methods were not designed to accommodate this varying uncertainty, and consequently, application of them for isoform inference results in reduced power for some classes of isoforms and increased false discoveries for others. RESULTS: Taking advantage of the merits of empirical Bayesian methods, we have developed EBSeq for identifying DE isoforms in an RNA-seq experiment comparing two or more biological conditions. Results demonstrate substantially improved power and performance of EBSeq for identifying DE isoforms. EBSeq also proves to be a robust approach for identifying DE genes. AVAILABILITY AND IMPLEMENTATION: An R package containing examples and sample datasets is available at http://www.biostat.wisc.edu/kendzior/EBSEQ/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Gene Expression Profiling/methods , RNA Isoforms/metabolism , Sequence Analysis, RNA/methods , Bayes Theorem , Cell Line , Embryonic Stem Cells/metabolism , Genome , Models, Statistical , RNA, Messenger/metabolism , Software
4.
Bioinformatics ; 26(4): 493-500, 2010 Feb 15.
Article in English | MEDLINE | ID: mdl-20022975

ABSTRACT

MOTIVATION: RNA-Seq is a promising new technology for accurately measuring gene expression levels. Expression estimation with RNA-Seq requires the mapping of relatively short sequencing reads to a reference genome or transcript set. Because reads are generally shorter than transcripts from which they are derived, a single read may map to multiple genes and isoforms, complicating expression analyses. Previous computational methods either discard reads that map to multiple locations or allocate them to genes heuristically. RESULTS: We present a generative statistical model and associated inference methods that handle read mapping uncertainty in a principled manner. Through simulations parameterized by real RNA-Seq data, we show that our method is more accurate than previous methods. Our improved accuracy is the result of handling read mapping uncertainty with a statistical model and the estimation of gene expression levels as the sum of isoform expression levels. Unlike previous methods, our method is capable of modeling non-uniform read distributions. Simulations with our method indicate that a read length of 20-25 bases is optimal for gene-level expression estimation from mouse and maize RNA-Seq data when sequencing throughput is fixed.


Subject(s)
Gene Expression , Sequence Analysis, RNA/methods , Software , Algorithms , Animals , Base Sequence , Computational Biology/methods , Databases, Genetic , Gene Expression Profiling , Genome , Mice , Zea mays/genetics
SELECTION OF CITATIONS
SEARCH DETAIL