Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
1.
Biostatistics ; 15(1): 87-101, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-23887981

RESUMO

Blood and tissue are composed of many functionally distinct cell subsets. In immunological studies, these can be measured accurately only using single-cell assays. The characterization of these small cell subsets is crucial to decipher system-level biological changes. For this reason, an increasing number of studies rely on assays that provide single-cell measurements of multiple genes and proteins from bulk cell samples. A common problem in the analysis of such data is to identify biomarkers (or combinations of biomarkers) that are differentially expressed between two biological conditions (e.g. before/after stimulation), where expression is defined as the proportion of cells expressing that biomarker (or biomarker combination) in the cell subset(s) of interest. Here, we present a Bayesian hierarchical framework based on a beta-binomial mixture model for testing for differential biomarker expression using single-cell assays. Our model allows the inference to be subject specific, as is typically required when assessing vaccine responses, while borrowing strength across subjects through common prior distributions. We propose two approaches for parameter estimation: an empirical-Bayes approach using an Expectation-Maximization algorithm and a fully Bayesian one based on a Markov chain Monte Carlo algorithm. We compare our method against classical approaches for single-cell assays including Fisher's exact test, a likelihood ratio test, and basic log-fold changes. Using several experimental assays measuring proteins or genes at single-cell level and simulations, we show that our method has higher sensitivity and specificity than alternative methods. Additional simulations show that our framework is also robust to model misspecification. Finally, we demonstrate how our approach can be extended to testing multivariate differential expression across multiple biomarker combinations using a Dirichlet-multinomial model and illustrate this approach using single-cell gene expression data and simulations.


Assuntos
Algoritmos , Teorema de Bayes , Biomarcadores/análise , Modelos Estatísticos , Análise de Célula Única/métodos , Vacinas contra a AIDS/imunologia , Simulação por Computador , Infecções por HIV/imunologia , Infecções por HIV/prevenção & controle , Humanos , Cadeias de Markov , Método de Monte Carlo
2.
Bioinformatics ; 29(21): 2797-8, 2013 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-23958729

RESUMO

MOTIVATION: Recently, mapping studies of expression quantitative loci (eQTL) (where gene expression levels are viewed as quantitative traits) have provided insight into the biology of gene regulation. Bayesian methods provide natural modeling frameworks for analyzing eQTL studies, where information shared across markers and/or genes can increase the power to detect eQTLs. Bayesian approaches tend to be computationally demanding and require specialized software. As a result, most eQTL studies use univariate methods treating each gene independently, leading to suboptimal results. RESULTS: We present a powerful, computationally optimized and free open-source R package, iBMQ. Our package implements a joint hierarchical Bayesian model where all genes and SNPs are modeled concurrently. Model parameters are estimated using a Markov chain Monte Carlo algorithm. The free and widely used openMP parallel library speeds up computation. Using a mouse cardiac dataset, we show that iBMQ improves the detection of large trans-eQTL hotspots compared with other state-of-the-art packages for eQTL analysis. AVAILABILITY: The R-package iBMQ is available from the Bioconductor Web site at http://bioconductor.org and runs on Linux, Windows and MAC OS X. It is distributed under the Artistic Licence-2.0 terms. CONTACT: christian.deschepper@ircm.qc.ca or rgottard@fhcrc.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Expressão Gênica , Locos de Características Quantitativas , Software , Algoritmos , Animais , Teorema de Bayes , Cadeias de Markov , Camundongos , Método de Monte Carlo , Polimorfismo de Nucleotídeo Único
3.
Nat Methods ; 10(3): 228-38, 2013 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-23396282

RESUMO

Traditional methods for flow cytometry (FCM) data processing rely on subjective manual gating. Recently, several groups have developed computational methods for identifying cell populations in multidimensional FCM data. The Flow Cytometry: Critical Assessment of Population Identification Methods (FlowCAP) challenges were established to compare the performance of these methods on two tasks: (i) mammalian cell population identification, to determine whether automated algorithms can reproduce expert manual gating and (ii) sample classification, to determine whether analysis pipelines can identify characteristics that correlate with external variables (such as clinical outcome). This analysis presents the results of the first FlowCAP challenges. Several methods performed well as compared to manual gating or external variables using statistical performance measures, which suggests that automated methods have reached a sufficient level of maturity and accuracy for reliable use in FCM data analysis.


Assuntos
Biologia Computacional , Citometria de Fluxo/métodos , Processamento de Imagem Assistida por Computador , Algoritmos , Animais , Análise por Conglomerados , Interpretação Estatística de Dados , Citometria de Fluxo/normas , Citometria de Fluxo/estatística & dados numéricos , Doença Enxerto-Hospedeiro/sangue , Doença Enxerto-Hospedeiro/patologia , Humanos , Leucócitos Mononucleares/patologia , Leucócitos Mononucleares/virologia , Linfoma Difuso de Grandes Células B/sangue , Linfoma Difuso de Grandes Células B/patologia , Reprodutibilidade dos Testes , Sensibilidade e Especificidade , Software , Febre do Nilo Ocidental/sangue , Febre do Nilo Ocidental/patologia , Febre do Nilo Ocidental/virologia
4.
BMC Bioinformatics ; 13: 252, 2012 Sep 28.
Artigo em Inglês | MEDLINE | ID: mdl-23020243

RESUMO

BACKGROUND: Effective quality assessment is an important part of any high-throughput flow cytometry data analysis pipeline, especially when considering the complex designs of the typical flow experiments applied in clinical trials. Technical issues like instrument variation, problematic antibody staining, or reagent lot changes can lead to biases in the extracted cell subpopulation statistics. These biases can manifest themselves in non-obvious ways that can be difficult to detect without leveraging information about the study design or other experimental metadata. Consequently, a systematic and integrated approach to quality assessment of flow cytometry data is necessary to effectively identify technical errors that impact multiple samples over time. Gated cell populations and their statistics must be monitored within the context of the experimental run, assay, and the overall study. RESULTS: We have developed two new packages, flowWorkspace and QUAliFiER to construct a pipeline for quality assessment of gated flow cytometry data. flowWorkspace makes manually gated data accessible to BioConductor's computational flow tools by importing pre-processed and gated data from the widely used manual gating tool, FlowJo (Tree Star Inc, Ashland OR). The QUAliFiER package takes advantage of the manual gates to perform an extensive series of statistical quality assessment checks on the gated cell sub-populations while taking into account the structure of the data and the study design to monitor the consistency of population statistics across staining panels, subject, aliquots, channels, or other experimental variables. QUAliFiER implements SVG-based interactive visualization methods, allowing investigators to examine quality assessment results across different views of the data, and it has a flexible interface allowing users to tailor quality checks and outlier detection routines to suit their data analysis needs. CONCLUSION: We present a pipeline constructed from two new R packages for importing manually gated flow cytometry data and performing flexible and robust quality assessment checks. The pipeline addresses the increasing demand for tools capable of performing quality checks on large flow data sets generated in typical clinical trials. The QUAliFiER tool objectively, efficiently, and reproducibly identifies outlier samples in an automated manner by monitoring cell population statistics from gated or ungated flow data conditioned on experiment-level metadata.


Assuntos
Citometria de Fluxo/estatística & dados numéricos , Software , Interpretação Estatística de Dados , Humanos
5.
Biostatistics ; 7(1): 85-99, 2006 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-16049139

RESUMO

We describe a probabilistic approach to simultaneous image segmentation and intensity estimation for complementary DNA microarray experiments. The approach overcomes several limitations of existing methods. In particular, it (a) uses a flexible Markov random field approach to segmentation that allows for a wider range of spot shapes than existing methods, including relatively common 'doughnut-shaped' spots; (b) models the image directly as background plus hybridization intensity, and estimates the two quantities simultaneously, avoiding the common logical error that estimates of foreground may be less than those of the corresponding background if the two are estimated separately; and (c) uses a probabilistic modeling approach to simultaneously perform segmentation and intensity estimation, and to compute spot quality measures. We describe two approaches to parameter estimation: a fast algorithm, based on the expectation-maximization and the iterated conditional modes algorithms, and a fully Bayesian framework. These approaches produce comparable results, and both appear to offer some advantages over other methods. We use an HIV experiment to compare our approach to two commercial software products: Spot and Arrayvision.


Assuntos
Análise de Sequência com Séries de Oligonucleotídeos/métodos , Algoritmos , Teorema de Bayes , Perfilação da Expressão Gênica/métodos , Interpretação de Imagem Assistida por Computador/métodos , Cadeias de Markov , Modelos Estatísticos , Reconhecimento Automatizado de Padrão/métodos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA