RESUMO
Traditional methods for flow cytometry (FCM) data processing rely on subjective manual gating. Recently, several groups have developed computational methods for identifying cell populations in multidimensional FCM data. The Flow Cytometry: Critical Assessment of Population Identification Methods (FlowCAP) challenges were established to compare the performance of these methods on two tasks: (i) mammalian cell population identification, to determine whether automated algorithms can reproduce expert manual gating and (ii) sample classification, to determine whether analysis pipelines can identify characteristics that correlate with external variables (such as clinical outcome). This analysis presents the results of the first FlowCAP challenges. Several methods performed well as compared to manual gating or external variables using statistical performance measures, which suggests that automated methods have reached a sufficient level of maturity and accuracy for reliable use in FCM data analysis.
Assuntos
Biologia Computacional , Citometria de Fluxo/métodos , Processamento de Imagem Assistida por Computador , Algoritmos , Animais , Análise por Conglomerados , Interpretação Estatística de Dados , Citometria de Fluxo/normas , Citometria de Fluxo/estatística & dados numéricos , Doença Enxerto-Hospedeiro/sangue , Doença Enxerto-Hospedeiro/patologia , Humanos , Leucócitos Mononucleares/patologia , Leucócitos Mononucleares/virologia , Linfoma Difuso de Grandes Células B/sangue , Linfoma Difuso de Grandes Células B/patologia , Reprodutibilidade dos Testes , Sensibilidade e Especificidade , Software , Febre do Nilo Ocidental/sangue , Febre do Nilo Ocidental/patologia , Febre do Nilo Ocidental/virologiaRESUMO
BACKGROUND: The recent development of semiautomated techniques for staining and analyzing flow cytometry samples has presented new challenges. Quality control and quality assessment are critical when developing new high throughput technologies and their associated information services. Our experience suggests that significant bottlenecks remain in the development of high throughput flow cytometry methods for data analysis and display. Especially, data quality control and quality assessment are crucial steps in processing and analyzing high throughput flow cytometry data. METHODS: We propose a variety of graphical exploratory data analytic tools for exploring ungated flow cytometry data. We have implemented a number of specialized functions and methods in the Bioconductor package rflowcyt. We demonstrate the use of these approaches by investigating two independent sets of high throughput flow cytometry data. RESULTS: We found that graphical representations can reveal substantial nonbiological differences in samples. Empirical Cumulative Distribution Function and summary scatterplots were especially useful in the rapid identification of problems not identified by manual review. CONCLUSIONS: Graphical exploratory data analytic tools are quick and useful means of assessing data quality. We propose that the described visualizations should be used as quality assessment tools and where possible, be used for quality control.