Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
País de afiliação
Intervalo de ano de publicação
1.
Bioinformatics ; 40(1)2024 01 02.
Artigo em Inglês | MEDLINE | ID: mdl-38141207

RESUMO

MOTIVATION: The utilization of single-cell bisulfite sequencing (scBS-seq) methods allows for precise analysis of DNA methylation patterns at the individual cell level, enabling the identification of rare populations, revealing cell-specific epigenetic changes, and improving differential methylation analysis. Nonetheless, the presence of sparse data and an overabundance of zeros and ones, attributed to limited sequencing depth and coverage, frequently results in reduced precision accuracy during the process of differential methylation detection using scBS-seq. Consequently, there is a pressing demand for an innovative differential methylation analysis approach that effectively tackles these data characteristics and enhances recognition accuracy. RESULTS: We propose a novel beta mixture approach called scDMV for analyzing methylation differences in single-cell bisulfite sequencing data, which effectively handles excess zeros and ones and accommodates low-input sequencing. Our extensive simulation studies demonstrate that the scDMV approach outperforms several alternative methods in terms of sensitivity, precision, and controlling the false positive rate. Moreover, in real data applications, we observe that scDMV exhibits higher precision and sensitivity in identifying differentially methylated regions, even with low-input samples. In addition, scDMV reveals important information for GO enrichment analysis with single-cell whole-genome sequencing data that are often overlooked by other methods. AVAILABILITY AND IMPLEMENTATION: The scDMV method, along with a comprehensive tutorial, can be accessed as an R package on the following GitHub repository: https://github.com/PLX-m/scDMV.


Assuntos
Metilação de DNA , Sulfitos , Análise de Sequência de DNA/métodos , Sequenciamento Completo do Genoma
2.
Stat Med ; 30(7): 725-41, 2011 Mar 30.
Artigo em Inglês | MEDLINE | ID: mdl-21394749

RESUMO

The CUSUM procedure has been popularly used for detecting a shift in the incidence rate of a rare health event. Many CUSUM methods are developed based on a Poisson model with a constant mean number of events. In practice, the expected number of events is likely to vary over time as the population size at risk is not constant but often grows over time. An increase in the baseline incidence rate tends to be masked by the population growth. To efficiently detect an increase in the baseline incidence rate, it is appealing to assign more weight to recent observations and less weight to older observations. This paper compares weighted CUSUM (WCUSUM) and conventional CUSUM procedures in the presence of monotone changes in population size. The simulation results show that the WCUSUM method may be more efficient than the conventional CUSUM methods in detecting increases in the incidence rate, especially for small shifts. An example based on mortality data from New Mexico is used to illustrate the implementation of the WCUSUM method.


Assuntos
Modelos Estatísticos , Vigilância da População/métodos , Interpretação Estatística de Dados , Feminino , Humanos , Incidência , Masculino , Neoplasias Induzidas por Radiação/epidemiologia , New Mexico/epidemiologia , Densidade Demográfica , Neoplasias da Glândula Tireoide/epidemiologia
3.
Front Genet ; 12: 642227, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33747051

RESUMO

Next-generation sequencing has emerged as an essential technology for the quantitative analysis of gene expression. In medical research, RNA sequencing (RNA-seq) data are commonly used to identify which type of disease a patient has. Because of the discrete nature of RNA-seq data, the existing statistical methods that have been developed for microarray data cannot be directly applied to RNA-seq data. Existing statistical methods usually model RNA-seq data by a discrete distribution, such as the Poisson, the negative binomial, or the mixture distribution with a point mass at zero and a Poisson distribution to further allow for data with an excess of zeros. Consequently, analytic tools corresponding to the above three discrete distributions have been developed: Poisson linear discriminant analysis (PLDA), negative binomial linear discriminant analysis (NBLDA), and zero-inflated Poisson logistic discriminant analysis (ZIPLDA). However, it is unclear what the real distributions would be for these classifications when applied to a new and real dataset. Considering that count datasets are frequently characterized by excess zeros and overdispersion, this paper extends the existing distribution to a mixture distribution with a point mass at zero and a negative binomial distribution and proposes a zero-inflated negative binomial logistic discriminant analysis (ZINBLDA) for classification. More importantly, we compare the above four classification methods from the perspective of model parameters, as an understanding of parameters is necessary for selecting the optimal method for RNA-seq data. Furthermore, we determine that the above four methods could transform into each other in some cases. Using simulation studies, we compare and evaluate the performance of these classification methods in a wide range of settings, and we also present a decision tree model created to help us select the optimal classifier for a new RNA-seq dataset. The results of the two real datasets coincide with the theory and simulation analysis results. The methods used in this work are implemented in the open-scource R scripts, with a source code freely available at https://github.com/FocusPaka/ZINBLDA.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA