Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
Mais filtros

Bases de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Artigo em Inglês | MEDLINE | ID: mdl-33477576

RESUMO

With the rapid spread of the pandemic due to the coronavirus disease 2019 (COVID-19), the virus has already led to considerable mortality and morbidity worldwide, as well as having a severe impact on economic development. In this article, we analyze the state-level correlation between COVID-19 risk and weather/climate factors in the USA. For this purpose, we consider a spatio-temporal multivariate time series model under a hierarchical framework, which is especially suitable for envisioning the virus transmission tendency across a geographic area over time. Briefly, our model decomposes the COVID-19 risk into: (i) an autoregressive component that describes the within-state COVID-19 risk effect; (ii) a spatiotemporal component that describes the across-state COVID-19 risk effect; (iii) an exogenous component that includes other factors (e.g., weather/climate) that could envision future epidemic development risk; and (iv) an endemic component that captures the function of time and other predictors mainly for individual states. Our results indicate that maximum temperature, minimum temperature, humidity, the percentage of cloud coverage, and the columnar density of total atmospheric ozone have a strong association with the COVID-19 pandemic in many states. In particular, the maximum temperature, minimum temperature, and the columnar density of total atmospheric ozone demonstrate statistically significant associations with the tendency of COVID-19 spreading in almost all states. Furthermore, our results from transmission tendency analysis suggest that the community-level transmission has been relatively mitigated in the USA, and the daily confirmed cases within a state are predominated by the earlier daily confirmed cases within that state compared to other factors, which implies that states such as Texas, California, and Florida with a large number of confirmed cases still need strategies like stay-at-home orders to prevent another outbreak.


Assuntos
COVID-19/epidemiologia , Pandemias , Tempo (Meteorologia) , COVID-19/transmissão , California , Florida , Humanos , Modelos Teóricos , Ozônio , Fatores de Risco , Análise Espaço-Temporal , Texas , Estados Unidos/epidemiologia
2.
Stat Methods Med Res ; 30(1): 129-150, 2021 01.
Artigo em Inglês | MEDLINE | ID: mdl-32746735

RESUMO

In this paper, we consider variable selection for ultra-high dimensional quantile regression model with missing data and measurement errors in covariates. Specifically, we correct the bias in the loss function caused by measurement error by applying the orthogonal quantile regression approach and remove the bias caused by missing data using the inverse probability weighting. A nonconvex Atan penalized estimation method is proposed for simultaneous variable selection and estimation. With the proper choice of the regularization parameter and under some relaxed conditions, we show that the proposed estimate enjoys the oracle properties. The choice of smoothing parameters is also discussed. The performance of the proposed variable selection procedure is assessed by Monte Carlo simulation studies. We further demonstrate the proposed procedure with a breast cancer data set.


Assuntos
Neoplasias da Mama , Viés , Simulação por Computador , Feminino , Humanos , Método de Monte Carlo , Probabilidade
3.
Comb Chem High Throughput Screen ; 23(8): 740-756, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32342803

RESUMO

AIM AND OBJECTIVE: Near Infrared (NIR) spectroscopy data are featured by few dozen to many thousands of samples and highly correlated variables. Quantitative analysis of such data usually requires a combination of analytical methods with variable selection or screening methods. Commonly-used variable screening methods fail to recover the true model when (i) some of the variables are highly correlated, and (ii) the sample size is less than the number of relevant variables. In these cases, Partial Least Squares (PLS) regression based approaches can be useful alternatives. MATERIALS AND METHODS: In this research, a fast variable screening strategy, namely the preconditioned screening for ridge partial least squares regression (PSRPLS), is proposed for modelling NIR spectroscopy data with high-dimensional and highly correlated covariates. Under rather mild assumptions, we prove that using Puffer transformation, the proposed approach successfully transforms the problem of variable screening with highly correlated predictor variables to that of weakly correlated covariates with less extra computational effort. RESULTS: We show that our proposed method leads to theoretically consistent model selection results. Four simulation studies and two real examples are then analyzed to illustrate the effectiveness of the proposed approach. CONCLUSION: By introducing Puffer transformation, high correlation problem can be mitigated using the PSRPLS procedure we construct. By employing RPLS regression to our approach, it can be made more simple and computational efficient to cope with the situation where model size is larger than the sample size while maintaining a high precision prediction.


Assuntos
Solo/química , Espectroscopia de Luz Próxima ao Infravermelho/métodos , Simulação por Computador , Bases de Dados de Compostos Químicos , Análise dos Mínimos Quadrados , Modelos Teóricos , Método de Monte Carlo
4.
Stat Med ; 38(23): 4670-4685, 2019 10 15.
Artigo em Inglês | MEDLINE | ID: mdl-31359443

RESUMO

The proportional subdistribution hazard regression model has been widely used by clinical researchers for analyzing competing risks data. It is well known that quantile regression provides a more comprehensive alternative to model how covariates influence not only the location but also the entire conditional distribution. In this paper, we develop variable selection procedures based on penalized estimating equations for competing risks quantile regression. Asymptotic properties of the proposed estimators including consistency and oracle properties are established. Monte Carlo simulation studies are conducted, confirming that the proposed methods are efficient. A bone marrow transplant data set is analyzed to demonstrate our methodologies.


Assuntos
Modelos de Riscos Proporcionais , Transplante de Medula Óssea , Simulação por Computador , Humanos , Leucemia Mieloide Aguda/terapia , Método de Monte Carlo
5.
Hum Genomics ; 13(1): 9, 2019 02 13.
Artigo em Inglês | MEDLINE | ID: mdl-30795817

RESUMO

BACKGROUND: Accurate and reliable identification of sequence variants, including single nucleotide polymorphisms (SNPs) and insertion-deletion polymorphisms (INDELs), plays a fundamental role in next-generation sequencing (NGS) applications. Existing methods for calling these variants often make simplified assumptions of positional independence and fail to leverage the dependence between genotypes at nearby loci that is caused by linkage disequilibrium (LD). RESULTS AND CONCLUSION: We propose vi-HMM, a hidden Markov model (HMM)-based method for calling SNPs and INDELs in mapped short-read data. This method allows transitions between hidden states (defined as "SNP," "Ins," "Del," and "Match") of adjacent genomic bases and determines an optimal hidden state path by using the Viterbi algorithm. The inferred hidden state path provides a direct solution to the identification of SNPs and INDELs. Simulation studies show that, under various sequencing depths, vi-HMM outperforms commonly used variant calling methods in terms of sensitivity and F1 score. When applied to the real data, vi-HMM demonstrates higher accuracy in calling SNPs and INDELs.


Assuntos
Algoritmos , Variação Genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Cadeias de Markov , Bases de Dados Genéticas , Haplótipos , Sequenciamento de Nucleotídeos em Larga Escala/estatística & dados numéricos , Humanos , Mutação INDEL , Desequilíbrio de Ligação , Polimorfismo de Nucleotídeo Único
6.
J Biopharm Stat ; 26(2): 323-38, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-25632882

RESUMO

Under the assumption of missing at random, eight confidence intervals (CIs) for the difference between two correlated proportions in the presence of incomplete paired binary data are constructed on the basis of the likelihood ratio statistic, the score statistic, the Wald-type statistic, the hybrid method incorporated with the Wilson score and Agresti-Coull (AC) intervals, and the Bootstrap-resampling method. Extensive simulation studies are conducted to evaluate the performance of the presented CIs in terms of coverage probability and expected interval width. Our empirical results evidence that the Wilson-score-based hybrid CI and the Wald-type CI together with the constrained maximum likelihood estimates perform well for small-to-moderate sample sizes in the sense that (i) their empirical coverage probabilities are quite close to the prespecified confidence level, (ii) their expected interval widths are shorter, and (iii) their ratios of the mesial non-coverage to non-coverage probabilities lie in interval [0.4, 0.6]. An example from a neurological study is used to illustrate the proposed methodologies.


Assuntos
Intervalos de Confiança , Modelos Estatísticos , Ensaios Clínicos Controlados Aleatórios como Assunto/estatística & dados numéricos , Simulação por Computador , Estudos Cross-Over , Interpretação Estatística de Dados , Humanos , Análise por Pareamento , Meningite/complicações , Meningite/tratamento farmacológico , Método de Monte Carlo , Exame Neurológico/estatística & dados numéricos
7.
Stat Med ; 33(25): 4370-86, 2014 Nov 10.
Artigo em Inglês | MEDLINE | ID: mdl-24974954

RESUMO

Stratified data analysis is an important research topic in many biomedical studies and clinical trials. In this article, we develop five test statistics for testing the homogeneity of proportion ratios for stratified correlated bilateral binary data based on an equal correlation model assumption. Bootstrap procedures based on these test statistics are also considered. To evaluate the performance of these statistics and procedures, we conduct Monte Carlo simulations to study their empirical sizes and powers under various scenarios. Our results suggest that the procedure based on score statistic performs well generally and is highly recommended. When the sample size is large, procedures based on the commonly used weighted least square estimate and logarithmic transformation with Mantel-Haenszel estimate are recommended as they do not involve any computation of maximum likelihood estimates requiring iterative algorithms. We also derive approximate sample size formulas based on the recommended test procedures. Finally, we apply the proposed methods to analyze a multi-center randomized clinical trial for scleroderma patients.


Assuntos
Interpretação Estatística de Dados , Modelos Estatísticos , Ensaios Clínicos Controlados Aleatórios como Assunto/métodos , Simulação por Computador , Humanos , Método de Monte Carlo , Tamanho da Amostra , Escleroderma Sistêmico/patologia , Escleroderma Sistêmico/terapia , Resultado do Tratamento
8.
Biom J ; 50(2): 283-98, 2008 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-18311854

RESUMO

In this paper we compare the properties of four different general approaches for testing the ratio of two Poisson rates. Asymptotically normal tests, tests based on approximate p -values, exact conditional tests, and a likelihood ratio test are considered. The properties and power performance of these tests are studied by a Monte Carlo simulation experiment. Sample size calculation formulae are given for each of the test procedures and their validities are studied. Some recommendations favoring the likelihood ratio and certain asymptotic tests are based on these simulation results. Finally, all of the test procedures are illustrated with two real life medical examples.


Assuntos
Interpretação Estatística de Dados , Distribuição de Poisson , Tamanho da Amostra , Neoplasias da Mama/epidemiologia , Simulação por Computador , Métodos Epidemiológicos , Humanos , Funções Verossimilhança , Método de Monte Carlo
9.
Stat Med ; 24(6): 955-65, 2005 Mar 30.
Artigo em Inglês | MEDLINE | ID: mdl-15532090

RESUMO

In this article, we investigate procedures for comparing two independent Poisson variates that are observed over unequal sampling frames (i.e. time intervals, populations, areas or any combination thereof). We consider two statistics (with and without the logarithmic transformation) for testing the equality of two Poisson rates. Two methods for implementing these statistics are reviewed. They are (1) the sample-based method, and (2) the constrained maximum likelihood estimation (CMLE) method. We conduct an empirical study to evaluate the performance of different statistics and methods. Generally, we find that the CMLE method works satisfactorily only for the statistic without the logarithmic transformation (denoted as W(2)) while sample-based method performs better for the statistic using the logarithmic transformation (denoted as W(3)). It is noteworthy that both statistics perform well for moderate to large Poisson rates (e.g. > or =10). For small Poisson rates (e.g. <10), W(2) can be liberal (e.g. actual type I error rate/nominal level > or =1.2) while W(3) can be conservative (e.g. actual type I error rate/nominal level < or =0.8). The corresponding sample size formulae are provided and valid in the sense that the simulated powers associated with the approximate sample size formulae are generally close to the pre-chosen power level. We illustrate our methodologies with a real example from a breast cancer study.


Assuntos
Interpretação Estatística de Dados , Distribuição de Poisson , Neoplasias da Mama/etiologia , Simulação por Computador , Feminino , Fluoroscopia/efeitos adversos , Humanos , Funções Verossimilhança , Método de Monte Carlo , Tamanho da Amostra , Espectrometria por Raios X , Tuberculose/radioterapia
10.
Stat Med ; 23(23): 3593-605, 2004 Dec 15.
Artigo em Inglês | MEDLINE | ID: mdl-15534897

RESUMO

Diagnostic tests are seldom adopted in isolation. Few tests have high sensitivity and specificity simultaneously. In these cases, one can increase either the sensitivity or the specificity by combining two component tests under either the 'either positive' rule or the 'both positive' rule. However, there is a tradeoff between sensitivity and specificity when these rules are applied. We propose three statistical procedures to simultaneously assess the sensitivity and specificity when combining two component tests. Measurements of interest include rate difference and rate ratio. Our empirical results demonstrate that (i) the asymptotic test procedures for both measurements and approximate test procedure for rate difference possess inflated type I error rate; (ii) the exact test procedures for both measurements possess deflated type I error rate; and (iii) the approximate (unconditional) test procedure for rate ratio becomes an reliable alternative and nicely controls the actual type I error rate in small to moderate sample sizes. Moreover, the approximate (unconditional) test procedure is computationally less intensive than the exact (unconditional) test procedure. We illustrate our methodologies with a real example from a residual nasopharyngeal carcinoma (RNP) study.


Assuntos
Biometria/métodos , Testes Diagnósticos de Rotina/estatística & dados numéricos , Humanos , Modelos Estatísticos , Neoplasias Nasofaríngeas/diagnóstico por imagem , Recidiva Local de Neoplasia/diagnóstico por imagem , Cintilografia , Compostos Radiofarmacêuticos , Sensibilidade e Especificidade , Tecnécio Tc 99m Sestamibi , Tomografia Computadorizada por Raios X/estatística & dados numéricos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA