Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 26
Filtrar
1.
Artigo em Inglês | MEDLINE | ID: mdl-38409814

RESUMO

A sufficient number of participants should be included to adequately address the research interest in the surveys with sensitive questions. In this paper, sample size formulas/iterative algorithms are developed from the perspective of controlling the confidence interval width of the prevalence of a sensitive attribute under four non-randomized response models: the crosswise model, parallel model, Poisson item count technique model and negative binomial item count technique model. In contrast to the conventional approach for sample size determination, our sample size formulas/algorithms explicitly incorporate an assurance probability of controlling the width of a confidence interval within the pre-specified range. The performance of the proposed methods is evaluated with respect to the empirical coverage probability, empirical assurance probability and confidence width. Simulation results show that all formulas/algorithms are effective and hence are recommended for practical applications. A real example is used to illustrate the proposed methods.

2.
J Biopharm Stat ; 32(6): 871-896, 2022 11 02.
Artigo em Inglês | MEDLINE | ID: mdl-35536693

RESUMO

This article investigates the confidence interval (CI) construction of proportion difference for two independent partially validated series under the double-sampling scheme in which both classifiers are fallible. Several CIs based on the variance estimates recovery method of combining confidence limits from asymptotic, bootstrap, and Bayesian methods for two independent binomial proportions are developed under two models. Simulation results show that all CIs except for the bootstrap percentile-t CI and Bayesian credible interval with uniform prior under the independence model and all CIs under the dependence model generally perform well and are recommended. Two examples are used to illustrate the methodologies.


Assuntos
Modelos Estatísticos , Humanos , Teorema de Bayes , Intervalos de Confiança , Simulação por Computador
3.
J Appl Stat ; 47(8): 1375-1401, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-35706696

RESUMO

A disease prevalence can be estimated by classifying subjects according to whether they have the disease. When gold-standard tests are too expensive to be applied to all subjects, partially validated data can be obtained by double-sampling in which all individuals are classified by a fallible classifier, and some of individuals are validated by the gold-standard classifier. However, it could happen in practice that such infallible classifier does not available. In this article, we consider two models in which both classifiers are fallible and propose four asymptotic test procedures for comparing disease prevalence in two groups. Corresponding sample size formulae and validated ratio given the total sample sizes are also derived and evaluated. Simulation results show that (i) Score test performs well and the corresponding sample size formula is also accurate in terms of the empirical power and size in two models; (ii) the Wald test based on the variance estimator with parameters estimated under the null hypothesis outperforms the others even under small sample sizes in Model II, and the sample size estimated by this test is also accurate; (iii) the estimated validated ratios based on all tests are accurate. The malarial data are used to illustrate the proposed methodologies.

4.
Stat Methods Med Res ; 29(2): 359-373, 2020 02.
Artigo em Inglês | MEDLINE | ID: mdl-30841791

RESUMO

Ordinal responses are common in clinical studies. Although the proportional odds model is a popular option for analyzing ordered-categorical data, it cannot control the type I error rate when the proportional odds assumption fails to hold. The latent Weibull model was recently shown to be a superior candidate for modeling ordinal data, with remarkably better performance than the latent normal model when the data are highly skewed. In clinical trials with ordinal responses, a balanced design is common, with equal sample allocation for each treatment. However, a more ethical approach is to adopt a response-adaptive allocation scheme in which more patients receive the better treatment. In this paper, we propose the use of the doubly adaptive biased coin design to generate treatment allocations that benefit the trial participants. The proposed treatment allocation scheme not only allows more patients to receive the better treatment, it also maintains compatible test power for the comparison of treatment efficiencies. A clinical example is used to illustrate the proposed procedure.


Assuntos
Viés , Protocolos Clínicos , Estudos Clínicos como Assunto/estatística & dados numéricos , Modelos Estatísticos , Humanos , Avaliação de Processos e Resultados em Cuidados de Saúde/estatística & dados numéricos , Resultado do Tratamento
5.
Stat Med ; 38(28): 5332-5349, 2019 12 10.
Artigo em Inglês | MEDLINE | ID: mdl-31637752

RESUMO

New treatments that are noninferior or equivalent to-but not necessarily superior to-the reference treatment may still be beneficial to patients because they have fewer side effects, are more convenient, take less time, or cost less. The noninferiority test is widely used in medical research to provide guidance in such situation. In addition, categorical variables are frequently encountered in medical research, such as in studies involving patient-reported outcomes. In this paper, we develop a noninferiority testing procedure for correlated ordinal categorical variables based on a paired design with a latent normal distribution approach. Misclassification is frequently encountered in the collection of ordinal categorical data; therefore, we further extend the procedure to account for misclassification using information in the partially validated data. Simulation studies are conducted to investigate the accuracy of the estimates, the type I error rates, and the power of the proposed procedure. Finally, we analyze one substantive example to demonstrate the utility of the proposed approach.


Assuntos
Estudos de Equivalência como Asunto , Modelos Estatísticos , Bioestatística , Simulação por Computador , Interpretação Estatística de Dados , Humanos , Malária/parasitologia , Malária/prevenção & controle , Malária/transmissão , Resultado do Tratamento
6.
J Biopharm Stat ; 29(3): 446-467, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-30933654

RESUMO

A stratified study is often designed for adjusting a confounding effect or effect of different centers/groups in two treatments or diagnostic tests, and the risk difference is one of the most frequently used indices in comparing efficiency between two treatments or diagnostic tests. This article presented five simultaneous confidence intervals (CIs) for risk differences in stratified bilateral designs accounting for the intraclass correlation and developed seven CIs for the common risk difference under the homogeneity assumption. The performance of the CIs is evaluated with respect to the empirical coverage probabilities, empirical coverage widths and ratios of mesial noncoverage probability and the noncoverage probability under various scenarios. Empirical results show that Wald simultaneous CI, Haldane simultaneous CI, Score simultaneous CI based on Bonferroni method and simultaneous CI based on bootstrap-resampling method perform satisfactorily and hence be recommended for applications, the CI based on the weighted-least-square (WLS) estimator, the CIs based on Mantel-Haenszel estimator, the CI based on Cochran statistic and the CI based on Score statistic for the common risk difference behave well even under small sample sizes. A real data example is used to demonstrate the proposed methodologies.


Assuntos
Intervalos de Confiança , Modelos Estatísticos , Ensaios Clínicos Controlados Aleatórios como Assunto/métodos , Ensaios Clínicos Controlados Aleatórios como Assunto/estatística & dados numéricos , Projetos de Pesquisa/estatística & dados numéricos , Simulação por Computador , Humanos , Análise dos Mínimos Quadrados , Probabilidade , Risco , Tamanho da Amostra
7.
Stat Methods Med Res ; 28(4): 1019-1043, 2019 04.
Artigo em Inglês | MEDLINE | ID: mdl-29233082

RESUMO

Double sampling is usually applied to collect necessary information for situations in which an infallible classifier is available for validating a subset of the sample that has already been classified by a fallible classifier. Inference procedures have previously been developed based on the partially validated data obtained by the double-sampling process. However, it could happen in practice that such infallible classifier or gold standard does not exist. In this article, we consider the case in which both classifiers are fallible and propose asymptotic and approximate unconditional test procedures based on six test statistics for a population proportion and five approximate sample size formulas based on the recommended test procedures under two models. Our results suggest that both asymptotic and approximate unconditional procedures based on the score statistic perform satisfactorily for small to large sample sizes and are highly recommended. When sample size is moderate or large, asymptotic procedures based on the Wald statistic with the variance being estimated under the null hypothesis, likelihood rate statistic, log- and logit-transformation statistics based on both models generally perform well and are hence recommended. The approximate unconditional procedures based on the log-transformation statistic under Model I, Wald statistic with the variance being estimated under the null hypothesis, log- and logit-transformation statistics under Model II are recommended when sample size is small. In general, sample size formulae based on the Wald statistic with the variance being estimated under the null hypothesis, likelihood rate statistic and score statistic are recommended in practical applications. The applicability of the proposed methods is illustrated by a real-data example.


Assuntos
Modelos Estatísticos , Estudos de Amostragem , Algoritmos , Humanos , Funções Verossimilhança , Noruega , Tamanho da Amostra
8.
J Biopharm Stat ; 27(1): 111-123, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-26881877

RESUMO

In clinical studies, ordered categorical responses are common. To compare the efficacy of several treatments with a control for ordinal responses, the normal latent variable model has recently been proposed. This approach conceptualizes the responses as manifestations of an underlying continuous normal variable. In this article, we extend this idea to develop the multiple comparison method for use when there are two controls in the clinical trial. The proposed method is constructed such that the familywise type I error rate is controlled at a prespecified level. In addition, for a given level of test power, the procedure to evaluate the required sample size is provided. The proposed testing procedure is also illustrated by an example from a clinical study.


Assuntos
Ensaios Clínicos como Assunto , Modelos Estatísticos , Projetos de Pesquisa , Humanos , Tamanho da Amostra
9.
Stat Med ; 35(2): 189-201, 2016 Jan 30.
Artigo em Inglês | MEDLINE | ID: mdl-26289419

RESUMO

In clinical studies, the proportional odds model is widely used to compare treatment efficacies when the responses are categorically ordered. However, this model has been shown to be inappropriate when the proportional odds assumption is invalid, mainly because it is unable to control the type I error rate in such circumstances. To remedy this problem, the latent normal model was recently promoted and has been demonstrated to be superior to the proportional odds model. However, the application of the latent normal model is limited to compare treatments with similar underlying distributions except possibly their means and variances. When the underlying distributions are very different in skewness, both of the aforementioned procedures suffer from the undesirable inflation of the type I error rate. To solve the problem for clinical studies with ordinal responses, we provide a viable solution that relies on the use of the latent Weibull distribution, which is a member of the log-location-scale family. The proposed model is able to control the type I error rate regardless of the degree of skewness of the treatment responses. In addition, the power of the test also outperforms that of the latent normal model. The testing procedure draws on newly developed theoretical results related to latent distributions from the location-scale family. The testing procedure is illustrated with two clinical examples.


Assuntos
Bioestatística/métodos , Modelos Estatísticos , Resultado do Tratamento , Analgésicos/farmacologia , Simulação por Computador , Humanos , Ketamina/farmacologia , Modelos Logísticos , Dor/prevenção & controle , Propofol/administração & dosagem , Propofol/efeitos adversos , Doenças Retinianas/etiologia , Fumar/efeitos adversos , Distribuições Estatísticas
10.
Stat Methods Med Res ; 25(1): 37-63, 2016 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-22374341

RESUMO

Disease prevalence is an important topic in medical research, and its study is based on data that are obtained by classifying subjects according to whether a disease has been contracted. Classification can be conducted with high-cost gold standard tests or low-cost screening tests, but the latter are subject to the misclassification of subjects. As a compromise between the two, many research studies use partially validated datasets in which all data points are classified by fallible tests, and some of the data points are validated in the sense that they are also classified by the completely accurate gold-standard test. In this article, we investigate the determination of sample sizes for disease prevalence studies with partially validated data. We use two approaches. The first is to find sample sizes that can achieve a pre-specified power of a statistical test at a chosen significance level, and the second is to find sample sizes that can control the width of a confidence interval with a pre-specified confidence level. Empirical studies have been conducted to demonstrate the performance of various testing procedures with the proposed sample sizes. The applicability of the proposed methods are illustrated by a real-data example.


Assuntos
Bases de Dados Factuais/estatística & dados numéricos , Prevalência , Tamanho da Amostra , Anemia Aplástica/terapia , Bioestatística , Transplante de Medula Óssea/efeitos adversos , Simulação por Computador , Intervalos de Confiança , Doença Enxerto-Hospedeiro/epidemiologia , Doença Enxerto-Hospedeiro/etiologia , Humanos , Funções Verossimilhança , Modelos Estatísticos , Estudos de Validação como Assunto
11.
Stat Methods Med Res ; 25(5): 2250-2273, 2016 10.
Artigo em Inglês | MEDLINE | ID: mdl-24448443

RESUMO

Partially validated series are common when a gold-standard test is too expensive to be applied to all subjects, and hence a fallible device is used accordingly to measure the presence of a characteristic of interest. In this article, confidence interval construction for proportion difference between two independent partially validated series is studied. Ten confidence intervals based on the method of variance estimates recovery (MOVER) are proposed, with each using the confidence limits for the two independent binomial proportions obtained by the asymptotic, Logit-transformation, Agresti-Coull and Bayesian methods. The performances of the proposed confidence intervals and three likelihood-based intervals available in the literature are compared with respect to the empirical coverage probability, confidence width and ratio of mesial non-coverage to non-coverage probability. Our empirical results show that (1) all confidence intervals exhibit good performance in large samples; (2) confidence intervals based on MOVER combining the confidence limits for binomial proportions based on Wilson, Agresti-Coull, Logit-transformation, Bayesian (with three priors) methods perform satisfactorily from small to large samples, and hence can be recommended for practical applications. Two real data sets are analysed to illustrate the proposed methods.


Assuntos
Teorema de Bayes , Intervalos de Confiança , Acidentes de Trânsito/estatística & dados numéricos , Anemia Aplástica/epidemiologia , Automóveis , Distribuição Binomial , Feminino , Humanos , Funções Verossimilhança , Masculino , Prevalência , Reprodutibilidade dos Testes , Adulto Jovem
12.
Stat Methods Med Res ; 24(6): 949-67, 2015 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-22267547

RESUMO

Ordered categorical data are frequently encountered in clinical studies. A popular method for comparing the efficacy of treatments is to use logistic regression with the proportional odds assumption. The test statistic is based on the Wilcoxon-Mann-Whitney test. However, the proportional odds assumption may not be appropriate. In such cases, the probability of rejecting the null hypothesis is much inflated even though the treatments have the same mean efficacy. An alternative approach that does not rely on the proportional odds assumption is to conceptualize the responses as manifestations of some underlying continuous variables. However, statistical procedures were developed only for the comparison of two treatments. In this article, we derive testing procedures that compare several treatments to a control, utilizing a latent normal distribution with the latent variable model. The proposed procedure is useful because multiple comparisons with a control is very frequently an objective of a clinical study. Data from clinical trials are used to illustrate the proposed procedures.


Assuntos
Interpretação Estatística de Dados , Modelos Estatísticos , Resultado do Tratamento , Ensaios Clínicos como Assunto , Humanos , Modelos Logísticos , Estatísticas não Paramétricas
13.
Stat Med ; 33(21): 3629-38, 2014 Sep 20.
Artigo em Inglês | MEDLINE | ID: mdl-24757077

RESUMO

In clinical studies, multiple comparisons of several treatments to a control with ordered categorical responses are often encountered. A popular statistical approach to analyzing the data is to use the logistic regression model with the proportional odds assumption. As discussed in several recent research papers, if the proportional odds assumption fails to hold, the undesirable consequence of an inflated familywise type I error rate may affect the validity of the clinical findings. To remedy the problem, a more flexible approach that uses the latent normal model with single-step and stepwise testing procedures has been recently proposed. In this paper, we introduce a step-up procedure that uses the correlation structure of test statistics under the latent normal model. A simulation study demonstrates the superiority of the proposed procedure to all existing testing procedures. Based on the proposed step-up procedure, we derive an algorithm that enables the determination of the total sample size and the sample size allocation scheme with a pre-determined level of test power before the onset of a clinical trial. A clinical example is presented to illustrate our proposed method.


Assuntos
Algoritmos , Ensaios Clínicos como Assunto/métodos , Interpretação Estatística de Dados , Modelos Estatísticos , Simulação por Computador , Fentanila/administração & dosagem , Humanos , Lidocaína/administração & dosagem , Dor/prevenção & controle , Tamanho da Amostra
14.
Psychometrika ; 79(4): 605-20, 2014 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-24288031

RESUMO

Different latent variable models have been used to analyze ordinal categorical data which can be conceptualized as manifestations of an unobserved continuous variable. In this paper, we propose a unified framework based on a general latent variable model for the comparison of treatments with ordinal responses. The latent variable model is built upon the location-scale family and is rich enough to include many important existing models for analyzing ordinal categorical variables, including the proportional odds model, the ordered probit-type model, and the proportional hazards model. A flexible estimation procedure is proposed for the identification and estimation of the general latent variable model, which allows for the location and scale parameters to be freely estimated. The framework advances the existing methods by enabling many other popular models for analyzing continuous variables to be used to analyze ordinal categorical data, thus allowing for important statistical inferences such as location and/or dispersion comparisons among treatments to be conveniently drawn. Analysis on real data sets is used to illustrate the proposed methods.


Assuntos
Interpretação Estatística de Dados , Modelos Estatísticos , Avaliação de Resultados em Cuidados de Saúde/métodos , Humanos
15.
Stat Med ; 32(18): 3192-205, 2013 Aug 15.
Artigo em Inglês | MEDLINE | ID: mdl-23386287

RESUMO

Clinical trials frequently involve pairwise comparisons of different treatments to evaluate their relative efficacy. In this study, we examine methods for conducting pairwise tests of treatments with ordered categorical responses. A modified version of the Wilcoxon-Mann-Whitney test based on a logistic regression model assuming proportional odds is a popular choice for comparing two treatments. This paper discusses the extension of this test to pairwise comparisons involving more than two treatments. However, when the proportional odds assumption is not valid, the Wilcoxon-Mann-Whitney-type test procedure cannot control the overall type I error rate at the prespecified level of significance. We therefore propose a better strategy in which a latent normal model is employed. We presented a simulated comparative study of power and the overall type I error rate to illustrate the superiority of the latent normal model. Examples are also given for illustrative purposes.


Assuntos
Ensaios Clínicos como Assunto/métodos , Modelos Logísticos , Alfentanil/farmacologia , Criança , Pré-Escolar , Simulação por Computador , Humanos , Dor/tratamento farmacológico , Piperidinas/farmacologia , Propofol/efeitos adversos , Remifentanil
16.
Biom J ; 54(6): 786-807, 2012 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-22941869

RESUMO

Comparing disease prevalence in two groups is an important topic in medical research, and prevalence rates are obtained by classifying subjects according to whether they have the disease. Both high-cost infallible gold-standard classifiers or low-cost fallible classifiers can be used to classify subjects. However, statistical analysis that is based on data sets with misclassifications leads to biased results. As a compromise between the two classification approaches, partially validated sets are often used in which all individuals are classified by fallible classifiers, and some of the individuals are validated by the accurate gold-standard classifiers. In this article, we develop several reliable test procedures and approximate sample size formulas for disease prevalence studies based on the difference between two disease prevalence rates with two independent partially validated series. Empirical studies show that (i) the Score test produces close-to-nominal level and is preferred in practice; and (ii) the sample size formula based on the Score test is also fairly accurate in terms of the empirical power and type I error rate, and is hence recommended. A real example from an aplastic anemia study is used to illustrate the proposed methodologies.


Assuntos
Biometria/métodos , Doença Enxerto-Hospedeiro/epidemiologia , Humanos , Prevalência , Tamanho da Amostra , Adulto Jovem
17.
J Biopharm Stat ; 22(2): 368-86, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22251180

RESUMO

Investigating the prevalence of a disease is an important topic in medical studies. Such investigations are usually based on the classification results of a group of subjects according to whether they have the disease. To classify subjects, screening tests that are inexpensive and nonintrusive to the test subjects are frequently used to produce results in a timely manner. However, such screening tests may suffer from high levels of misclassification. Although it is often possible to design a gold-standard test or device that is not subject to misclassification, such devices are usually costly and time-consuming, and in some cases intrusive to the test subjects. As a compromise between these two approaches, it is possible to use data that are obtained by the method of double-sampling. In this article, we derive and investigate four test statistics for testing a hypothesis on disease prevalence with double-sampling data. The test statistics are implemented through both the asymptotic method suitable for large samples and approximate unconditional method suitable for small samples. Our simulation results show that the approximate unconditional method usually produces a more satisfactory empirical type I error rate and power than its asymptotic counterpart, especially for small to moderate sample sizes. The results also suggest that the score test and the Wald test based on an estimate of variance with parameters estimated under the null hypothesis outperform the others. An real example is used to illustrate the proposed methods.


Assuntos
Interpretação Estatística de Dados , Epidemiologia/estatística & dados numéricos , Prevalência , Algoritmos , Doença , Estudos Epidemiológicos , Humanos , Funções Verossimilhança , Projetos de Pesquisa , Tamanho da Amostra
18.
Br J Math Stat Psychol ; 63(Pt 1): 17-42, 2010 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-19364445

RESUMO

We develop a method for the analysis of multivariate ordinal categorical data with misclassification based on the latent normal variable approach. Misclassification arises if a subject has been classified into a category that does not truly reflect its actual state, and can occur with one or more variables. A basic framework is developed to enable the analysis of two types of data. The first corresponds to a single sample that is obtained from a fallible design that may lead to misclassified data. The other corresponds to data that is obtained by double sampling. Double sampling data consists of two parts: a sample that is obtained by classifying subjects using the fallible design only and a sample that is obtained by classifying subjects using both fallible and true designs, which is assumed to have no misclassification. A unified expectation-maximization approach is developed to find the maximum likelihood estimate of model parameters. Simulation studies and examples that are based on real data are used to demonstrate the applicability and practicability of the proposed methods.


Assuntos
Classificação/métodos , Simulação por Computador , Funções Verossimilhança , Viés de Seleção , Acidentes/estatística & dados numéricos , Intervalos de Confiança , Coleta de Dados/estatística & dados numéricos , Humanos , Modelos Estatísticos , Análise Multivariada , Psicometria/estatística & dados numéricos
19.
Br J Math Stat Psychol ; 62(Pt 3): 507-27, 2009 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-19055868

RESUMO

A Thurstonian type approach is applied to modelling ranking data with ties. It uses a non-totally differentiable discriminational process instead of the conventional totally differential one to relate the observed rankings and the underlying subjective values. A Monte Carlo expectation-maximization algorithm is proposed to find the maximum likelihood estimates together with the standard errors of the parameters. The approach is examined numerically by means of an artificial example and a simulation study and is applied to a study of attribute assessment.


Assuntos
Técnicas de Apoio para a Decisão , Análise Fatorial , Modelos Estatísticos , Escalas de Valor Relativo , Marketing Social , Estatística como Assunto , Algoritmos , Interpretação Estatística de Dados , Análise Discriminante , Humanos , Funções Verossimilhança , Método de Monte Carlo
20.
Br J Math Stat Psychol ; 61(Pt 1): 49-74, 2008 May.
Artigo em Inglês | MEDLINE | ID: mdl-18482475

RESUMO

Many variables that are used in social and behavioural science research are ordinal categorical or polytomous variables. When more than one polytomous variable is involved in an analysis, observations are classified in a contingency table, and a commonly used statistic for describing the association between two variables is the polychoric correlation. This paper investigates the estimation of the polychoric correlation when the data set consists of misclassified observations. Two approaches for estimating the polychoric correlation have been developed. One assumes that the probabilities in relation to misclassification are known, and the other uses a double sampling scheme to obtain information on misclassification. A parameter estimation procedure is developed, and statistical properties for the estimates are discussed. The practicability and applicability of the proposed approaches are illustrated by analysing data sets that are based on real and generated data. Excel programmes with visual basic for application (VBA) have been developed to compute the estimate of the polychoric correlation and its standard error. The use of the structural equation modelling programme Mx to find parameter estimates in the double sampling scheme is discussed.


Assuntos
Ciências do Comportamento/estatística & dados numéricos , Coleta de Dados/classificação , Computação Matemática , Psicometria/estatística & dados numéricos , Ciências Sociais/estatística & dados numéricos , Software , Coleta de Dados/estatística & dados numéricos , Modelos Estatísticos , Probabilidade , Inquéritos e Questionários
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...