Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 249
Filtrar
1.
Biometrics ; 80(3)2024 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-39329229

RESUMO

The discussions of our paper provide insights into the practical considerations of the latent exchangeability prior while also highlighting further extensions. In this rejoinder, we briefly summarize the discussions and provide comments.


Assuntos
Modelos Estatísticos , Interpretação Estatística de Dados , Humanos , Biometria/história , Biometria/métodos
2.
Biometrics ; 80(3)2024 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-39329230

RESUMO

It is becoming increasingly popular to elicit informative priors on the basis of historical data. Popular existing priors, including the power prior, commensurate prior, and robust meta-analytic predictive prior, provide blanket discounting. Thus, if only a subset of participants in the historical data are exchangeable with the current data, these priors may not be appropriate. In order to combat this issue, propensity score approaches have been proposed. However, these approaches are only concerned with the covariate distribution, whereas exchangeability is typically assessed with parameters pertaining to the outcome. In this paper, we introduce the latent exchangeability prior (LEAP), where observations in the historical data are classified into exchangeable and non-exchangeable groups. The LEAP discounts the historical data by identifying the most relevant subjects from the historical data. We compare our proposed approach against alternative approaches in simulations and present a case study using our proposed prior to augment a control arm in a phase 3 clinical trial in plaque psoriasis with an unbalanced randomization scheme.


Assuntos
Simulação por Computador , Humanos , Modelos Estatísticos , Psoríase , Pontuação de Propensão , Interpretação Estatística de Dados , Biometria/métodos , Ensaios Clínicos Controlados Aleatórios como Assunto/estatística & dados numéricos
3.
J Comput Graph Stat ; 33(2): 638-650, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-39184956

RESUMO

Deep Learning (DL) methods have dramatically increased in popularity in recent years, with significant growth in their application to various supervised learning problems. However, the greater prevalence and complexity of missing data in such datasets present significant challenges for DL methods. Here, we provide a formal treatment of missing data in the context of deeply learned generalized linear models, a supervised DL architecture for regression and classification problems. We propose a new architecture, dlglm, that is one of the first to be able to flexibly account for both ignorable and non-ignorable patterns of missingness in input features and response at training time. We demonstrate through statistical simulation that our method outperforms existing approaches for supervised learning tasks in the presence of missing not at random (MNAR) missingness. We conclude with a case study of the Bank Marketing dataset from the UCI Machine Learning Repository, in which we predict whether clients subscribed to a product based on phone survey data. Supplementary materials for this article are available online.

4.
Biostatistics ; 2024 Apr 26.
Artigo em Inglês | MEDLINE | ID: mdl-38669589

RESUMO

There is an increasing interest in the use of joint models for the analysis of longitudinal and survival data. While random effects models have been extensively studied, these models can be hard to implement and the fixed effect regression parameters must be interpreted conditional on the random effects. Copulas provide a useful alternative framework for joint modeling. One advantage of using copulas is that practitioners can directly specify marginal models for the outcomes of interest. We develop a joint model using a Gaussian copula to characterize the association between multivariate longitudinal and survival outcomes. Rather than using an unstructured correlation matrix in the copula model to characterize dependence structure as is common, we propose a novel decomposition that allows practitioners to impose structure (e.g., auto-regressive) which provides efficiency gains in small to moderate sample sizes and reduces computational complexity. We develop a Markov chain Monte Carlo model fitting procedure for estimation. We illustrate the method's value using a simulation study and present a real data analysis of longitudinal quality of life and disease-free survival data from an International Breast Cancer Study Group trial.

5.
J Biopharm Stat ; : 1-20, 2024 Apr 19.
Artigo em Inglês | MEDLINE | ID: mdl-38639571

RESUMO

There are many Bayesian design methods allowing for the incorporation of historical data for sample size determination (SSD) in situations where the outcome in the historical data is the same as the outcome of a new study. However, there is a dearth of methods supporting the incorporation of data from a previously completed clinical trial that investigated the same or similar treatment as the new trial but had a primary outcome that is different. We propose a simulation-based Bayesian SSD framework using the partial-borrowing scale transformed power prior (straPP). The partial-borrowing straPP is developed by applying a novel scale transformation to a traditional power prior on the parameters from the historical data model to make the information better align with the new data model. The scale transformation is based on the assumption that the standardized parameters (i.e., parameters multiplied by the square roots of their respective Fisher information matrices) are equal. To illustrate the method, we present results from simulation studies that use real data from a previously completed clinical trial to design a new clinical trial with a primary time-to-event endpoint.

6.
bioRxiv ; 2024 Mar 19.
Artigo em Inglês | MEDLINE | ID: mdl-38562795

RESUMO

Tumors are comprised of a mixture of distinct cell populations that differ in terms of genetic makeup and function. Such heterogeneity plays a role in the development of drug resistance and the ineffectiveness of targeted cancer therapies. Insight into this complexity can be obtained through the construction of a phylogenetic tree, which illustrates the evolutionary lineage of tumor cells as they acquire mutations over time. We propose Canopy2, a Bayesian framework that uses single nucleotide variants derived from bulk DNA and single-cell RNA sequencing to infer tumor phylogeny and conduct mutational profiling of tumor subpopulations. Canopy2 uses Markov chain Monte Carlo methods to sample from a joint probability distribution involving a mixture of binomial and beta-binomial distributions, specifically chosen to account for the sparsity and stochasticity of the single-cell data. Canopy2 demystifies the sources of zeros in the single-cell data and separates zeros categorized as non-cancerous (cells without mutations), stochastic (mutations not expressed due to bursting), and technical (expressed mutations not picked up by sequencing). Simulations demonstrate that Canopy2 consistently outperforms competing methods and reconstructs the clonal tree with high fidelity, even in situations involving low sequencing depth, poor single-cell yield, and highly-advanced and polyclonal tumors. We further assess the performance of Canopy2 through application to breast cancer and glioblastoma data, benchmarking against existing methods. Canopy2 is an open-source R package available at https://github.com/annweideman/canopy2.

7.
Biometrics ; 80(1)2024 Jan 29.
Artigo em Inglês | MEDLINE | ID: mdl-38497825

RESUMO

Modern biomedical datasets are increasingly high-dimensional and exhibit complex correlation structures. Generalized linear mixed models (GLMMs) have long been employed to account for such dependencies. However, proper specification of the fixed and random effects in GLMMs is increasingly difficult in high dimensions, and computational complexity grows with increasing dimension of the random effects. We present a novel reformulation of the GLMM using a factor model decomposition of the random effects, enabling scalable computation of GLMMs in high dimensions by reducing the latent space from a large number of random effects to a smaller set of latent factors. We also extend our prior work to estimate model parameters using a modified Monte Carlo Expectation Conditional Minimization algorithm, allowing us to perform variable selection on both the fixed and random effects simultaneously. We show through simulation that through this factor model decomposition, our method can fit high-dimensional penalized GLMMs faster than comparable methods and more easily scale to larger dimensions not previously seen in existing approaches.


Assuntos
Algoritmos , Simulação por Computador , Modelos Lineares , Método de Monte Carlo
8.
Stat Med ; 43(7): 1397-1418, 2024 Mar 30.
Artigo em Inglês | MEDLINE | ID: mdl-38297431

RESUMO

Postmarket drug safety database like vaccine adverse event reporting system (VAERS) collect thousands of spontaneous reports annually, with each report recording occurrences of any adverse events (AEs) and use of vaccines. We hope to identify signal vaccine-AE pairs, for which certain vaccines are statistically associated with certain adverse events (AE), using such data. Thus, the outcomes of interest are multiple AEs, which are binary outcomes and could be correlated because they might share certain latent factors; and the primary covariates are vaccines. Appropriately accounting for the complex correlation among AEs could improve the sensitivity and specificity of identifying signal vaccine-AE pairs. We propose a two-step approach in which we first estimate the shared latent factors among AEs using a working multivariate logistic regression model, and then use univariate logistic regression model to examine the vaccine-AE associations after controlling for the latent factors. Our simulation studies show that this approach outperforms current approaches in terms of sensitivity and specificity. We apply our approach in analyzing VAERS data and report our findings.


Assuntos
Sistemas de Notificação de Reações Adversas a Medicamentos , Vacinas , Humanos , Estados Unidos , Vacinas/efeitos adversos , Bases de Dados Factuais , Simulação por Computador , Software
9.
Genomics Inform ; 21(3): e28, 2023 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-37813624

RESUMO

Mild cognitive impairment (MCI) is a clinical syndrome characterized by the onset and evolution of cognitive impairments, often considered a transitional stage to Alzheimer's disease (AD). The genetic traits of MCI patients who experience a rapid progression to AD can enhance early diagnosis capabilities and facilitate drug discovery for AD. While a genome-wide association study (GWAS) is a standard tool for identifying single nucleotide polymorphisms (SNPs) related to a disease, it fails to detect SNPs with small effect sizes due to stringent control for multiple testing. Additionally, the method does not consider the group structures of SNPs, such as genes or linkage disequilibrium blocks, which can provide valuable insights into the genetic architecture. To address the limitations, we propose a Bayesian bi-level variable selection method that detects SNPs associated with time of conversion from MCI to AD. Our approach integrates group inclusion indicators into an accelerated failure time model to identify important SNP groups. Additionally, we employ data augmentation techniques to impute censored time values using a predictive posterior. We adapt Dirichlet-Laplace shrinkage priors to incorporate the group structure for SNP-level variable selection. In the simulation study, our method outperformed other competing methods regarding variable selection. The analysis of Alzheimer's Disease Neuroimaging Initiative (ADNI) data revealed several genes directly or indirectly related to AD, whereas a classical GWAS did not identify any significant SNPs.

10.
Biostatistics ; 2023 Sep 05.
Artigo em Inglês | MEDLINE | ID: mdl-37669215

RESUMO

In recent years, multi-regional clinical trials (MRCTs) have increased in popularity in the pharmaceutical industry due to their ability to accelerate the global drug development process. To address potential challenges with MRCTs, the International Council for Harmonisation released the E17 guidance document which suggests the use of statistical methods that utilize information borrowing across regions if regional sample sizes are small. We develop an approach that allows for information borrowing via Bayesian model averaging in the context of a joint analysis of survival and longitudinal data from MRCTs. In this novel application of joint models to MRCTs, we use Laplace's method to integrate over subject-specific random effects and to approximate posterior distributions for region-specific treatment effects on the time-to-event outcome. Through simulation studies, we demonstrate that the joint modeling approach can result in an increased rejection rate when testing the global treatment effect compared with methods that analyze survival data alone. We then apply the proposed approach to data from a cardiovascular outcomes MRCT.

11.
J Am Stat Assoc ; 118(541): 3-17, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37153845

RESUMO

Over the past 30 years, magnetic resonance imaging has become a ubiquitous tool for accurately visualizing the change and development of the brain's subcortical structures (e.g., hippocampus). Although subcortical structures act as information hubs of the nervous system, their quantification is still in its infancy due to many challenges in shape extraction, representation, and modeling. Here, we develop a simple and efficient framework of longitudinal elastic shape analysis (LESA) for subcortical structures. Integrating ideas from elastic shape analysis of static surfaces and statistical modeling of sparse longitudinal data, LESA provides a set of tools for systematically quantifying changes of longitudinal subcortical surface shapes from raw structure MRI data. The key novelties of LESA include: (i) it can efficiently represent complex subcortical structures using a small number of basis functions and (ii) it can accurately delineate the spatiotemporal shape changes of the human subcortical structures. We applied LESA to analyze three longitudinal neuroimaging data sets and showcase its wide applications in estimating continuous shape trajectories, building life-span growth patterns, and comparing shape differences among different groups. In particular, with the Alzheimer's Disease Neuroimaging Initiative (ADNI) data, we found that the Alzheimer's Disease (AD) can significantly speed the shape change of ventricle and hippocampus from 60 to 75 years old compared with normal aging.

12.
J Immunol ; 211(2): 219-228, 2023 07 15.
Artigo em Inglês | MEDLINE | ID: mdl-37204246

RESUMO

Previous work from our group and others has shown that patients with breast cancer can generate a T cell response against specific human epidermal growth factor 2 (HER2) epitopes. In addition, preclinical work has shown that this T cell response can be augmented by Ag-directed mAb therapy. This study evaluated the activity and safety of a combination of dendritic cell (DC) vaccination given with mAb and cytotoxic therapy. We performed a phase I/II study using autologous DCs pulsed with two different HER2 peptides given with trastuzumab and vinorelbine to a study cohort of patients with HER2-overexpressing and a second with HER2 nonoverexpressing metastatic breast cancer. Seventeen patients with HER2-overexpressing and seven with nonoverexpressing disease were treated. Treatment was well tolerated, with one patient removed from therapy because of toxicity and no deaths. Forty-six percent of patients had stable disease after therapy, with 4% achieving a partial response and no complete responses. Immune responses were generated in the majority of patients but did not correlate with clinical response. However, in one patient, who has survived >14 y since treatment in the trial, a robust immune response was demonstrated, with 25% of her T cells specific to one of the peptides in the vaccine at the peak of her response. These data suggest that autologous DC vaccination when given with anti-HER2-directed mAb therapy and vinorelbine is safe and can induce immune responses, including significant T cell clonal expansion, in a subset of patients.


Assuntos
Neoplasias da Mama , Neoplasias Mamárias Animais , Humanos , Feminino , Animais , Epitopos/metabolismo , Vinorelbina/metabolismo , Vinorelbina/uso terapêutico , Receptor ErbB-2 , Neoplasias da Mama/metabolismo , Imunoterapia , Peptídeos/metabolismo , Células Dendríticas , Trastuzumab/uso terapêutico , Trastuzumab/metabolismo
13.
Stat Med ; 42(11): 1722-1740, 2023 05 20.
Artigo em Inglês | MEDLINE | ID: mdl-36929939

RESUMO

There has been increased interest in the design and analysis of studies consisting of multiple response variables of mixed types. For example, in clinical trials, it is desirable to establish efficacy for a treatment effect in primary and secondary outcomes. In this article, we develop Bayesian approaches for hypothesis testing and study planning for data consisting of multiple response variables of mixed types with covariates. We assume that the responses are correlated via a Gaussian copula, and that the model for each response is, marginally, a generalized linear model (GLM). Taking a fully Bayesian approach, the proposed method enables inference based on the joint posterior distribution of the parameters. Under some mild conditions, we show that the joint distribution of the posterior probabilities under any Bayesian analysis converges to a Gaussian copula distribution as the sample size tends to infinity. Using this result, we develop an approach to control the type I error rate under multiple testing. Simulation results indicate that the method is more powerful than conducting marginal regression models and correcting for multiplicity using the Bonferroni-Holm Method. We also develop a Bayesian approach to sample size determination in the presence of response variables of mixed types, extending the concept of probability of success (POS) to multiple response variables of mixed types.


Assuntos
Projetos de Pesquisa , Humanos , Teorema de Bayes , Probabilidade , Modelos Lineares , Simulação por Computador
14.
Stat Med ; 42(12): 2009-2026, 2023 05 30.
Artigo em Inglês | MEDLINE | ID: mdl-36974659

RESUMO

We propose a generalized linear low-rank mixed model (GLLRM) for the analysis of both high-dimensional and sparse responses and covariates where the responses may be binary, counts, or continuous. This development is motivated by the problem of identifying vaccine-adverse event associations in post-market drug safety databases, where an adverse event is any untoward medical occurrence or health problem that occurs during or following vaccination. The GLLRM is a generalization of a generalized linear mixed model in that it integrates a factor analysis model to describe the dependence among responses and a low-rank matrix to approximate the high-dimensional regression coefficient matrix. A sampling procedure combining the Gibbs sampler and Metropolis and Gamerman algorithms is employed to obtain posterior estimates of the regression coefficients and other model parameters. Testing of response-covariate pair associations is based on the posterior distribution of the corresponding regression coefficients. Monte Carlo simulation studies are conducted to examine the finite-sample performance of the proposed procedures on binary and count outcomes. We further illustrate the GLLRM via a real data example based on the Vaccine Adverse Event Reporting System.


Assuntos
Vacinas , Humanos , Teorema de Bayes , Modelos Lineares , Vacinas/efeitos adversos , Simulação por Computador , Algoritmos
15.
Stat Med ; 42(9): 1308-1322, 2023 04 30.
Artigo em Inglês | MEDLINE | ID: mdl-36696954

RESUMO

Competing risks survival data in the presence of partially masked causes are frequently encountered in medical research or clinical trials. When longitudinal biomarkers are also available, it is of great clinical importance to examine associations between the longitudinal biomarkers and the cause-specific survival outcomes. In this article, we propose a cause-specific C-index for joint models of longitudinal and competing risks survival data accounting for masked causes. We also develop a posterior predictive algorithm for computing the out-of-sample cause-specific C-index using Markov chain Monte Carlo samples from the joint posterior of the in-sample longitudinal and competing risks survival data. We further construct the Δ $$ \Delta $$ C-index to quantify the strength of association between the longitudinal and cause-specific survival data, or between the out-of-sample longitudinal and survival data. Empirical performance of the proposed assessment criteria is examined through an extensive simulation study. An in-depth analysis of the real data from large cancer prevention trials is carried out to demonstrate the usefulness of the proposed methodology.


Assuntos
Pesquisa Biomédica , Modelos Estatísticos , Humanos , Análise de Sobrevida , Simulação por Computador , Causalidade , Modelos de Riscos Proporcionais , Estudos Longitudinais
16.
Biostatistics ; 24(2): 262-276, 2023 04 14.
Artigo em Inglês | MEDLINE | ID: mdl-34296263

RESUMO

Multiregional clinical trials (MRCTs) provide the benefit of more rapidly introducing drugs to the global market; however, small regional sample sizes can lead to poor estimation quality of region-specific effects when using current statistical methods. With the publication of the International Conference for Harmonisation E17 guideline in 2017, the MRCT design is recognized as a viable strategy that can be accepted by regional regulatory authorities, necessitating new statistical methods that improve the quality of region-specific inference. In this article, we develop a novel methodology for estimating region-specific and global treatment effects for MRCTs using Bayesian model averaging. This approach can be used for trials that compare two treatment groups with respect to a continuous outcome, and it allows for the incorporation of patient characteristics through the inclusion of covariates. We propose an approach that uses posterior model probabilities to quantify evidence in favor of consistency of treatment effects across all regions, and this metric can be used by regulatory authorities for drug approval. We show through simulations that the proposed modeling approach results in lower MSE than a fixed-effects linear regression model and better control of type I error rates than a Bayesian hierarchical model.


Assuntos
Aprovação de Drogas , Projetos de Pesquisa , Humanos , Teorema de Bayes , Resultado do Tratamento , Tamanho da Amostra , Probabilidade
17.
Lifetime Data Anal ; 29(1): 213-233, 2023 01.
Artigo em Inglês | MEDLINE | ID: mdl-36357647

RESUMO

For clinical trial design and analysis, there has been extensive work related to using joint models for longitudinal and time-to-event data without a cure fraction (i.e., when all patients are at risk for the event of interest), but comparatively little treatment has been given to design and analysis of clinical trials using joint models that incorporate a cure fraction. In this paper, we develop a Bayesian clinical trial design methodology focused on evaluating the treatment's effect on a time-to-event endpoint using a promotion time cure rate model, where the longitudinal process is incorporated into the hazard model for the promotion times. A piecewise linear hazard model for the period after assessment of the longitudinal measure ends is proposed as an alternative to extrapolating the longitudinal trajectory. This may be advantageous in scenarios where the period of time from the end of longitudinal measurements until the end of observation is substantial. Inference for the time-to-event endpoint is based on a novel estimand which combines the treatment's effect on the probability of cure and its effect on the promotion time distribution, mediated by the longitudinal outcome. We propose an approach for sample size determination such that the design has a high power and a well-controlled type I error rate with both operating characteristics defined from a Bayesian perspective. We demonstrate the methodology by designing a breast cancer clinical trial with a primary time-to-event endpoint where longitudinal outcomes are measured periodically during follow up.


Assuntos
Modelos Estatísticos , Humanos , Teorema de Bayes , Estudos Longitudinais , Tamanho da Amostra , Modelos Lineares
18.
Biometrics ; 79(2): 854-865, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-34921386

RESUMO

Human tissue samples are often mixtures of heterogeneous cell types, which can confound the analyses of gene expression data derived from such tissues. The cell type composition of a tissue sample may itself be of interest and is needed for proper analysis of differential gene expression. A variety of computational methods have been developed to estimate cell type proportions using gene-level expression data. However, RNA isoforms can also be differentially expressed across cell types, and isoform-level expression could be equally or more informative for determining cell type origin than gene-level expression. We propose a new computational method, IsoDeconvMM, which estimates cell type fractions using isoform-level gene expression data. A novel and useful feature of IsoDeconvMM is that it can estimate cell type proportions using only a single gene, though in practice we recommend aggregating estimates of a few dozen genes to obtain more accurate results. We demonstrate the performance of IsoDeconvMM using a unique data set with cell type-specific RNA-seq data across more than 135 individuals. This data set allows us to evaluate different methods given the biological variation of cell type-specific gene expression data across individuals. We further complement this analysis with additional simulations.


Assuntos
Perfilação da Expressão Gênica , Humanos , Isoformas de Proteínas/genética , Análise de Sequência de RNA
19.
Stat Med ; 42(1): 1-14, 2023 01 15.
Artigo em Inglês | MEDLINE | ID: mdl-36318875

RESUMO

We develop the scale transformed power prior for settings where historical and current data involve different data types, such as binary and continuous data. This situation arises often in clinical trials, for example, when historical data involve binary responses and the current data involve some other type of continuous or discrete outcome. The power prior, proposed by Ibrahim and Chen, does not address the issue of different data types. Herein, we develop a new type of power prior, which we call the scale transformed power prior (straPP). The straPP is constructed by transforming the power prior for the historical data by rescaling the parameter using a function of the Fisher information matrices for the historical and current data models, thereby shifting the scale of the parameter vector from that of the historical to that of the current data. Examples are presented to motivate the need for such a transformation, and simulation studies are presented to illustrate the performance advantages of the straPP over the power prior and other informative and noninformative priors. A real dataset from a clinical trial undertaken to study a novel transitional care model for stroke survivors is used to illustrate the methodology.


Assuntos
Modelos Estatísticos , Projetos de Pesquisa , Humanos , Teorema de Bayes , Simulação por Computador
20.
Biostatistics ; 24(4): 866-884, 2023 10 18.
Artigo em Inglês | MEDLINE | ID: mdl-35851911

RESUMO

Joint models for recurrent event and terminating event data are increasingly used for the analysis of clinical trials. However, few methods have been proposed for designing clinical trials using these models. In this article, we develop a Bayesian clinical trial design methodology focused on evaluating the effect of an investigational product (IP) on both recurrent event and terminating event processes considered as multiple primary endpoints, using a multifrailty joint model. Dependence between the recurrent and terminating event processes is accounted for using a shared frailty. Inferences for the multiple primary outcomes are based on posterior model probabilities corresponding to mutually exclusive hypotheses regarding the benefit of IP with respect to the recurrent and terminating event processes. We propose an approach for sample size determination to ensure the trial design has a high power and a well-controlled type I error rate, with both operating characteristics defined from a Bayesian perspective. We also consider a generalization of the proposed parametric model that uses a nonparametric mixture of Dirichlet processes to model the frailty distributions and compare its performance to the proposed approach. We demonstrate the methodology by designing a colorectal cancer clinical trial with a goal of demonstrating that the IP causes a favorable effect on at least one of the two outcomes but no harm on either.


Assuntos
Fragilidade , Neoplasias Primárias Múltiplas , Humanos , Teorema de Bayes , Tamanho da Amostra , Modelos Estatísticos , Simulação por Computador
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA