Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Resultados 1 - 20 de 43
Filtrar
1.
Clin Infect Dis ; 2024 Jun 02.
Artículo en Inglés | MEDLINE | ID: mdl-38824440

RESUMEN

Data on alcohol use and incident Tuberculosis (TB) infection are needed. In adults aged 15+ in rural Uganda (N=49,585), estimated risk of incident TB infection was 29.2% with alcohol use vs. 19.2% without (RR: 1.49; 95%CI: 1.40-1.60). There is potential for interventions to interrupt transmission among people who drink alcohol.

2.
Biostatistics ; 24(2): 502-517, 2023 04 14.
Artículo en Inglés | MEDLINE | ID: mdl-34939083

RESUMEN

Cluster randomized trials (CRTs) randomly assign an intervention to groups of individuals (e.g., clinics or communities) and measure outcomes on individuals in those groups. While offering many advantages, this experimental design introduces challenges that are only partially addressed by existing analytic approaches. First, outcomes are often missing for some individuals within clusters. Failing to appropriately adjust for differential outcome measurement can result in biased estimates and inference. Second, CRTs often randomize limited numbers of clusters, resulting in chance imbalances on baseline outcome predictors between arms. Failing to adaptively adjust for these imbalances and other predictive covariates can result in efficiency losses. To address these methodological gaps, we propose and evaluate a novel two-stage targeted minimum loss-based estimator to adjust for baseline covariates in a manner that optimizes precision, after controlling for baseline and postbaseline causes of missing outcomes. Finite sample simulations illustrate that our approach can nearly eliminate bias due to differential outcome measurement, while existing CRT estimators yield misleading results and inferences. Application to real data from the SEARCH community randomized trial demonstrates the gains in efficiency afforded through adaptive adjustment for baseline covariates, after controlling for missingness on individual-level outcomes.


Asunto(s)
Evaluación de Resultado en la Atención de Salud , Proyectos de Investigación , Humanos , Ensayos Clínicos Controlados Aleatorios como Asunto , Probabilidad , Sesgo , Análisis por Conglomerados , Simulación por Computador
3.
Biostatistics ; 2023 Aug 02.
Artículo en Inglés | MEDLINE | ID: mdl-37531621

RESUMEN

Cluster randomized trials (CRTs) often enroll large numbers of participants; yet due to resource constraints, only a subset of participants may be selected for outcome assessment, and those sampled may not be representative of all cluster members. Missing data also present a challenge: if sampled individuals with measured outcomes are dissimilar from those with missing outcomes, unadjusted estimates of arm-specific endpoints and the intervention effect may be biased. Further, CRTs often enroll and randomize few clusters, limiting statistical power and raising concerns about finite sample performance. Motivated by SEARCH-TB, a CRT aimed at reducing incident tuberculosis infection, we demonstrate interlocking methods to handle these challenges. First, we extend Two-Stage targeted minimum loss-based estimation to account for three sources of missingness: (i) subsampling; (ii) measurement of baseline status among those sampled; and (iii) measurement of final status among those in the incidence cohort (persons known to be at risk at baseline). Second, we critically evaluate the assumptions under which subunits of the cluster can be considered the conditionally independent unit, improving precision and statistical power but also causing the CRT to behave like an observational study. Our application to SEARCH-TB highlights the real-world impact of different assumptions on measurement and dependence; estimates relying on unrealistic assumptions suggested the intervention increased the incidence of TB infection by 18% (risk ratio [RR]=1.18, 95% confidence interval [CI]: 0.85-1.63), while estimates accounting for the sampling scheme, missingness, and within community dependence found the intervention decreased the incident TB by 27% (RR=0.73, 95% CI: 0.57-0.92).

4.
Biometrics ; 80(1)2024 Jan 29.
Artículo en Inglés | MEDLINE | ID: mdl-38446441

RESUMEN

Benkeser et al. demonstrate how adjustment for baseline covariates in randomized trials can meaningfully improve precision for a variety of outcome types. Their findings build on a long history, starting in 1932 with R.A. Fisher and including more recent endorsements by the U.S. Food and Drug Administration and the European Medicines Agency. Here, we address an important practical consideration: how to select the adjustment approach-which variables and in which form-to maximize precision, while maintaining Type-I error control. Balzer et al. previously proposed Adaptive Pre-specification within TMLE to flexibly and automatically select, from a prespecified set, the approach that maximizes empirical efficiency in small trials (N < 40). To avoid overfitting with few randomized units, selection was previously limited to working generalized linear models, adjusting for a single covariate. Now, we tailor Adaptive Pre-specification to trials with many randomized units. Using V-fold cross-validation and the estimated influence curve-squared as the loss function, we select from an expanded set of candidates, including modern machine learning methods adjusting for multiple covariates. As assessed in simulations exploring a variety of data-generating processes, our approach maintains Type-I error control (under the null) and offers substantial gains in precision-equivalent to 20%-43% reductions in sample size for the same statistical power. When applied to real data from ACTG Study 175, we also see meaningful efficiency improvements overall and within subgroups.


Asunto(s)
Aprendizaje Automático , Proyectos de Investigación , Estados Unidos , Ensayos Clínicos Controlados Aleatorios como Asunto , Modelos Lineales , Tamaño de la Muestra
5.
Biometrics ; 80(1)2024 Jan 29.
Artículo en Inglés | MEDLINE | ID: mdl-38281772

RESUMEN

Strategic test allocation is important for control of both emerging and existing pandemics (eg, COVID-19, HIV). It supports effective epidemic control by (1) reducing transmission via identifying cases and (2) tracking outbreak dynamics to inform targeted interventions. However, infectious disease surveillance presents unique statistical challenges. For instance, the true outcome of interest (positive infection status) is often a latent variable. In addition, presence of both network and temporal dependence reduces data to a single observation. In this work, we study an adaptive sequential design, which allows for unspecified dependence among individuals and across time. Our causal parameter is the mean latent outcome we would have obtained, if, starting at time t given the observed past, we had carried out a stochastic intervention that maximizes the outcome under a resource constraint. The key strength of the method is that we do not have to model network and time dependence: a short-term performance Online Super Learner is used to select among dependence models and randomization schemes. The proposed strategy learns the optimal choice of testing over time while adapting to the current state of the outbreak and learning across samples, through time, or both. We demonstrate the superior performance of the proposed strategy in an agent-based simulation modeling a residential university environment during the COVID-19 pandemic.


Asunto(s)
COVID-19 , Enfermedades Transmisibles , Humanos , Pandemias/prevención & control , COVID-19/epidemiología , Simulación por Computador , Brotes de Enfermedades
6.
J Biopharm Stat ; : 1-19, 2024 May 02.
Artículo en Inglés | MEDLINE | ID: mdl-38695298

RESUMEN

In the drug development for rare disease, the number of treated subjects in the clinical trial is often very small, whereas the number of external controls can be relatively large. There is no clear guidance on choosing an appropriate statistical method to control baseline confounding in this situation. To fill this gap, we conduct extensive simulations to evaluate the performance of commonly used matching and weighting methods as well as the more recently developed targeted maximum likelihood estimation (TMLE) and cardinality matching in small sample settings, mimicking the motivating data from a pediatric rare disease. Among the methods examined, the performance of coarsened exact matching (CEM) and TMLE are relatively robust under various model specifications. CEM is only feasible when the number of controls far exceeds the number of treated, whereas TMLE has better performance with less extreme treatment allocation ratios. Our simulations suggest bootstrap is useful for variance estimation in small samples after matching.

7.
J Gen Intern Med ; 38(4): 954-960, 2023 03.
Artículo en Inglés | MEDLINE | ID: mdl-36175761

RESUMEN

BACKGROUND: Low-value healthcare is costly and inefficient and may adversely affect patient outcomes. Despite increases in low-value service use, little is known about how the receipt of low-value care differs across payers. OBJECTIVE: To evaluate differences in the use of low-value care between patients with commercial versus Medicaid coverage. DESIGN: Retrospective observational analysis of the 2017 Rhode Island All-payer Claims Database, estimating the probability of receiving each of 14 low-value services between commercial and Medicaid enrollees, adjusting for patient sociodemographic and clinical characteristics. Ensemble machine learning minimized the possibility of model misspecification. PARTICIPANTS: Medicaid and commercial enrollees aged 18-64 with continuous coverage and an encounter at which they were at risk of receiving a low-value service. INTERVENTION: Enrollment in Medicaid or Commercial insurance. MAIN MEASURES: Use of one of 14 validated measures of low-value care. KEY RESULTS: Among 110,609 patients, Medicaid enrollees were younger, had more comorbidities, and were more likely to be female than commercial enrollees. Medicaid enrollees had higher rates of use for 7 low-value care measures, and those with commercial coverage had higher rates for 5 measures. Across all measures of low-value care, commercial enrollees received more (risk difference [RD] 6.8 percentage points; CI: 6.6 to 7.0) low-value services than their counterparts with Medicaid. Commercial enrollees were also more likely to receive low-value services typically performed in the emergency room (RD 11.4 percentage points; CI: 10.7 to 12.2) and services that were less expensive (RD 15.3 percentage points; CI 14.6 to 16.0). CONCLUSION: Differences in the provision of low-value care varied across measures, though average use was slightly higher among commercial than Medicaid enrollees. This difference was more pronounced for less expensive services indicating that financial incentives may not be the sole driver of low-value care.


Asunto(s)
Atención de Bajo Valor , Medicaid , Estados Unidos/epidemiología , Humanos , Femenino , Masculino , Estudios Retrospectivos , Atención a la Salud , Rhode Island
8.
BMC Med Res Methodol ; 23(1): 178, 2023 08 02.
Artículo en Inglés | MEDLINE | ID: mdl-37533017

RESUMEN

BACKGROUND: The Targeted Learning roadmap provides a systematic guide for generating and evaluating real-world evidence (RWE). From a regulatory perspective, RWE arises from diverse sources such as randomized controlled trials that make use of real-world data, observational studies, and other study designs. This paper illustrates a principled approach to assessing the validity and interpretability of RWE. METHODS: We applied the roadmap to a published observational study of the dose-response association between ritodrine hydrochloride and pulmonary edema among women pregnant with twins in Japan. The goal was to identify barriers to causal effect estimation beyond unmeasured confounding reported by the study's authors, and to explore potential options for overcoming the barriers that robustify results. RESULTS: Following the roadmap raised issues that led us to formulate alternative causal questions that produced more reliable, interpretable RWE. The process revealed a lack of information in the available data to identify a causal dose-response curve. However, under explicit assumptions the effect of treatment with any amount of ritodrine versus none, albeit a less ambitious parameter, can be estimated from data. CONCLUSIONS: Before RWE can be used in support of clinical and regulatory decision-making, its quality and reliability must be systematically evaluated. The TL roadmap prescribes how to carry out a thorough, transparent, and realistic assessment of RWE. We recommend this approach be a routine part of any decision-making process.


Asunto(s)
Proyectos de Investigación , Femenino , Humanos , Reproducibilidad de los Resultados , Japón , Ensayos Clínicos Controlados Aleatorios como Asunto
9.
BMC Public Health ; 23(1): 1152, 2023 06 15.
Artículo en Inglés | MEDLINE | ID: mdl-37316852

RESUMEN

BACKGROUND: Hypertension (HTN) and diabetes mellitus (DM) as part of non-communicable diseases are among the most common causes of death worldwide, especially in the WHO's Eastern Mediterranean Region (EMR). The family physician program (FPP) proposed by WHO is a health strategy to provide primary health care and improve the community's awareness of non-communicable diseases. Since there was no clear focus on the causal effect of FPP on the prevalence, screening, and awareness of HTN and DM, the primary objective of this study is to determine the causal effect of FPP on these factors in Iran, which is an EMR country. METHODS: We conducted a repeated cross-sectional design based on two independent surveys of 42,776 adult participants in 2011 and 2016, of which 2301 individuals were selected from two regions where the family physician program was implemented (FPP) and where it wasn't (non-FPP). We used an Inverse Probability Weighting difference-in-differences and Targeted Maximum Likelihood Estimation analysis to estimate the average treatment effects on treated (ATT) using R version 4.1.1. RESULTS: The FPP implementation increased the screening (ATT = 36%, 95% CI: (27%, 45%), P-value < 0.001) and the control of hypertension (ATT = 26%, 95% CI: (1%, 52%), P-value = 0.03) based on 2017 ACC/AHA guidelines that these results were in keeping with JNC7. There was no causal effect in other indexes, such as prevalence, awareness, and treatment. The DM screening (ATT = 20%, 95% CI: (6%, 34%), P-value = 0.004) and awareness (ATT = 14%, 95% CI: (1%, 27%), P-value = 0.042) were significantly increased among FPP administered region. However, the treatment of HTN decreased (ATT = -32%, 95% CI: (-59%, -5%), P-value = 0.012). CONCLUSION: This study has identified some limitations related to the FPP in managing HTN and DM, and presented solutions to solve them in two general categories. Thus, we recommend that the FPP be revised before the generalization of the program to other parts of Iran.


Asunto(s)
Diabetes Mellitus , Hipertensión , Enfermedades no Transmisibles , Adulto , Humanos , Prevalencia , Estudios Transversales , Médicos de Familia , Diabetes Mellitus/diagnóstico , Diabetes Mellitus/epidemiología , Diabetes Mellitus/terapia , Hipertensión/diagnóstico , Hipertensión/epidemiología , Hipertensión/prevención & control , Región Mediterránea
10.
Lifetime Data Anal ; 2022 Nov 07.
Artículo en Inglés | MEDLINE | ID: mdl-36336732

RESUMEN

Targeted maximum likelihood estimation (TMLE) provides a general methodology for estimation of causal parameters in presence of high-dimensional nuisance parameters. Generally, TMLE consists of a two-step procedure that combines data-adaptive nuisance parameter estimation with semiparametric efficiency and rigorous statistical inference obtained via a targeted update step. In this paper, we demonstrate the practical applicability of TMLE based causal inference in survival and competing risks settings where event times are not confined to take place on a discrete and finite grid. We focus on estimation of causal effects of time-fixed treatment decisions on survival and absolute risk probabilities, considering different univariate and multidimensional parameters. Besides providing a general guidance to using TMLE for survival and competing risks analysis, we further describe how the previous work can be extended with the use of loss-based cross-validated estimation, also known as super learning, of the conditional hazards. We illustrate the usage of the considered methods using publicly available data from a trial on adjuvant chemotherapy for colon cancer. R software code to implement all considered algorithms and to reproduce all analyses is available in an accompanying online appendix on Github.

11.
Entropy (Basel) ; 24(8)2022 Jul 31.
Artículo en Inglés | MEDLINE | ID: mdl-36010724

RESUMEN

Although causal inference has shown great value in estimating effect sizes in, for instance, physics, medical studies, and economics, it is rarely used in sports science. Targeted Maximum Likelihood Estimation (TMLE) is a modern method for performing causal inference. TMLE is forgiving in the misspecification of the causal model and improves the estimation of effect sizes using machine-learning methods. We demonstrate the advantage of TMLE in sports science by comparing the calculated effect size with a Generalized Linear Model (GLM). In this study, we introduce TMLE and provide a roadmap for making causal inference and apply the roadmap along with the methods mentioned above in a simulation study and case study investigating the influence of substitutions on the physical performance of the entire soccer team (i.e., the effect size of substitutions on the total physical performance). We construct a causal model, a misspecified causal model, a simulation dataset, and an observed tracking dataset of individual players from 302 elite soccer matches. The simulation dataset results show that TMLE outperforms GLM in estimating the effect size of the substitutions on the total physical performance. Furthermore, TMLE is most robust against model misspecification in both the simulation and the tracking dataset. However, independent of the method used in the tracking dataset, it was found that substitutes increase the physical performance of the entire soccer team.

12.
Am J Epidemiol ; 190(8): 1519-1532, 2021 08 01.
Artículo en Inglés | MEDLINE | ID: mdl-33576383

RESUMEN

Rapid initiation of antiretroviral therapy (ART) is recommended for people living with human immunodeficiency virus (HIV), with the option to start treatment on the day of diagnosis (same-day ART). However, the effect of same-day ART remains unknown in realistic public sector settings. We established a cohort of ≥16-year-old patients who initiated first-line ART under a treat-all policy in Nhlangano (Eswatini) during 2014-2016, either on the day of HIV care enrollment (same-day ART) or 1-14 days thereafter (early ART). Directed acyclic graphs, flexible parametric survival analysis, and targeted maximum likelihood estimation (TMLE) were used to estimate the effect of same-day-ART initiation on a composite unfavorable treatment outcome (loss to follow-up, death, viral failure, treatment switch). Of 1,328 patients, 839 (63.2%) initiated same-day ART. The adjusted hazard ratio of the unfavorable outcome was higher, 1.48 (95% confidence interval: 1.16, 1.89), for same-day ART compared with early ART. TMLE suggested that after 1 year, 28.9% of patients would experience the unfavorable outcome under same-day ART compared with 21.2% under early ART (difference: 7.7%; 1.3%-14.1%). This estimate was driven by loss to follow-up and varied over time, with a higher hazard during the first year after HIV care enrollment and a similar hazard thereafter. We found an increased risk with same-day ART. A limitation was that possible silent transfers that were not captured.


Asunto(s)
Fármacos Anti-VIH/uso terapéutico , Infecciones por VIH/tratamiento farmacológico , Infecciones por VIH/epidemiología , Tuberculosis/epidemiología , Adolescente , Adulto , Antituberculosos/uso terapéutico , Esuatini , Femenino , Humanos , Masculino , Persona de Mediana Edad , Pacientes Desistentes del Tratamiento , Políticas , Sector Público , Estudios Retrospectivos , Análisis de Supervivencia , Tiempo de Tratamiento , Tuberculosis/tratamiento farmacológico , Organización Mundial de la Salud , Adulto Joven
13.
Am J Epidemiol ; 2021 Jul 15.
Artículo en Inglés | MEDLINE | ID: mdl-34268553

RESUMEN

In this issue, Naimi et al. (Am J Epidemiol. XXXX;XXX(XX):XXXX-XXXX) discuss a critical topic in public health and beyond: obtaining valid statistical inference when using machine learning in causal research. In doing so, the authors review recent prominent methodological work and recommend: (i) double robust estimators, such as targeted maximum likelihood estimation (TMLE); (ii) ensemble methods, such as Super Learner, to combine predictions from a diverse library of algorithms, and (iii) sample-splitting to reduce bias and improve inference. We largely agree with these recommendations. In this commentary, we highlight the critical importance of the Super Learner library. Specifically, in both simulation settings considered by the authors, we demonstrate that low bias and valid statistical inference can be achieved using TMLE without sample-splitting and with a Super Learner library that excludes tree-based methods but includes regression splines. Whether extremely data-adaptive algorithms and sample-splitting are needed depends on the specific problem and should be informed by simulations reflecting the specific application. More research is needed on practical recommendations for selecting among these options in common situations arising in epidemiology.

14.
BMC Public Health ; 21(1): 1642, 2021 09 08.
Artículo en Inglés | MEDLINE | ID: mdl-34496810

RESUMEN

BACKGROUND: Epidemiological theory and many empirical studies support the hypothesis that there is a protective effect of male circumcision against some sexually transmitted infections (STIs). However, there is a paucity of randomized control trials (RCTs) to test this hypothesis in the South African population. Due to the infeasibility of conducting RCTs, estimating marginal or average treatment effects with observational data increases interest. Using targeted maximum likelihood estimation (TMLE), a doubly robust estimation technique, we aim to provide evidence of an association between medical male circumcision (MMC) and two STI outcomes. METHODS: HIV and HSV-2 status were the two primary outcomes for this study. We investigated the associations between MMC and these STI outcomes, using cross-sectional data from the HIV Incidence Provincial Surveillance System (HIPSS) study in KwaZulu-Natal, South Africa. HIV antibodies were tested from the blood samples collected in the study. For HSV-2, serum samples were tested for HSV-2 antibodies via an ELISA-based anti-HSV-2 IgG. We estimated marginal prevalence ratios (PR) using TMLE and compared estimates with those from propensity score full matching (PSFM) and inverse probability of treatment weighting (IPTW). RESULTS: From a total 2850 male participants included in the analytic sample, the overall weighted prevalence of HIV was 32.4% (n = 941) and HSV-2 was 53.2% (n = 1529). TMLE estimates suggest that MMC was associated with 31% lower HIV prevalence (PR: 0.690; 95% CI: 0.614, 0.777) and 21.1% lower HSV-2 prevalence (PR: 0.789; 95% CI: 0.734, 0.848). The propensity score analyses also provided evidence of association of MMC with lower prevalence of HIV and HSV-2. For PSFM: HIV (PR: 0.689; 95% CI: 0.537, 0.885), and HSV-2 (PR: 0.832; 95% CI: 0.709, 0.975). For IPTW: HIV (PR: 0.708; 95% CI: 0.572, 0.875), and HSV-2 (PR: 0.837; 95% CI: 0.738, 0.949). CONCLUSION: Using a TMLE approach, we present further evidence of a protective association of MMC against HIV and HSV-2 in this hyper-endemic South African setting. TMLE has the potential to enhance the evidence base for recommendations that embrace the effect of public health interventions on health or disease outcomes.


Asunto(s)
Circuncisión Masculina , Infecciones por VIH , Enfermedades de Transmisión Sexual , Infecciones por VIH/epidemiología , Infecciones por VIH/prevención & control , Humanos , Funciones de Verosimilitud , Masculino , Prevalencia , Enfermedades de Transmisión Sexual/epidemiología , Enfermedades de Transmisión Sexual/prevención & control , Sudáfrica/epidemiología
15.
BMC Public Health ; 21(1): 1219, 2021 06 24.
Artículo en Inglés | MEDLINE | ID: mdl-34167500

RESUMEN

OBJECTIVES: The relationship between reproductive factors and breast cancer (BC) risk has been investigated in previous studies. Considering the discrepancies in the results, the aim of this study was to estimate the causal effect of reproductive factors on BC risk in a case-control study using the double robust approach of targeted maximum likelihood estimation. METHODS: This is a causal reanalysis of a case-control study done between 2005 and 2008 in Shiraz, Iran, in which 787 confirmed BC cases and 928 controls were enrolled. Targeted maximum likelihood estimation along with super Learner were used to analyze the data, and risk ratio (RR), risk difference (RD), andpopulation attributable fraction (PAF) were reported. RESULTS: Our findings did not support parity and age at the first pregnancy as risk factors for BC. The risk of BC was higher among postmenopausal women (RR = 3.3, 95% confidence interval (CI) = (2.3, 4.6)), women with the age at first marriage ≥20 years (RR = 1.6, 95% CI = (1.3, 2.1)), and the history of oral contraceptive (OC) use (RR = 1.6, 95% CI = (1.3, 2.1)) or breastfeeding duration ≤60 months (RR = 1.8, 95% CI = (1.3, 2.5)). The PAF for menopause status, breastfeeding duration, and OC use were 40.3% (95% CI = 39.5, 40.6), 27.3% (95% CI = 23.1, 30.8) and 24.4% (95% CI = 10.5, 35.5), respectively. CONCLUSIONS: Postmenopausal women, and women with a higher age at first marriage, shorter duration of breastfeeding, and history of OC use are at the higher risk of BC.


Asunto(s)
Neoplasias de la Mama , Neoplasias de la Mama/epidemiología , Neoplasias de la Mama/etiología , Estudios de Casos y Controles , Femenino , Humanos , Irán/epidemiología , Funciones de Verosimilitud , Paridad , Embarazo , Historia Reproductiva , Factores de Riesgo
16.
Stat Med ; 39(27): 4069-4085, 2020 11 30.
Artículo en Inglés | MEDLINE | ID: mdl-32875627

RESUMEN

In longitudinal settings, causal inference methods usually rely on a discretization of the patient timeline that may not reflect the underlying data generation process. This article investigates the estimation of causal parameters under discretized data. It presents the implicit assumptions practitioners make but do not acknowledge when discretizing data to assess longitudinal causal parameters. We illustrate that differences in point estimates under different discretizations are due to the data coarsening resulting in both a modified definition of the parameter of interest and loss of information about time-dependent confounders. We further investigate several tools to advise analysts in selecting a timeline discretization for use with pooled longitudinal targeted maximum likelihood estimation for the estimation of the parameters of a marginal structural model. We use a simulation study to empirically evaluate bias at different discretizations and assess the use of the cross-validated variance as a measure of data support to select a discretization under a chosen data coarsening mechanism. We then apply our approach to a study on the relative effect of alternative asthma treatments during pregnancy on pregnancy duration. The results of the simulation study illustrate how coarsening changes the target parameter of interest as well as how it may create bias due to a lack of appropriate control for time-dependent confounders. We also observe evidence that the cross-validated variance acts well as a measure of support in the data, by being minimized at finer discretizations as the sample size increases.


Asunto(s)
Causalidad , Sesgo , Simulación por Computador , Humanos
17.
Stat Med ; 38(24): 4888-4911, 2019 10 30.
Artículo en Inglés | MEDLINE | ID: mdl-31436859

RESUMEN

Longitudinal targeted maximum likelihood estimation (LTMLE) has very rarely been used to estimate dynamic treatment effects in the context of time-dependent confounding affected by prior treatment when faced with long follow-up times, multiple time-varying confounders, and complex associational relationships simultaneously. Reasons for this include the potential computational burden, technical challenges, restricted modeling options for long follow-up times, and limited practical guidance in the literature. However, LTMLE has desirable asymptotic properties, ie, it is doubly robust, and can yield valid inference when used in conjunction with machine learning. It also has the advantage of easy-to-calculate analytic standard errors in contrast to the g-formula, which requires bootstrapping. We use a topical and sophisticated question from HIV treatment research to show that LTMLE can be used successfully in complex realistic settings, and we compare results to competing estimators. Our example illustrates the following practical challenges common to many epidemiological studies: (1) long follow-up time (30 months); (2) gradually declining sample size; (3) limited support for some intervention rules of interest; (4) a high-dimensional set of potential adjustment variables, increasing both the need and the challenge of integrating appropriate machine learning methods; and (5) consideration of collider bias. Our analyses, as well as simulations, shed new light on the application of LTMLE in complex and realistic settings: We show that (1) LTMLE can yield stable and good estimates, even when confronted with small samples and limited modeling options; (2) machine learning utilized with a small set of simple learners (if more complex ones cannot be fitted) can outperform a single, complex model, which is tailored to incorporate prior clinical knowledge; and (3) performance can vary considerably depending on interventions and their support in the data, and therefore critical quality checks should accompany every LTMLE analysis. We provide guidance for the practical application of LTMLE.


Asunto(s)
Fármacos Anti-VIH/administración & dosificación , Infecciones por VIH/tratamiento farmacológico , Funciones de Verosimilitud , Causalidad , Niño , Simulación por Computador , Factores de Confusión Epidemiológicos , Infecciones por VIH/epidemiología , Humanos , Tamaño de la Muestra
18.
Biometrics ; 74(2): 389-398, 2018 06.
Artículo en Inglés | MEDLINE | ID: mdl-29096036

RESUMEN

To unbiasedly estimate a causal effect on an outcome unconfoundedness is often assumed. If there is sufficient knowledge on the underlying causal structure then existing confounder selection criteria can be used to select subsets of the observed pretreatment covariates, X, sufficient for unconfoundedness, if such subsets exist. Here, estimation of these target subsets is considered when the underlying causal structure is unknown. The proposed method is to model the causal structure by a probabilistic graphical model, for example, a Markov or Bayesian network, estimate this graph from observed data and select the target subsets given the estimated graph. The approach is evaluated by simulation both in a high-dimensional setting where unconfoundedness holds given X and in a setting where unconfoundedness only holds given subsets of X. Several common target subsets are investigated and the selected subsets are compared with respect to accuracy in estimating the average causal effect. The proposed method is implemented with existing software that can easily handle high-dimensional data, in terms of large samples and large number of covariates. The results from the simulation study show that, if unconfoundedness holds given X, this approach is very successful in selecting the target subsets, outperforming alternative approaches based on random forests and LASSO, and that the subset estimating the target subset containing all causes of outcome yields smallest MSE in the average causal effect estimation.


Asunto(s)
Biometría/métodos , Factores de Confusión Epidemiológicos , Probabilidad , Teorema de Bayes , Causalidad , Simulación por Computador , Cadenas de Markov , Modelos Estadísticos
19.
Stat Med ; 37(4): 530-543, 2018 02 20.
Artículo en Inglés | MEDLINE | ID: mdl-29094375

RESUMEN

Causal inference practitioners are routinely presented with the challenge of model selection and, in particular, reducing the size of the covariate set with the goal of improving estimation efficiency. Collaborative targeted minimum loss-based estimation (CTMLE) is a general framework for constructing doubly robust semiparametric causal estimators that data-adaptively limit model complexity in the propensity score to optimize a preferred loss function. This stepwise complexity reduction is based on a loss function placed on a strategically updated model for the outcome variable through which the error is assessed using cross-validation. We demonstrate how the existing stepwise variable selection CTMLE can be generalized using regression shrinkage of the propensity score. We present 2 new algorithms that involve stepwise selection of the penalization parameter(s) in the regression shrinkage. Simulation studies demonstrate that, under a misspecified outcome model, mean squared error and bias can be reduced by a CTMLE procedure that separately penalizes individual covariates in the propensity score. We demonstrate these approaches in an example using electronic medical data with sparse indicator covariates to evaluate the relative safety of 2 similarly indicated asthma therapies for pregnant women with moderate asthma.


Asunto(s)
Aprendizaje Automático , Modelos Estadísticos , Algoritmos , Asma/complicaciones , Asma/tratamiento farmacológico , Bioestadística , Causalidad , Simulación por Computador , Femenino , Humanos , Recién Nacido , Funciones de Verosimilitud , Evaluación de Resultado en la Atención de Salud/estadística & datos numéricos , Embarazo , Complicaciones del Embarazo/tratamiento farmacológico , Puntaje de Propensión , Análisis de Regresión
20.
Stat Med ; 36(24): 3807-3819, 2017 Oct 30.
Artículo en Inglés | MEDLINE | ID: mdl-28744883

RESUMEN

Missing outcome data is a crucial threat to the validity of treatment effect estimates from randomized trials. The outcome distributions of participants with missing and observed data are often different, which increases bias. Causal inference methods may aid in reducing the bias and improving efficiency by incorporating baseline variables into the analysis. In particular, doubly robust estimators incorporate 2 nuisance parameters: the outcome regression and the missingness mechanism (ie, the probability of missingness conditional on treatment assignment and baseline variables), to adjust for differences in the observed and unobserved groups that can be explained by observed covariates. To consistently estimate the treatment effect, one of these nuisance parameters must be consistently estimated. Traditionally, nuisance parameters are estimated using parametric models, which often precludes consistency, particularly in moderate to high dimensions. Recent research on missing data has focused on data-adaptive estimation to help achieve consistency, but the large sample properties of such methods are poorly understood. In this article, we discuss a doubly robust estimator that is consistent and asymptotically normal under data-adaptive estimation of the nuisance parameters. We provide a formula for an asymptotically exact confidence interval under minimal assumptions. We show that our proposed estimator has smaller finite-sample bias compared to standard doubly robust estimators. We present a simulation study demonstrating the enhanced performance of our estimators in terms of bias, efficiency, and coverage of the confidence intervals. We present the results of an illustrative example: a randomized, double-blind phase 2/3 trial of antiretroviral therapy in HIV-infected persons.


Asunto(s)
Sesgo , Modelos Estadísticos , Ensayos Clínicos Controlados Aleatorios como Asunto/métodos , Algoritmos , Antirretrovirales/uso terapéutico , Simulación por Computador , Infecciones por VIH/tratamiento farmacológico , Humanos , Estadísticas no Paramétricas
SELECCIÓN DE REFERENCIAS
Detalles de la búsqueda