RESUMO
Administration of antenatal corticosteroids (ACS) for accelerating foetal lung maturation in threatened preterm birth is one of the cornerstones of prevention of neonatal mortality and morbidity. To identify the optimal timing of ACS administration, most studies have compared subgroups based on treatment-to-delivery intervals. Such subgroup analysis of the first placebo-controlled randomised controlled trial indicated that a one to seven day interval between ACS administration and birth resulted in the lowest rates of neonatal respiratory distress syndrome. This efficacy window was largely confirmed by a series of subgroup analyses of subsequent trials and observational studies and strongly influenced obstetric management. However, these subgroup analyses suffer from a methodological flaw that often seems to be overlooked and potentially has important consequences for drawing valid conclusions. In this commentary, we point out that studies comparing treatment outcomes between subgroups that are retrospectively identified at birth (i.e. after randomisation) may not only be plagued by post-randomisation confounding bias but, more importantly, may not adequately inform decision making before birth, when the projected duration of the interval is still unknown. We suggest two more formal interpretations of these subgroup analyses, using a counterfactual framework for causal inference, and demonstrate that each of these interpretations can be linked to a different hypothetical trial. However, given the infeasibility of these trials, we argue that none of these rescue interpretations are helpful for clinical decision making. As a result, guidelines based on these subgroup analyses may have led to suboptimal clinical practice. As an alternative to these flawed subgroup analyses, we suggest a more principled approach that clearly formulates the question about optimal timing of ACS treatment in terms of the protocol of a future randomised study. Even if this 'target trial' would never be conducted, its protocol may still provide important guidance to avoid repeating common design flaws when conducting observational 'real world' studies using statistical methods for causal inference.
Assuntos
Nascimento Prematuro , Síndrome do Desconforto Respiratório do Recém-Nascido , Recém-Nascido , Gravidez , Feminino , Humanos , Nascimento Prematuro/prevenção & controle , Nascimento Prematuro/tratamento farmacológico , Estudos Retrospectivos , Corticosteroides/uso terapêutico , Síndrome do Desconforto Respiratório do Recém-Nascido/prevenção & controle , Mortalidade Infantil , Ensaios Clínicos Controlados Aleatórios como AssuntoRESUMO
When drawing causal inference from observed data, failure time outcomes present additional challenges of censoring often combined with other missing data patterns. In this article, we follow incident cases of end-stage renal disease to examine the effect on all-cause mortality of starting treatment with transplant, so-called pre-emptive kidney transplantation, vs starting with dialysis possibly followed by delayed transplantation. The question is relatively simple: which start-off treatment is expected to bring the best survival for a target population? To address it, we emulate a target trial drawing on the long term Swedish Renal Registry, where a growing common set of baseline covariates was measured nationwide. Several lessons are learned which pertain to long term disease registers more generally. With characteristics of cases and versions of treatment evolving over time, informative censoring is already introduced in unadjusted Kaplan-Meier curves. This leads to misrepresented survival chances in observed treatment groups. The resulting biased treatment association may be aggravated upon implementing IPW for treatment. Aware of additional challenges, we further recall how similar studies to date have selected patients into treatment groups based on events occurring post treatment initiation. Our study reveals the dramatic impact of resulting immortal time bias combined with other typical features of long-term incident disease registers, including missing covariates during the early phases of the register. We discuss feasible ways of accommodating these features when targeting relevant estimands, and demonstrate how more than one causal question can be answered relying on the no unmeasured baseline confounders assumption.
Assuntos
Falência Renal Crônica , Transplante de Rim , Humanos , Incidência , Falência Renal Crônica/epidemiologia , Falência Renal Crônica/cirurgia , Transplante de Rim/efeitos adversos , Sistema de Registros , Diálise Renal , Análise de Sobrevida , Taxa de SobrevidaRESUMO
The ICH E9 addendum introduces the term intercurrent event to refer to events that happen after treatment initiation and that can either preclude observation of the outcome of interest or affect its interpretation. It proposes five strategies for handling intercurrent events to form an estimand but does not suggest statistical methods for estimation. In this article we focus on the hypothetical strategy, where the treatment effect is defined under the hypothetical scenario in which the intercurrent event is prevented. For its estimation, we consider causal inference and missing data methods. We establish that certain "causal inference estimators" are identical to certain "missing data estimators." These links may help those familiar with one set of methods but not the other. Moreover, using potential outcome notation allows us to state more clearly the assumptions on which missing data methods rely to estimate hypothetical estimands. This helps to indicate whether estimating a hypothetical estimand is reasonable, and what data should be used in the analysis. We show that hypothetical estimands can be estimated by exploiting data after intercurrent event occurrence, which is typically not used. Supplementary materials for this article are available online.
RESUMO
OBJECTIVE: To evaluate the consistency of causal statements in observational studies published in The BMJ. DESIGN: Review of observational studies published in a general medical journal. DATA SOURCE: Cohort and other longitudinal studies describing an exposure-outcome relationship published in The BMJ in 2018. We also had access to the submitted papers and reviewer reports. MAIN OUTCOME MEASURES: Proportion of published research papers with 'inconsistent' use of causal language. Papers where language was consistently causal or non-causal were classified as 'consistently causal' or 'consistently not causal', respectively. For the 'inconsistent' papers, we then compared the published and submitted version. RESULTS: Of 151 published research papers, 60 described eligible studies. Of these 60, we classified the causal language used as 'consistently causal' (48%), 'inconsistent' (20%) and 'consistently not causal'(32%). Eleven out of 12 (92%) of the 'inconsistent' papers were already inconsistent on submission. The inconsistencies found in both submitted and published versions were mainly due to mismatches between objectives and conclusions. One section might be carefully phrased in terms of association while the other presented causal language. When identifying only an association, some authors jumped to recommending acting on the findings as if motivated by the evidence presented. CONCLUSION: Further guidance is necessary for authors on what constitutes a causal statement and how to justify or discuss assumptions involved. Based on screening these papers, we provide a list of expressions beyond the obvious 'cause' word which may inspire a useful more comprehensive compendium on causal language.