Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 240
Filter
1.
Am J Epidemiol ; 193(7): 1019-1030, 2024 Jul 08.
Article in English | MEDLINE | ID: mdl-38400653

ABSTRACT

Targeted maximum likelihood estimation (TMLE) is increasingly used for doubly robust causal inference, but how missing data should be handled when using TMLE with data-adaptive approaches is unclear. Based on data (1992-1998) from the Victorian Adolescent Health Cohort Study, we conducted a simulation study to evaluate 8 missing-data methods in this context: complete-case analysis, extended TMLE incorporating an outcome-missingness model, the missing covariate missing indicator method, and 5 multiple imputation (MI) approaches using parametric or machine-learning models. We considered 6 scenarios that varied in terms of exposure/outcome generation models (presence of confounder-confounder interactions) and missingness mechanisms (whether outcome influenced missingness in other variables and presence of interaction/nonlinear terms in missingness models). Complete-case analysis and extended TMLE had small biases when outcome did not influence missingness in other variables. Parametric MI without interactions had large bias when exposure/outcome generation models included interactions. Parametric MI including interactions performed best in bias and variance reduction across all settings, except when missingness models included a nonlinear term. When choosing a method for handling missing data in the context of TMLE, researchers must consider the missingness mechanism and, for MI, compatibility with the analysis method. In many settings, a parametric MI approach that incorporates interactions and nonlinearities is expected to perform well.


Subject(s)
Causality , Humans , Likelihood Functions , Adolescent , Data Interpretation, Statistical , Bias , Models, Statistical , Computer Simulation
2.
Lancet ; 402(10412): 1580-1596, 2023 10 28.
Article in English | MEDLINE | ID: mdl-37837988

ABSTRACT

Every year, an estimated 21 million girls aged 15-19 years become pregnant in low-income and middle-income countries (LMICs). Policy responses have focused on reducing the adolescent birth rate whereas efforts to support pregnant adolescents have developed more slowly. We did a systematic review of interventions addressing any health-related outcome for pregnant adolescents and their newborn babies in LMICs and mapped its results to a framework describing high-quality health systems for pregnant adolescents. Although we identified some promising interventions, such as micronutrient supplementation, conditional cash transfers, and well facilitated group care, most studies were at high risk of bias and there were substantial gaps in evidence. These included major gaps in delivery, abortion, and postnatal care, and mental health, violence, and substance misuse-related outcomes. We recommend that the fields of adolescent, maternal, and sexual and reproductive health collaborate to develop more adolescent-inclusive maternal health care and research, and specific interventions for pregnant adolescents. We outline steps to develop high-quality, evidence-based care for the millions of pregnant adolescents and their newborns who currently do not receive this.


Subject(s)
Maternal Health Services , Pregnancy in Adolescence , Adolescent , Female , Humans , Infant, Newborn , Pregnancy , Abortion, Induced , Abortion, Spontaneous , Developing Countries , Pregnant Women , Violence
3.
Biom J ; 66(3): e2200326, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38637322

ABSTRACT

In the context of missing data, the identifiability or "recoverability" of the average causal effect (ACE) depends not only on the usual causal assumptions but also on missingness assumptions that can be depicted by adding variable-specific missingness indicators to causal diagrams, creating missingness directed acyclic graphs (m-DAGs). Previous research described canonical m-DAGs, representing typical multivariable missingness mechanisms in epidemiological studies, and examined mathematically the recoverability of the ACE in each case. However, this work assumed no effect modification and did not investigate methods for estimation across such scenarios. Here, we extend this research by determining the recoverability of the ACE in settings with effect modification and conducting a simulation study to evaluate the performance of widely used missing data methods when estimating the ACE using correctly specified g-computation. Methods assessed were complete case analysis (CCA) and various implementations of multiple imputation (MI) with varying degrees of compatibility with the outcome model used in g-computation. Simulations were based on an example from the Victorian Adolescent Health Cohort Study (VAHCS), where interest was in estimating the ACE of adolescent cannabis use on mental health in young adulthood. We found that the ACE is recoverable when no incomplete variable (exposure, outcome, or confounder) causes its own missingness, and nonrecoverable otherwise, in simplified versions of 10 canonical m-DAGs that excluded unmeasured common causes of missingness indicators. Despite this nonrecoverability, simulations showed that MI approaches that are compatible with the outcome model in g-computation may enable approximately unbiased estimation across all canonical m-DAGs considered, except when the outcome causes its own missingness or causes the missingness of a variable that causes its own missingness. In the latter settings, researchers may need to consider sensitivity analysis methods incorporating external information (e.g., delta-adjustment methods). The VAHCS case study illustrates the practical implications of these findings.


Subject(s)
Cohort Studies , Humans , Young Adult , Adult , Adolescent , Data Interpretation, Statistical , Causality , Computer Simulation
4.
Biom J ; 66(1): e2200291, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38285405

ABSTRACT

Multiple imputation (MI) is a popular method for handling missing data. Auxiliary variables can be added to the imputation model(s) to improve MI estimates. However, the choice of which auxiliary variables to include is not always straightforward. Several data-driven auxiliary variable selection strategies have been proposed, but there has been limited evaluation of their performance. Using a simulation study we evaluated the performance of eight auxiliary variable selection strategies: (1, 2) two versions of selection based on correlations in the observed data; (3) selection using hypothesis tests of the "missing completely at random" assumption; (4) replacing auxiliary variables with their principal components; (5, 6) forward and forward stepwise selection; (7) forward selection based on the estimated fraction of missing information; and (8) selection via the least absolute shrinkage and selection operator (LASSO). A complete case analysis and an MI analysis using all auxiliary variables (the "full model") were included for comparison. We also applied all strategies to a motivating case study. The full model outperformed all auxiliary variable selection strategies in the simulation study, with the LASSO strategy the best performing auxiliary variable selection strategy overall. All MI analysis strategies that we were able to apply to the case study led to similar estimates, although computational time was substantially reduced when variable selection was employed. This study provides further support for adopting an inclusive auxiliary variable strategy where possible. Auxiliary variable selection using the LASSO may be a promising alternative when the full model fails or is too burdensome.


Subject(s)
Computer Simulation
5.
Cleft Palate Craniofac J ; : 10556656241253949, 2024 May 09.
Article in English | MEDLINE | ID: mdl-38725271

ABSTRACT

The Timing of Primary Surgery (TOPS) trial was published August 2023 in the New England Journal of Medicine and is a milestone achievement for a study focused on cleft palate. Due to the complexity of outcome reporting in cleft and the rarity of such comparative trials, TOPS presents a useful opportunity to critically review the design, analysis and reporting strategies utilised. This perspective article focused on the inclusion of participants, the choice of the primary outcome measure and the analysis of ordinal data within the trial. Considerations for future comparative studies in cleft care are discussed.

6.
BMC Med Res Methodol ; 23(1): 287, 2023 12 07.
Article in English | MEDLINE | ID: mdl-38062377

ABSTRACT

BACKGROUND: Case-cohort studies are conducted within cohort studies, with the defining feature that collection of exposure data is limited to a subset of the cohort, leading to a large proportion of missing data by design. Standard analysis uses inverse probability weighting (IPW) to address this intended missing data, but little research has been conducted into how best to perform analysis when there is also unintended missingness. Multiple imputation (MI) has become a default standard for handling unintended missingness and is typically used in combination with IPW to handle the intended missingness due to the case-control sampling. Alternatively, MI could be used to handle both the intended and unintended missingness. While the performance of an MI-only approach has been investigated in the context of a case-cohort study with a time-to-event outcome, it is unclear how this approach performs with a binary outcome. METHODS: We conducted a simulation study to assess and compare the performance of approaches using only MI, only IPW, and a combination of MI and IPW, for handling intended and unintended missingness in the case-cohort setting. We also applied the approaches to a case study. RESULTS: Our results show that the combined approach is approximately unbiased for estimation of the exposure effect when the sample size is large, and was the least biased with small sample sizes, while MI-only and IPW-only exhibited larger biases in both sample size settings. CONCLUSIONS: These findings suggest that a combined MI/IPW approach should be preferred to handle intended and unintended missing data in case-cohort studies with binary outcomes.


Subject(s)
Cohort Studies , Humans , Data Interpretation, Statistical , Probability , Bias , Computer Simulation
7.
BMC Med Res Methodol ; 23(1): 42, 2023 02 16.
Article in English | MEDLINE | ID: mdl-36797679

ABSTRACT

BACKGROUND: Despite recent advances in causal inference methods, outcome regression remains the most widely used approach for estimating causal effects in epidemiological studies with a single-point exposure and outcome. Missing data are common in these studies, and complete-case analysis (CCA) and multiple imputation (MI) are two frequently used methods for handling them. In randomised controlled trials (RCTs), it has been shown that MI should be conducted separately by treatment group. In observational studies, causal inference is now understood as the task of emulating an RCT, which raises the question of whether MI should be conducted by exposure group in such studies. METHODS: We addressed this question by evaluating the performance of seven methods for handling missing data when estimating causal effects with outcome regression. We conducted an extensive simulation study based on an illustrative case study from the Victorian Adolescent Health Cohort Study, assessing a range of scenarios, including seven outcome generation models with exposure-confounder interactions of differing strength. RESULTS: The simulation results showed that MI by exposure group led to the least bias when the size of the smallest exposure group was relatively large, followed by MI approaches that included the exposure-confounder interactions. CONCLUSIONS: The findings from our simulation study, which was designed based on a real case study, suggest that current practice for the conduct of MI in causal inference may need to shift to stratifying by exposure group where feasible, or otherwise including exposure-confounder interactions in the imputation model.


Subject(s)
Computer Simulation , Humans , Adolescent , Bias
8.
Clin Trials ; 20(5): 479-485, 2023 10.
Article in English | MEDLINE | ID: mdl-37144610

ABSTRACT

BACKGROUND: Blinding of treatment allocation from treating clinicians in neonatal randomised controlled trials can minimise performance bias, but its effectiveness is rarely assessed. METHODS: To examine the effectiveness of blinding a procedural intervention from treating clinicians in a multicentre randomised controlled trial of minimally invasive surfactant therapy versus sham treatment in preterm infants of gestation 25-28 weeks with respiratory distress syndrome. The intervention (minimally invasive surfactant therapy or sham) was performed behind a screen within the first 6 h of life by a 'study team' uninvolved in clinical care including decision-making. Procedure duration and the study team's words and actions during the sham treatment mimicked those of the minimally invasive surfactant therapy procedure. Post-intervention, three clinicians completed a questionnaire regarding perceived group allocation, with the responses matched against actual intervention and categorised as correct, incorrect, or unsure. Success of blinding was calculated using validated blinding indices applied to the data overall (James index, successful blinding defined as > 0.50), or to the two treatment allocation groups (Bang index, successful blinding: -0.30 to 0.30). Blinding success was measured within staff role, and the associations between blinding success and procedural duration and oxygenation improvement post-procedure were estimated. RESULTS: From 1345 questionnaires in relation to a procedural intervention in 485 participants, responses were categorised as correct in 441 (33%), incorrect in 142 (11%), and unsure in 762 (57%), with similar proportions for each of the response categories in the two treatment arms. The James index indicated successful blinding overall 0.67 (95% confidence interval (CI) 0.65-0.70). The Bang index was 0.28 (95% CI 0.23-0.32) in the minimally invasive surfactant therapy group and 0.17 (95% CI 0.12-0.21) in the sham arm. Neonatologists more frequently guessed the correct intervention (47%) than bedside nurses (36%), neonatal trainees (31%), and other nurses (24%). For the minimally invasive surfactant therapy intervention, the Bang index was linearly related to procedural duration and oxygenation improvement post-procedure. No evidence of such relationships was seen in the sham arm. CONCLUSION: Blinding of a procedural intervention from clinicians is both achievable and measurable in neonatal randomised controlled trials.


Subject(s)
Infant, Premature , Surface-Active Agents , Infant , Humans , Infant, Newborn , Randomized Controlled Trials as Topic
9.
JAMA ; 330(11): 1054-1063, 2023 09 19.
Article in English | MEDLINE | ID: mdl-37695601

ABSTRACT

Importance: The long-term effects of surfactant administration via a thin catheter (minimally invasive surfactant therapy [MIST]) in preterm infants with respiratory distress syndrome remain to be definitively clarified. Objective: To examine the effect of MIST on death or neurodevelopmental disability (NDD) at 2 years' corrected age. Design, Setting, and Participants: Follow-up study of a randomized clinical trial with blinding of clinicians and outcome assessors conducted in 33 tertiary-level neonatal intensive care units in 11 countries. The trial included 486 infants with a gestational age of 25 to 28 weeks supported with continuous positive airway pressure (CPAP). Collection of follow-up data at 2 years' corrected age was completed on December 9, 2022. Interventions: Infants assigned to MIST (n = 242) received exogenous surfactant (200 mg/kg poractant alfa) via a thin catheter; those assigned to the control group (n = 244) received sham treatment. Main Outcomes and Measures: The key secondary outcome of death or moderate to severe NDD was assessed at 2 years' corrected age. Other secondary outcomes included components of this composite outcome, as well as hospitalizations for respiratory illness and parent-reported wheezing or breathing difficulty in the first 2 years. Results: Among the 486 infants randomized, 453 had follow-up data available (median gestation, 27.3 weeks; 228 females [50.3%]); data on the key secondary outcome were available in 434 infants. Death or NDD occurred in 78 infants (36.3%) in the MIST group and 79 (36.1%) in the control group (risk difference, 0% [95% CI, -7.6% to 7.7%]; relative risk [RR], 1.0 [95% CI, 0.81-1.24]); components of this outcome did not differ significantly between groups. Secondary respiratory outcomes favored the MIST group. Hospitalization with respiratory illness occurred in 49 infants (25.1%) in the MIST group vs 78 (38.2%) in the control group (RR, 0.66 [95% CI, 0.54-0.81]) and parent-reported wheezing or breathing difficulty in 73 (40.6%) vs 104 (53.6%), respectively (RR, 0.76 [95% CI, 0.63-0.90]). Conclusions and Relevance: In this follow-up study of a randomized clinical trial of preterm infants with respiratory distress syndrome supported with CPAP, MIST compared with sham treatment did not reduce the incidence of death or NDD by 2 years of age. However, infants who received MIST had lower rates of adverse respiratory outcomes during their first 2 years of life. Trial Registration: anzctr.org.au Identifier: ACTRN12611000916943.


Subject(s)
Pulmonary Surfactants , Respiratory Distress Syndrome, Newborn , Female , Humans , Infant , Infant, Newborn , Dyspnea , Follow-Up Studies , Infant, Premature , Lipoproteins , Pulmonary Surfactants/administration & dosage , Pulmonary Surfactants/therapeutic use , Respiratory Distress Syndrome/complications , Respiratory Distress Syndrome/drug therapy , Respiratory Distress Syndrome/therapy , Respiratory Distress Syndrome, Newborn/complications , Respiratory Distress Syndrome, Newborn/drug therapy , Respiratory Distress Syndrome, Newborn/therapy , Respiratory Sounds , Surface-Active Agents/administration & dosage , Surface-Active Agents/therapeutic use , Catheterization , Minimally Invasive Surgical Procedures , Continuous Positive Airway Pressure , Male , Child, Preschool
10.
BMC Med Res Methodol ; 22(1): 87, 2022 04 03.
Article in English | MEDLINE | ID: mdl-35369860

ABSTRACT

BACKGROUND: In case-cohort studies a random subcohort is selected from the inception cohort and acts as the sample of controls for several outcome investigations. Analysis is conducted using only the cases and the subcohort, with inverse probability weighting (IPW) used to account for the unequal sampling probabilities resulting from the study design. Like all epidemiological studies, case-cohort studies are susceptible to missing data. Multiple imputation (MI) has become increasingly popular for addressing missing data in epidemiological studies. It is currently unclear how best to incorporate the weights from a case-cohort analysis in MI procedures used to address missing covariate data. METHOD: A simulation study was conducted with missingness in two covariates, motivated by a case study within the Barwon Infant Study. MI methods considered were: using the outcome, a proxy for weights in the simple case-cohort design considered, as a predictor in the imputation model, with and without exposure and covariate interactions; imputing separately within each weight category; and using a weighted imputation model. These methods were compared to a complete case analysis (CCA) within the context of a standard IPW analysis model estimating either the risk or odds ratio. The strength of associations, missing data mechanism, proportion of observations with incomplete covariate data, and subcohort selection probability varied across the simulation scenarios. Methods were also applied to the case study. RESULTS: There was similar performance in terms of relative bias and precision with all MI methods across the scenarios considered, with expected improvements compared with the CCA. Slight underestimation of the standard error was seen throughout but the nominal level of coverage (95%) was generally achieved. All MI methods showed a similar increase in precision as the subcohort selection probability increased, irrespective of the scenario. A similar pattern of results was seen in the case study. CONCLUSIONS: How weights were incorporated into the imputation model had minimal effect on the performance of MI; this may be due to case-cohort studies only having two weight categories. In this context, inclusion of the outcome in the imputation model was sufficient to account for the unequal sampling probabilities in the analysis model.


Subject(s)
Research Design , Bias , Cohort Studies , Data Interpretation, Statistical , Humans , Probability
11.
BMC Med Res Methodol ; 22(1): 112, 2022 04 13.
Article in English | MEDLINE | ID: mdl-35418034

ABSTRACT

BACKGROUND: Stepped wedge trials are an appealing and potentially powerful cluster randomized trial design. However, they are frequently implemented with a small number of clusters. Standard analysis methods for these trials such as a linear mixed model with estimation via maximum likelihood or restricted maximum likelihood (REML) rely on asymptotic properties and have been shown to yield inflated type I error when applied to studies with a small number of clusters. Small-sample methods such as the Kenward-Roger approximation in combination with REML can potentially improve estimation of the fixed effects such as the treatment effect. A Bayesian approach may also be promising for such multilevel models but has not yet seen much application in cluster randomized trials. METHODS: We conducted a simulation study comparing the performance of REML with and without a Kenward-Roger approximation to a Bayesian approach using weakly informative prior distributions on the intracluster correlation parameters. We considered a continuous outcome and a range of stepped wedge trial configurations with between 4 and 40 clusters. To assess method performance we calculated bias and mean squared error for the treatment effect and correlation parameters and the coverage of 95% confidence/credible intervals and relative percent error in model-based standard error for the treatment effect. RESULTS: Both REML with a Kenward-Roger standard error and degrees of freedom correction and the Bayesian method performed similarly well for the estimation of the treatment effect, while intracluster correlation parameter estimates obtained via the Bayesian method were less variable than REML estimates with different relative levels of bias. CONCLUSIONS: The use of REML with a Kenward-Roger approximation may be sufficient for the analysis of stepped wedge cluster randomized trials with a small number of clusters. However, a Bayesian approach with weakly informative prior distributions on the intracluster correlation parameters offers a viable alternative, particularly when there is interest in the probability-based inferences permitted within this paradigm.


Subject(s)
Research Design , Bayes Theorem , Cluster Analysis , Computer Simulation , Humans , Likelihood Functions , Randomized Controlled Trials as Topic , Sample Size
12.
Biom J ; 64(8): 1404-1425, 2022 Dec.
Article in English | MEDLINE | ID: mdl-34914127

ABSTRACT

Three-level data structures arising from repeated measures on individuals clustered within larger units are common in health research studies. Missing data are prominent in such studies and are often handled via multiple imputation (MI). Although several MI approaches can be used to account for the three-level structure, including adaptations to single- and two-level approaches, when the substantive analysis model includes interactions or quadratic effects, these too need to be accommodated in the imputation model. In such analyses, substantive model compatible (SMC) MI has shown great promise in the context of single-level data. Although there have been recent developments in multilevel SMC MI, to date only one approach that explicitly handles incomplete three-level data is available. Alternatively, researchers can use pragmatic adaptations to single- and two-level MI approaches, or two-level SMC-MI approaches. We describe the available approaches and evaluate them via simulations in the context of three three-level random effects analysis models involving an interaction between the incomplete time-varying exposure and time, an interaction between the time-varying exposure and an incomplete time-fixed confounder, or a quadratic effect of the exposure. Results showed that all approaches considered performed well in terms of bias and precision when the target analysis involved an interaction with time, but the three-level SMC MI approach performed best when the target analysis involved an interaction between the time-varying exposure and an incomplete time-fixed confounder, or a quadratic effect of the exposure. We illustrate the methods using data from the Childhood to Adolescence Transition Study.


Subject(s)
Research Design , Adolescent , Humans , Child , Bias , Computer Simulation
13.
Stat Med ; 40(26): 5765-5778, 2021 11 20.
Article in English | MEDLINE | ID: mdl-34390264

ABSTRACT

For cluster randomized trials (CRTs) with a small number of clusters, the matched-pair (MP) design, where clusters are paired before randomizing one to each trial arm, is often recommended to minimize imbalance on known prognostic factors, add face-validity to the study, and increase efficiency, provided the analysis recognizes the matching. Little evidence exists to guide decisions on when to use matching. We used simulation to compare the efficiency of the MP design with the stratified and simple designs, based on the mean confidence interval width of the estimated intervention effect. Matched and unmatched analyses were used for the MP design; a stratified analysis was used for the stratified design; and analyses without and with post-stratification adjustment for factors that would otherwise have been used for restricted allocation were used for the simple design. Results showed the MP design was generally the most efficient for CRTs with 10 or more pairs when the correlation between cluster-level outcomes within pairs (matching correlation) was moderate to strong (0.3-0.5). There was little gain in efficiency for the MP or stratified designs compared to simple randomization when the matching correlation was weak (0.05-0.1). For trials with four pairs of clusters, the simple and stratified designs were more efficient than the MP design because greater degrees of freedom were available for the analysis, although an unmatched analysis of the MP design recovered precision for weak matching correlations. Practical guidance on choosing between the MP, stratified, and simple designs is provided.


Subject(s)
Research Design , Cluster Analysis , Computer Simulation , Humans , Random Allocation , Randomized Controlled Trials as Topic
14.
Stat Med ; 40(21): 4660-4674, 2021 09 20.
Article in English | MEDLINE | ID: mdl-34102709

ABSTRACT

Medical research often involves using multi-item scales to assess individual characteristics, disease severity, and other health-related outcomes. It is common to observe missing data in the scale scores, due to missing data in one or more items that make up that score. Multiple imputation (MI) is a popular method for handling missing data. However, it is not clear how best to use MI in the context of scale scores, particularly when they are assessed at multiple waves of data collection resulting in large numbers of items. The aim of this article is to provide practical advice on how to impute missing values in a repeatedly measured multi-item scale using MI when inference on the scale score is of interest. We evaluated the performance of five MI strategies for imputing missing data at either the item or scale level using simulated data and a case study based on four waves of the Longitudinal Study of Australian Children (LSAC). MI was implemented using both multivariate normal imputation and fully conditional specification, with two rules for calculating the scale score. A complete case analysis was also performed for comparison. Based on our results, we caution against the use of a MI strategy that does not include the scale score in the imputation model(s) when the scale score is required for analysis.


Subject(s)
Research Design , Australia , Child , Computer Simulation , Data Collection , Humans , Longitudinal Studies
15.
Stat Med ; 40(27): 6093-6106, 2021 11 30.
Article in English | MEDLINE | ID: mdl-34423450

ABSTRACT

Semi-continuous variables are characterized by a point mass at one value and a continuous range of values for remaining observations. An example is alcohol consumption quantity, with a spike of zeros representing non-drinkers and positive values for drinkers. If multiple imputation is used to handle missing values for semi-continuous variables, it is unclear how this should be implemented within the standard approaches of fully conditional specification (FCS) and multivariate normal imputation (MVNI). This question is brought into focus by the use of categorized versions of semi-continuous exposure variables in analyses (eg, no drinking, drinking below binge level, binge drinking, heavy binge drinking), raising the question of how best to achieve congeniality between imputation and analysis models. We performed a simulation study comparing nine approaches for imputing semi-continuous exposures requiring categorization for analysis. Three methods imputed the categories directly: ordinal logistic regression, and imputation of binary indicator variables representing the categories using MVNI (with two variants). Six methods (predictive mean matching, zero-inflated binomial imputation, and two-part imputation methods with variants in FCS and MVNI) imputed the semi-continuous variable, with categories derived after imputation. The ordinal and zero-inflated binomial methods had good performance across most scenarios, while MVNI methods requiring rounding after imputation did not perform well. There were mixed results for predictive mean matching and the two-part methods, depending on whether the estimands were proportions or regression coefficients. The results highlight the need to consider the parameter of interest when selecting an imputation procedure.


Subject(s)
Data Collection , Research Design , Computer Simulation , Data Collection/methods , Humans , Logistic Models
16.
Emerg Themes Epidemiol ; 18(1): 5, 2021 Apr 01.
Article in English | MEDLINE | ID: mdl-33794933

ABSTRACT

Multiple imputation is a recommended method for handling incomplete data problems. One of the barriers to its successful use is the breakdown of the multiple imputation procedure, often due to numerical problems with the algorithms used within the imputation process. These problems frequently occur when imputation models contain large numbers of variables, especially with the popular approach of multivariate imputation by chained equations. This paper describes common causes of failure of the imputation procedure including perfect prediction and collinearity, focusing on issues when using Stata software. We outline a number of strategies for addressing these issues, including imputation of composite variables instead of individual components, introducing prior information and changing the form of the imputation model. These strategies are illustrated using a case study based on data from the Longitudinal Study of Australian Children.

17.
BMC Med Res Methodol ; 21(1): 39, 2021 02 22.
Article in English | MEDLINE | ID: mdl-33618655

ABSTRACT

BACKGROUND: Dynamic treatment regimens (DTRs) formalise the multi-stage and dynamic decision problems that clinicians often face when treating chronic or progressive medical conditions. Compared to randomised controlled trials, using observational data to optimise DTRs may allow a wider range of treatments to be evaluated at a lower cost. This review aimed to provide an overview of how DTRs are optimised with observational data in practice. METHODS: Using the PubMed database, a scoping review of studies in which DTRs were optimised using observational data was performed in October 2020. Data extracted from eligible articles included target medical condition, source and type of data, statistical methods, and translational relevance of the included studies. RESULTS: From 209 PubMed abstracts, 37 full-text articles were identified, and a further 26 were screened from the reference lists, totalling 63 articles for inclusion in a narrative data synthesis. Observational DTR models are a recent development and their application has been concentrated in a few medical areas, primarily HIV/AIDS (27, 43%), followed by cancer (8, 13%), and diabetes (6, 10%). There was substantial variation in the scope, intent, complexity, and quality between the included studies. Statistical methods that were used included inverse-probability weighting (26, 41%), the parametric G-formula (16, 25%), Q-learning (10, 16%), G-estimation (4, 6%), targeted maximum likelihood/minimum loss-based estimation (4, 6%), regret regression (3, 5%), and other less common approaches (10, 16%). Notably, studies that were primarily intended to address real-world clinical questions (18, 29%) tended to use inverse-probability weighting and the parametric G-formula, relatively well-established methods, along with a large amount of data. Studies focused on methodological developments (45, 71%) tended to be more complicated and included a demonstrative real-world application only. CONCLUSIONS: As chronic and progressive conditions become more common, the need will grow for personalised treatments and methods to estimate the effects of DTRs. Observational DTR studies will be necessary, but so far their use to inform clinical practice has been limited. Focusing on simple DTRs, collecting large and rich clinical datasets, and fostering tight partnerships between content experts and data analysts may result in more clinically relevant observational DTR studies.


Subject(s)
Probability , Clinical Protocols , Humans
18.
Med J Aust ; 214(4): 179-185, 2021 03.
Article in English | MEDLINE | ID: mdl-33538019

ABSTRACT

OBJECTIVES: To estimate SARS-CoV-2-specific antibody seroprevalence after the first epidemic wave of coronavirus disease 2019 (COVID-19) in Sydney. SETTING, PARTICIPANTS: People of any age who had provided blood for testing at selected diagnostic pathology services (general pathology); pregnant women aged 20-39 years who had received routine antenatal screening; and Australian Red Cross Lifeblood plasmapheresis donors aged 20-69 years. DESIGN: Cross-sectional study; testing of de-identified residual blood specimens collected during 20 April - 2 June 2020. MAIN OUTCOME MEASURE: Estimated proportions of people seropositive for anti-SARS-CoV-2-specific IgG, adjusted for test sensitivity and specificity. RESULTS: Thirty-eight of 5339 specimens were IgG-positive (general pathology, 19 of 3231; antenatal screening, 7 of 560; plasmapheresis donors, 12 of 1548); there were no clear patterns by age group, sex, or location of residence. Adjusted estimated seroprevalence among people who had had general pathology blood tests (all ages) was 0.15% (95% credible interval [CrI], 0.04-0.41%), and 0.29% (95% CrI, 0.04-0.75%) for plasmapheresis donors (20-69 years). Among 20-39-year-old people, the age group common to all three collection groups, adjusted estimated seroprevalence was 0.24% (95% CrI, 0.04-0.80%) for the general pathology group, 0.79% (95% CrI, 0.04-1.88%) for the antenatal screening group, and 0.69% (95% CrI, 0.04-1.59%) for plasmapheresis donors. CONCLUSIONS: Estimated SARS-CoV-2 seroprevalence was below 1%, indicating that community transmission was low during the first COVID-19 epidemic wave in Sydney. These findings suggest that early control of the spread of COVID-19 was successful, but efforts to reduce further transmission remain important.


Subject(s)
Antibodies, Viral/blood , COVID-19/epidemiology , COVID-19/virology , Pandemics , SARS-CoV-2/immunology , Adolescent , Adult , Aged , Australia/epidemiology , Blood Donors , Child , Child, Preschool , Cross-Sectional Studies , Female , Humans , Immunoglobulin G/blood , Infant , Infant, Newborn , Male , Middle Aged , Pregnancy , Seroepidemiologic Studies , Young Adult
19.
JAMA ; 326(24): 2478-2487, 2021 12 28.
Article in English | MEDLINE | ID: mdl-34902013

ABSTRACT

Importance: The benefits of surfactant administration via a thin catheter (minimally invasive surfactant therapy [MIST]) in preterm infants with respiratory distress syndrome are uncertain. Objective: To examine the effect of selective application of MIST at a low fraction of inspired oxygen threshold on survival without bronchopulmonary dysplasia (BPD). Design, Setting, and Participants: Randomized clinical trial including 485 preterm infants with a gestational age of 25 to 28 weeks who were supported with continuous positive airway pressure (CPAP) and required a fraction of inspired oxygen of 0.30 or greater within 6 hours of birth. The trial was conducted at 33 tertiary-level neonatal intensive care units around the world, with blinding of the clinicians and outcome assessors. Enrollment took place between December 16, 2011, and March 26, 2020; follow-up was completed on December 2, 2020. Interventions: Infants were randomized to the MIST group (n = 241) and received exogenous surfactant (200 mg/kg of poractant alfa) via a thin catheter or to the control group (n = 244) and received a sham (control) treatment; CPAP was continued thereafter in both groups unless specified intubation criteria were met. Main Outcomes and Measures: The primary outcome was the composite of death or physiological BPD assessed at 36 weeks' postmenstrual age. The components of the primary outcome (death prior to 36 weeks' postmenstrual age and BPD at 36 weeks' postmenstrual age) also were considered separately. Results: Among the 485 infants randomized (median gestational age, 27.3 weeks; 241 [49.7%] female), all completed follow-up. Death or BPD occurred in 105 infants (43.6%) in the MIST group and 121 (49.6%) in the control group (risk difference [RD], -6.3% [95% CI, -14.2% to 1.6%]; relative risk [RR], 0.87 [95% CI, 0.74 to 1.03]; P = .10). Incidence of death before 36 weeks' postmenstrual age did not differ significantly between groups (24 [10.0%] in MIST vs 19 [7.8%] in control; RD, 2.1% [95% CI, -3.6% to 7.8%]; RR, 1.27 [95% CI, 0.63 to 2.57]; P = .51), but incidence of BPD in survivors to 36 weeks' postmenstrual age was lower in the MIST group (81/217 [37.3%] vs 102/225 [45.3%] in the control group; RD, -7.8% [95% CI, -14.9% to -0.7%]; RR, 0.83 [95% CI, 0.70 to 0.98]; P = .03). Serious adverse events occurred in 10.3% of infants in the MIST group and 11.1% in the control group. Conclusions and Relevance: Among preterm infants with respiratory distress syndrome supported with CPAP, minimally invasive surfactant therapy compared with sham (control) treatment did not significantly reduce the incidence of the composite outcome of death or bronchopulmonary dysplasia at 36 weeks' postmenstrual age. However, given the statistical uncertainty reflected in the 95% CI, a clinically important effect cannot be excluded. Trial Registration: anzctr.org.au Identifier: ACTRN12611000916943.


Subject(s)
Biological Products/administration & dosage , Bronchopulmonary Dysplasia/prevention & control , Continuous Positive Airway Pressure , Infant, Premature , Phospholipids/administration & dosage , Pulmonary Surfactants/administration & dosage , Respiratory Distress Syndrome, Newborn/drug therapy , Female , Humans , Infant, Newborn , Infant, Premature, Diseases/mortality , Male , Respiratory Distress Syndrome, Newborn/mortality , Respiratory Distress Syndrome, Newborn/therapy , Single-Blind Method
20.
Am J Epidemiol ; 189(7): 717-725, 2020 07 01.
Article in English | MEDLINE | ID: mdl-32285096

ABSTRACT

Multilevel regression and poststratification (MRP) is a model-based approach for estimating a population parameter of interest, generally from large-scale surveys. It has been shown to be effective in highly selected samples, which is particularly relevant to investigators of large-scale population health and epidemiologic surveys facing increasing difficulties in recruiting representative samples of participants. We aimed to further examine the accuracy and precision of MRP in a context where census data provided reasonable proxies for true population quantities of interest. We considered 2 outcomes from the baseline wave of the Ten to Men study (Australia, 2013-2014) and obtained relevant population data from the 2011 Australian Census. MRP was found to achieve generally superior performance relative to conventional survey weighting methods for the population as a whole and for population subsets of varying sizes. MRP resulted in less variability among estimates across population subsets relative to sample weighting, and there was some evidence of small gains in precision when using MRP, particularly for smaller population subsets. These findings offer further support for MRP as a promising analytical approach for addressing participation bias in the estimation of population descriptive quantities from large-scale health surveys and cohort studies.


Subject(s)
Epidemiologic Methods , Health Surveys/methods , Population Health/statistics & numerical data , Statistics as Topic , Adult , Australia/epidemiology , Humans , Male , Middle Aged , Multilevel Analysis , Patient Selection , Regression Analysis , Selection Bias
SELECTION OF CITATIONS
SEARCH DETAIL