Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 4.844
Filter
1.
Acta Neurochir (Wien) ; 166(1): 262, 2024 Jun 12.
Article in English | MEDLINE | ID: mdl-38864938

ABSTRACT

PURPOSE: Each institution or physician has to decide on an individual basis whether to continue or discontinue antiplatelet (AP) therapy before spinal surgery. The purpose of this study was to determine if perioperative AP continuation is safe during single-level microsurgical decompression (MSD) for treating lumbar spinal stenosis (LSS) and lumbar disc hernia (LDH) without selection bias. METHODS: Patients who underwent single-level MSD for LSS and LDH between April 2018 to December 2022 at our institute were included in this retrospective study. We collected data regarding baseline characteristics, medical history/comorbidities, epidural hematoma (EDH) volume, reoperation for EDH, differences between preoperative and one-day postoperative blood cell counts (ΔRBC), hemoglobin (ΔHGB), and hematocrits (ΔHCT), and perioperative thromboembolic complications. Patients were divided into two groups: the AP continuation group received AP treatment before surgery and the control group did not receive antiplatelet medication before surgery. Propensity scores for receiving AP agents were calculated, with one-to-one matching of estimated propensity scores to adjust for patient baseline characteristics and past histories. Reoperation for EDH, EDH volume, ΔRBC, ΔHGB, ΔHCT, and perioperative thromboembolic complications were compared between the groups. RESULTS: The 303 enrolled patients included 41 patients in the AP continuation group. After propensity score matching, the rate of reoperation for EDH, the EDH volume, ΔRBC, ΔHGB, ΔHCT, and perioperative thromboembolic complication rates were not significantly different between the groups. CONCLUSION: Perioperative AP continuation is safe for single-level lumbar MSD, even without biases.


Subject(s)
Decompression, Surgical , Intervertebral Disc Displacement , Lumbar Vertebrae , Microsurgery , Platelet Aggregation Inhibitors , Spinal Stenosis , Humans , Female , Male , Spinal Stenosis/surgery , Middle Aged , Retrospective Studies , Lumbar Vertebrae/surgery , Aged , Decompression, Surgical/methods , Decompression, Surgical/adverse effects , Microsurgery/methods , Microsurgery/adverse effects , Platelet Aggregation Inhibitors/therapeutic use , Platelet Aggregation Inhibitors/administration & dosage , Platelet Aggregation Inhibitors/adverse effects , Intervertebral Disc Displacement/surgery , Selection Bias , Herniorrhaphy/methods , Herniorrhaphy/adverse effects , Postoperative Complications/epidemiology , Postoperative Complications/etiology , Postoperative Complications/prevention & control , Treatment Outcome , Perioperative Care/methods
2.
Spat Spatiotemporal Epidemiol ; 49: 100659, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38876558

ABSTRACT

Spatial cluster analyses are commonly used in epidemiologic studies of case-control data to detect whether certain areas in a study region have an excess of disease risk. Case-control studies are susceptible to potential biases including selection bias, which can result from non-participation of eligible subjects in the study. However, there has been no systematic evaluation of the effects of non-participation on the findings of spatial cluster analyses. In this paper, we perform a simulation study assessing the effect of non-participation on spatial cluster analysis using the local spatial scan statistic under a variety of scenarios that vary the location and rates of study non-participation and the presence and intensity of a zone of elevated risk for disease for simulated case-control studies. We find that geographic areas of lower participation among controls than cases can greatly inflate false-positive rates for identification of artificial spatial clusters. Additionally, we find that even modest non-participation outside of a true zone of elevated risk can decrease spatial power to identify the true zone. We propose a spatial algorithm to correct for potentially spatially structured non-participation that compares the spatial distributions of the observed sample and underlying population. We demonstrate its ability to markedly decrease false positive rates in the absence of elevated risk and resist decreasing spatial sensitivity to detect true zones of elevated risk. We apply our method to a case-control study of non-Hodgkin lymphoma. Our findings suggest that greater attention should be paid to the potential effects of non-participation in spatial cluster studies.


Subject(s)
Spatial Analysis , Humans , Cluster Analysis , Case-Control Studies , Selection Bias , Computer Simulation , Algorithms , Lymphoma, Non-Hodgkin/epidemiology
3.
Stat Med ; 43(15): 2928-2943, 2024 Jul 10.
Article in English | MEDLINE | ID: mdl-38742595

ABSTRACT

In clinical trials, multiple comparisons arising from various treatments/doses, subgroups, or endpoints are common. Typically, trial teams focus on the comparison showing the largest observed treatment effect, often involving a specific treatment pair and endpoint within a subgroup. These findings frequently lead to follow-up pivotal studies, many of which do not confirm the initial positive results. Selection bias occurs when the most promising treatment, subgroup, or endpoint is chosen for further development, potentially skewing subsequent investigations. Such bias can be defined as the deviation in the observed treatment effects from the underlying truth. In this article, we propose a general and unified Bayesian framework to address selection bias in clinical trials with multiple comparisons. Our approach does not require a priori specification of a parametric distribution for the prior, offering a more flexible and generalized solution. The proposed method facilitates a more accurate interpretation of clinical trial results by adjusting for such selection bias. Through simulation studies, we compared several methods and demonstrated their superior performance over the normal shrinkage estimator. We recommended the use of Bayesian Model Averaging estimator averaging over Gaussian Mixture Models as the prior distribution based on its performance and flexibility. We applied the method to a multicenter, randomized, double-blind, placebo-controlled study investigating the cardiovascular effects of dulaglutide.


Subject(s)
Bayes Theorem , Computer Simulation , Randomized Controlled Trials as Topic , Humans , Randomized Controlled Trials as Topic/statistics & numerical data , Models, Statistical , Double-Blind Method , Selection Bias , Bias , Multicenter Studies as Topic , Clinical Trials as Topic/statistics & numerical data
4.
Epidemiology ; 35(4): 437-446, 2024 Jul 01.
Article in English | MEDLINE | ID: mdl-38771708

ABSTRACT

BACKGROUND: The largest case-control study (Interphone study) investigating glioma risk related to mobile phone use showed a J-shaped relationship with reduced relative risks for moderate use and a 40% increased relative risk among the 10% heaviest regular mobile phone users, using a categorical risk model based on deciles of lifetime duration of use among ever regular users. METHODS: We conducted Monte Carlo simulations examining whether the reported estimates are compatible with an assumption of no effect of mobile phone use on glioma risk when the various forms of biases present in the Interphone study are accounted for. Four scenarios of sources of error in self-reported mobile phone use were considered, along with selection bias. Input parameters used for simulations were those obtained from Interphone validation studies on reporting accuracy and from using a nonresponse questionnaire. RESULTS: We found that the scenario simultaneously modeling systematic and random reporting errors produced a J-shaped relationship perfectly compatible with the observed relationship from the main Interphone study with a simulated spurious increased relative risk among heaviest users (odds ratio = 1.91) compared with never regular users. The main determinant for producing this J shape was higher reporting error variance in cases compared with controls, as observed in the validation studies. Selection bias contributed to the reduced risks as well. CONCLUSIONS: Some uncertainty remains, but the evidence from the present simulation study shifts the overall assessment to making it less likely that heavy mobile phone use is causally related to an increased glioma risk.


Subject(s)
Glioma , Monte Carlo Method , Humans , Case-Control Studies , Glioma/epidemiology , Glioma/etiology , Selection Bias , Mental Recall , Risk Assessment , Computer Simulation , Brain Neoplasms/epidemiology , Cell Phone/statistics & numerical data , Cell Phone Use/statistics & numerical data , Cell Phone Use/adverse effects , Male , Female , Risk , Adult
5.
J Am Med Inform Assoc ; 31(7): 1479-1492, 2024 Jun 20.
Article in English | MEDLINE | ID: mdl-38742457

ABSTRACT

OBJECTIVES: To develop recommendations regarding the use of weights to reduce selection bias for commonly performed analyses using electronic health record (EHR)-linked biobank data. MATERIALS AND METHODS: We mapped diagnosis (ICD code) data to standardized phecodes from 3 EHR-linked biobanks with varying recruitment strategies: All of Us (AOU; n = 244 071), Michigan Genomics Initiative (MGI; n = 81 243), and UK Biobank (UKB; n = 401 167). Using 2019 National Health Interview Survey data, we constructed selection weights for AOU and MGI to represent the US adult population more. We used weights previously developed for UKB to represent the UKB-eligible population. We conducted 4 common analyses comparing unweighted and weighted results. RESULTS: For AOU and MGI, estimated phecode prevalences decreased after weighting (weighted-unweighted median phecode prevalence ratio [MPR]: 0.82 and 0.61), while UKB estimates increased (MPR: 1.06). Weighting minimally impacted latent phenome dimensionality estimation. Comparing weighted versus unweighted phenome-wide association study for colorectal cancer, the strongest associations remained unaltered, with considerable overlap in significant hits. Weighting affected the estimated log-odds ratio for sex and colorectal cancer to align more closely with national registry-based estimates. DISCUSSION: Weighting had a limited impact on dimensionality estimation and large-scale hypothesis testing but impacted prevalence and association estimation. When interested in estimating effect size, specific signals from untargeted association analyses should be followed up by weighted analysis. CONCLUSION: EHR-linked biobanks should report recruitment and selection mechanisms and provide selection weights with defined target populations. Researchers should consider their intended estimands, specify source and target populations, and weight EHR-linked biobank analyses accordingly.


Subject(s)
Biological Specimen Banks , Electronic Health Records , Humans , Selection Bias , Female , Male , Adult , Middle Aged , Medical Record Linkage , United States , Aged , United Kingdom , Michigan
6.
Int J Cardiol ; 408: 132138, 2024 Aug 01.
Article in English | MEDLINE | ID: mdl-38705207

ABSTRACT

INTRODUCTION: Despite the growing awareness towards the importance of adequate representation of women in clinical trials among patients treated with percutaneous coronary intervention (PCI), available evidence continues to demonstrate a skewed distribution of study populations in favour of men. METHODS AND RESULTS: In this pre-specified analysis from the MASTER DAPT screening log and trial, we aimed to investigate the existence of a negative selection bias for women inclusion in a randomized clinical trial. A total of 2847 consecutive patients who underwent coronary revascularization across 65 participating sites, during a median of 14 days, were entered in the screening log, including 1749 (61.4%) non-high bleeding risk (HBR) and 1098 (38.6%) HBR patients, of whom 109 (9.9%) consented for trial participation. Female patients were less represented in consented versus non-consented HBR patients (22% versus 30%, absolute standardized difference: 0.18) and among non-consented eligible versus consented eligible patients (absolute standardized difference 0.14). The observed sex gap was primarily due investigators' choice not to offer study participation to females because deemed at very high risk of bleeding and/or ischemic complications, and only marginally to a slightly higher propensity of females compared to males to refuse study participation. CONCLUSIONS: Female HBR patients undergoing PCI are less prevalent, but also less likely to participate in the trial than male patients, mainly due to investigators' preference.


Subject(s)
Patient Selection , Percutaneous Coronary Intervention , Humans , Female , Male , Percutaneous Coronary Intervention/methods , Aged , Middle Aged , Selection Bias , Randomized Controlled Trials as Topic/methods , Hemorrhage/epidemiology , Sex Factors , Coronary Artery Disease/surgery , Coronary Artery Disease/therapy
7.
Int J Epidemiol ; 53(3)2024 Apr 11.
Article in English | MEDLINE | ID: mdl-38715336

ABSTRACT

BACKGROUND: Biobanks typically rely on volunteer-based sampling. This results in large samples (power) at the cost of representativeness (bias). The problem of volunteer bias is debated. Here, we (i) show that volunteering biases associations in UK Biobank (UKB) and (ii) estimate inverse probability (IP) weights that correct for volunteer bias in UKB. METHODS: Drawing on UK Census data, we constructed a subsample representative of UKB's target population, which consists of all individuals invited to participate. Based on demographic variables shared between the UK Census and UKB, we estimated IP weights (IPWs) for each UKB participant. We compared 21 weighted and unweighted bivariate associations between these demographic variables to assess volunteer bias. RESULTS: Volunteer bias in all associations, as naively estimated in UKB, was substantial-in some cases so severe that unweighted estimates had the opposite sign of the association in the target population. For example, older individuals in UKB reported being in better health, in contrast to evidence from the UK Census. Using IPWs in weighted regressions reduced 87% of volunteer bias on average. Volunteer-based sampling reduced the effective sample size of UKB substantially, to 32% of its original size. CONCLUSIONS: Estimates from large-scale biobanks may be misleading due to volunteer bias. We recommend IP weighting to correct for such bias. To aid in the construction of the next generation of biobanks, we provide suggestions on how to best ensure representativeness in a volunteer-based design. For UKB, IPWs have been made available.


Subject(s)
Biological Specimen Banks , Volunteers , Humans , Selection Bias , United Kingdom , Male , Female , Middle Aged , Aged , Adult , Censuses , UK Biobank
9.
Hum Brain Mapp ; 45(5): e26562, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38590154

ABSTRACT

The goal of this study was to examine what happens to established associations between attention deficit hyperactivity disorder (ADHD) symptoms and cortical surface and thickness regions once we apply inverse probability of censoring weighting (IPCW) to address potential selection bias. Moreover, we illustrate how different factors that predict participation contribute to potential selection bias. Participants were 9- to 11-year-old children from the Generation R study (N = 2707). Cortical area and thickness were measured with magnetic resonance imaging (MRI) and ADHD symptoms with the Child Behavior Checklist. We examined how associations between ADHD symptoms and brain morphology change when we weight our sample back to either follow-up (ages 9-11), baseline (cohort at birth), or eligible (population of Rotterdam at time of recruitment). Weights were derived using IPCW or raking and missing predictors of participation used to estimate weights were imputed. Weighting analyses to baseline and eligible increased beta coefficients for the middle temporal gyrus surface area, as well as fusiform gyrus cortical thickness. Alternatively, the beta coefficient for the rostral anterior cingulate decreased. Removing one group of variables used for estimating weights resulted in the weighted regression coefficient moving closer to the unweighted regression coefficient. In addition, we found considerably different beta coefficients for most surface area regions and all thickness measures when we did not impute missing covariate data. Our findings highlight the importance of using inverse probability weighting (IPW) in the neuroimaging field, especially in the context of mental health-related research. We found that including all variables related to exposure-outcome in the IPW model and combining IPW with multiple imputations can help reduce bias. We encourage future psychiatric neuroimaging studies to define their target population, collect information on eligible but not included participants and use inverse probability of censoring weighting (IPCW) to reduce selection bias.


Subject(s)
Attention Deficit Disorder with Hyperactivity , Child , Infant, Newborn , Humans , Selection Bias , Attention Deficit Disorder with Hyperactivity/pathology , Probability , Bias , Temporal Lobe/pathology
10.
Epidemiology ; 35(4): e17, 2024 Jul 01.
Article in English | MEDLINE | ID: mdl-38648109
11.
Cancer Med ; 13(7): e6966, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38572962

ABSTRACT

OBJECTIVE: Examine the influence of household income on health-related quality of life (HRQOL) among children with newly diagnosed acute myeloid leukemia (AML). DESIGN: Secondary analysis of data prospectively collected from pediatric patients receiving treatment for AML at 14 hospitals across the United States. EXPOSURE: Household income was self-reported on a demographic survey. The examined mediators included the acuity of presentation and treatment toxicity. OUTCOME: Caregiver proxy reported assessment of patient HRQOL from the Peds QL 4.0 survey. RESULT: Children with AML (n = 131) and caregivers were prospectively enrolled to complete PedsQL assessments. HRQOL scores were better for patients in the lowest versus highest income category (mean ± SD: 76.0 ± 14 household income <$25,000 vs. 59.9 ± 17 income ≥$75,000; adjusted mean difference: 11.2, 95% CI: 2.2-20.2). Seven percent of enrolled patients presented with high acuity (ICU-level care in the first 72 h), and 16% had high toxicity (any ICU-level care); there were no identifiable differences by income, refuting mediating roles in the association between income and HRQOL. Enrolled patients were less likely to be Black/African American (9.9% vs. 22.2%), more likely to be privately insured (50.4% vs. 40.7%), and more likely to have been treated on a clinical trial (26.7% vs. 18.5%) compared to eligible unenrolled patients not enrolled. Evaluations of potential selection bias on the association between income and HRQOL suggested differences in HRQOL may be smaller than observed or even in the opposing direction. CONCLUSIONS: While primary analyses suggested lower household income was associated with superior HRQOL, differential participation may have biased these results. Future studies should partner with patients/families to identify strategies for equitable participation in clinical research.


Subject(s)
Health Equity , Leukemia, Myeloid, Acute , Child , Humans , Leukemia, Myeloid, Acute/epidemiology , Leukemia, Myeloid, Acute/therapy , Quality of Life , Selection Bias , Surveys and Questionnaires , Clinical Trials as Topic
12.
PLoS One ; 19(4): e0302126, 2024.
Article in English | MEDLINE | ID: mdl-38625968

ABSTRACT

The St. Lawrence River is an important North American waterway that is subject to anthropogenic pressures including intensive urbanization, and agricultural development. Pesticides are widely used for agricultural activities in fields surrounding the yellow perch (Perca flavescens) habitat in Lake St. Pierre (Quebec, Canada), a fluvial lake of the river where the perch population has collapsed. Clothianidin and chlorantraniliprole were two of the most detected insecticides in surface waters near perch spawning areas. The objectives of the present study were to evaluate the transcriptional and biochemical effects of these two pesticides on juvenile yellow perch exposed for 28d to environmental doses of each compound alone and in a mixture under laboratory/aquaria conditions. Hepatic mRNA-sequencing revealed an effect of chlorantraniliprole alone (37 genes) and combined with clothianidin (251 genes), but no effects of clothianidin alone were observed in perch. Dysregulated genes were mostly related to circadian rhythms and to Ca2+ signaling, the latter effect has been previously associated with chlorantraniliprole mode of action in insects. Moreover, chronic exposure to clothianidin increased the activity of acetylcholinesterase in the brain of exposed fish, suggesting a potential non-target effect of this insecticide. Further analyses of three clock genes by qRT-PCR suggested that part of the observed effects of chlorantraniliprole on the circadian gene regulation of juvenile perch could be the result of time-of-day of sacrifice. These results provide insight into biological effects of insecticides in juvenile perch and highlight the importance of considering the circadian rhythm in experimental design and results analyses.


Subject(s)
Guanidines , Insecticides , Neonicotinoids , Perches , Thiazoles , Water Pollutants, Chemical , ortho-Aminobenzoates , Animals , Perches/genetics , Insecticides/toxicity , Insecticides/analysis , Acetylcholinesterase , Selection Bias , Gene Expression Profiling , Water Pollutants, Chemical/toxicity , Water Pollutants, Chemical/analysis
14.
Article in English | MEDLINE | ID: mdl-38673378

ABSTRACT

The objective of this study is to analyse the effects of attended school type and class level on the reported caries experience (DMFT) obtained in the serial cross-sectional National Oral Health Study in Children in Germany (NOHSC) for the WHO reference group of 12-year-olds. METHODS: Caries data from the 2016 NOHSC were adjusted for each federal state on the basis of two additional large-scale datasets for school type and class level. RESULTS: Twelve-year-olds in all grades in Saxony-Anhalt (n = 96,842) exhibited significantly higher DMFT values than 12-year-olds in 6th grade (n = 76,456; +0.10 DMFT; ~14.2%, p < 0.001). Adjustments for school type had effects on DMFT on the level of federal states but almost balanced out on the national level (-0.01 DMFT; ~2%). Due to putatively similar structures of the federal states, the national mean DMFT for 12-year-olds in the latest NOHSC (2016; n = 55,002) was adjusted from 0.44 to 0.50 DMFT, correcting for selection bias. CONCLUSION: Selection bias in this NOHSC leads to an underestimation of caries levels by about 15%. Due to very low caries experience in children in Germany, these precise adjustments (+0.06 DMFT) have only a minor effect on interpretations of the national epidemiologic situation. Consequently, other national caries studies worldwide using the robust marker of DMFT should also adjust for systematic selection bias related to socio-economic background rather than increasing efforts in examination strategy.


Subject(s)
Dental Caries , Schools , Humans , Dental Caries/epidemiology , Germany/epidemiology , Child , Cross-Sectional Studies , Female , Male , Schools/statistics & numerical data , Selection Bias , Dental Health Surveys , Oral Health/statistics & numerical data
15.
Epidemiology ; 35(3): 281-288, 2024 May 01.
Article in English | MEDLINE | ID: mdl-38442423

ABSTRACT

BACKGROUND: Several observational studies have described an inverse association between cancer diagnosis and subsequent dementia risk. Multiple biologic mechanisms and potential biases have been proposed in attempts to explain this association. One proposed explanation is the opposite expression of Pin1 in cancer and dementia, and we use this explanation and potential drug target to illustrate the required assumptions and potential sources of bias for inferring an effect of Pin1 on dementia risk from analyses measuring cancer diagnosis as a proxy for Pin1 expression. METHODS: We used data from the Rotterdam Study, a population-based cohort. We estimate the association between cancer diagnosis (as a proxy for Pin1) and subsequent dementia diagnosis using two different proxy methods and with confounding and censoring for death addressed with inverse probability weights. We estimate and compare the complements of a weighted Kaplan-Meier survival estimator at 20 years of follow-up. RESULTS: Out of 3634 participants, 899 (25%) were diagnosed with cancer, of whom 53 (6%) had dementia, and 567 (63%) died. Among those without cancer, 15% (411) were diagnosed with dementia, and 667 (24%) died over follow-up. Depending on the confounding and selection bias control, and the way in which cancer was used as a time-varying proxy exposure, the risk ratio for dementia diagnosis ranged from 0.71 (95% confidence interval [CI] = 0.49, 0.95) to 1.1 (95% CI = 0.79, 1.3). CONCLUSION: Being explicit about the underlying mechanism of interest is key to maximizing what we can learn from this cancer-dementia association given available or readily collected data, and to defining, detecting, and preventing potential biases.


Subject(s)
Dementia , Neoplasms , Humans , Probability , Bias , Selection Bias , Neoplasms/epidemiology , Dementia/epidemiology , Dementia/diagnosis
16.
Lifetime Data Anal ; 30(2): 383-403, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38466520

ABSTRACT

Hazard ratios are prone to selection bias, compromising their use as causal estimands. On the other hand, if Aalen's additive hazard model applies, the hazard difference has been shown to remain unaffected by the selection of frailty factors over time. Then, in the absence of confounding, observed hazard differences are equal in expectation to the causal hazard differences. However, in the presence of effect (on the hazard) heterogeneity, the observed hazard difference is also affected by selection of survivors. In this work, we formalize how the observed hazard difference (from a randomized controlled trial) evolves by selecting favourable levels of effect modifiers in the exposed group and thus deviates from the causal effect of interest. Such selection may result in a non-linear integrated hazard difference curve even when the individual causal effects are time-invariant. Therefore, a homogeneous time-varying causal additive effect on the hazard cannot be distinguished from a time-invariant but heterogeneous causal effect. We illustrate this causal issue by studying the effect of chemotherapy on the survival time of patients suffering from carcinoma of the oropharynx using data from a clinical trial. The hazard difference can thus not be used as an appropriate measure of the causal effect without making untestable assumptions.


Subject(s)
Proportional Hazards Models , Humans , Bias , Selection Bias , Causality
17.
Stat Med ; 43(10): 1993-2006, 2024 May 10.
Article in English | MEDLINE | ID: mdl-38442874

ABSTRACT

When designing confirmatory Phase 3 studies, one usually evaluates one or more efficacious and safe treatment option(s) based on data from previous studies. However, several retrospective research articles reported the phenomenon of "diminished treatment effect in Phase 3" based on many case studies. Even under basic assumptions, it was shown that the commonly used estimator could substantially overestimate the efficacy of selected group(s). As alternatives, we propose a class of computational methods to reduce estimation bias and mean squared error with a broader scope of multiple treatment groups and flexibility to accommodate summary results by group as input. Based on simulation studies and a real data example, we provide practical implementation guidance for this class of methods under different scenarios. For more complicated problems, our framework can serve as a starting point with additional layers built in. Proposed methods can also be widely applied to other selection problems.


Subject(s)
Research Design , Humans , Selection Bias , Retrospective Studies , Computer Simulation , Bias
18.
Nat Commun ; 15(1): 2499, 2024 Mar 20.
Article in English | MEDLINE | ID: mdl-38509066

ABSTRACT

Malaria genomic surveillance often estimates parasite genetic relatedness using metrics such as Identity-By-Decent (IBD), yet strong positive selection stemming from antimalarial drug resistance or other interventions may bias IBD-based estimates. In this study, we use simulations, a true IBD inference algorithm, and empirical data sets from different malaria transmission settings to investigate the extent of this bias and explore potential correction strategies. We analyze whole genome sequence data generated from 640 new and 3089 publicly available Plasmodium falciparum clinical isolates. We demonstrate that positive selection distorts IBD distributions, leading to underestimated effective population size and blurred population structure. Additionally, we discover that the removal of IBD peak regions partially restores the accuracy of IBD-based inferences, with this effect contingent on the population's background genetic relatedness and extent of inbreeding. Consequently, we advocate for selection correction for parasite populations undergoing strong, recent positive selection, particularly in high malaria transmission settings.


Subject(s)
Antimalarials , Malaria, Falciparum , Humans , Plasmodium falciparum , Malaria, Falciparum/parasitology , Selection Bias , Antimalarials/pharmacology , Demography
19.
J Am Med Inform Assoc ; 31(5): 1172-1183, 2024 Apr 19.
Article in English | MEDLINE | ID: mdl-38520723

ABSTRACT

OBJECTIVES: Leveraging artificial intelligence (AI) in conjunction with electronic health records (EHRs) holds transformative potential to improve healthcare. However, addressing bias in AI, which risks worsening healthcare disparities, cannot be overlooked. This study reviews methods to handle various biases in AI models developed using EHR data. MATERIALS AND METHODS: We conducted a systematic review following the Preferred Reporting Items for Systematic Reviews and Meta-analyses guidelines, analyzing articles from PubMed, Web of Science, and IEEE published between January 01, 2010 and December 17, 2023. The review identified key biases, outlined strategies for detecting and mitigating bias throughout the AI model development, and analyzed metrics for bias assessment. RESULTS: Of the 450 articles retrieved, 20 met our criteria, revealing 6 major bias types: algorithmic, confounding, implicit, measurement, selection, and temporal. The AI models were primarily developed for predictive tasks, yet none have been deployed in real-world healthcare settings. Five studies concentrated on the detection of implicit and algorithmic biases employing fairness metrics like statistical parity, equal opportunity, and predictive equity. Fifteen studies proposed strategies for mitigating biases, especially targeting implicit and selection biases. These strategies, evaluated through both performance and fairness metrics, predominantly involved data collection and preprocessing techniques like resampling and reweighting. DISCUSSION: This review highlights evolving strategies to mitigate bias in EHR-based AI models, emphasizing the urgent need for both standardized and detailed reporting of the methodologies and systematic real-world testing and evaluation. Such measures are essential for gauging models' practical impact and fostering ethical AI that ensures fairness and equity in healthcare.


Subject(s)
Artificial Intelligence , Electronic Health Records , Female , Pregnancy , Humans , Bias , Selection Bias , Benchmarking
20.
Biometrics ; 80(1)2024 Jan 29.
Article in English | MEDLINE | ID: mdl-38488466

ABSTRACT

Electronic health records (EHRs) contain rich clinical information for millions of patients and are increasingly used for public health research. However, non-random inclusion of subjects in EHRs can result in selection bias, with factors such as demographics, socioeconomic status, healthcare referral patterns, and underlying health status playing a role. While this issue has been well documented, little work has been done to develop or apply bias-correction methods, often due to the fact that most of these factors are unavailable in EHRs. To address this gap, we propose a series of Heckman type bias correction methods by incorporating social determinants of health selection covariates to model the EHR non-random sampling probability. Through simulations under various settings, we demonstrate the effectiveness of our proposed method in correcting biases in both the association coefficient and the outcome mean. Our method augments the utility of EHRs for public health inferences, as we show by estimating the prevalence of cardiovascular disease and its correlation with risk factors in the New York City network of EHRs.


Subject(s)
Electronic Health Records , Health Status , Humans , Selection Bias , Risk Factors , Bias
SELECTION OF CITATIONS
SEARCH DETAIL