RESUMEN
BACKGROUND: Long COVID, also known as post-acute sequelae of COVID-19 (PASC), is a poorly understood condition with symptoms across a range of biological domains that often have debilitating consequences. Some have recently suggested that lingering SARS-CoV-2 virus particles in the gut may impede serotonin production and that low serotonin may drive many Long COVID symptoms across a range of biological systems. Therefore, selective serotonin reuptake inhibitors (SSRIs), which increase synaptic serotonin availability, may be used to prevent or treat Long COVID. SSRIs are commonly prescribed for depression, therefore restricting a study sample to only include patients with depression can reduce the concern of confounding by indication. METHODS: In an observational sample of electronic health records from patients in the National COVID Cohort Collaborative (N3C) with a COVID-19 diagnosis between September 1, 2021, and December 1, 2022, and a comorbid depressive disorder, the leading indication for SSRI use, we evaluated the relationship between SSRI use during acute COVID-19 and subsequent 12-month risk of Long COVID (defined by ICD-10 code U09.9). We defined SSRI use as a prescription for SSRI medication beginning at least 30 days before acute COVID-19 and not ending before SARS-CoV-2 infection. To minimize bias, we estimated relationships using nonparametric targeted maximum likelihood estimation to aggressively adjust for high-dimensional covariates. RESULTS: We analyzed a sample (n = 302,626) of patients with a diagnosis of a depressive condition before COVID-19 diagnosis, where 100,803 (33%) were using an SSRI. We found that SSRI users had a significantly lower risk of Long COVID compared to nonusers (adjusted causal relative risk 0.92, 95% CI (0.86, 0.99)) and we found a similar relationship comparing new SSRI users (first SSRI prescription 1 to 4 months before acute COVID-19 with no prior history of SSRI use) to nonusers (adjusted causal relative risk 0.89, 95% CI (0.80, 0.98)). CONCLUSIONS: These findings suggest that SSRI use during acute COVID-19 may be protective against Long COVID, supporting the hypothesis that serotonin may be a key mechanistic biomarker of Long COVID.
Asunto(s)
COVID-19 , SARS-CoV-2 , Inhibidores Selectivos de la Recaptación de Serotonina , Humanos , COVID-19/epidemiología , COVID-19/complicaciones , Inhibidores Selectivos de la Recaptación de Serotonina/uso terapéutico , Femenino , Masculino , Persona de Mediana Edad , SARS-CoV-2/efectos de los fármacos , Adulto , Anciano , Depresión/tratamiento farmacológico , Pandemias , Síndrome Post Agudo de COVID-19 , Infecciones por Coronavirus/tratamiento farmacológico , Infecciones por Coronavirus/epidemiología , Infecciones por Coronavirus/complicaciones , Betacoronavirus/efectos de los fármacos , Neumonía Viral/tratamiento farmacológico , Neumonía Viral/epidemiología , Factores de RiesgoRESUMEN
BACKGROUND: Postacute sequelae of COVID-19 (PASC), also known as long COVID, is a broad grouping of a range of long-term symptoms following acute COVID-19. These symptoms can occur across a range of biological systems, leading to challenges in determining risk factors for PASC and the causal etiology of this disorder. An understanding of characteristics that are predictive of future PASC is valuable, as this can inform the identification of high-risk individuals and future preventative efforts. However, current knowledge regarding PASC risk factors is limited. OBJECTIVE: Using a sample of 55,257 patients (at a ratio of 1 patient with PASC to 4 matched controls) from the National COVID Cohort Collaborative, as part of the National Institutes of Health Long COVID Computational Challenge, we sought to predict individual risk of PASC diagnosis from a curated set of clinically informed covariates. The National COVID Cohort Collaborative includes electronic health records for more than 22 million patients from 84 sites across the United States. METHODS: We predicted individual PASC status, given covariate information, using Super Learner (an ensemble machine learning algorithm also known as stacking) to learn the optimal combination of gradient boosting and random forest algorithms to maximize the area under the receiver operator curve. We evaluated variable importance (Shapley values) based on 3 levels: individual features, temporal windows, and clinical domains. We externally validated these findings using a holdout set of randomly selected study sites. RESULTS: We were able to predict individual PASC diagnoses accurately (area under the curve 0.874). The individual features of the length of observation period, number of health care interactions during acute COVID-19, and viral lower respiratory infection were the most predictive of subsequent PASC diagnosis. Temporally, we found that baseline characteristics were the most predictive of future PASC diagnosis, compared with characteristics immediately before, during, or after acute COVID-19. We found that the clinical domains of health care use, demographics or anthropometry, and respiratory factors were the most predictive of PASC diagnosis. CONCLUSIONS: The methods outlined here provide an open-source, applied example of using Super Learner to predict PASC status using electronic health record data, which can be replicated across a variety of settings. Across individual predictors and clinical domains, we consistently found that factors related to health care use were the strongest predictors of PASC diagnosis. This indicates that any observational studies using PASC diagnosis as a primary outcome must rigorously account for heterogeneous health care use. Our temporal findings support the hypothesis that clinicians may be able to accurately assess the risk of PASC in patients before acute COVID-19 diagnosis, which could improve early interventions and preventive care. Our findings also highlight the importance of respiratory characteristics in PASC risk assessment. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID): RR2-10.1101/2023.07.27.23293272.
Asunto(s)
COVID-19 , Síndrome Post Agudo de COVID-19 , Humanos , COVID-19/epidemiología , Estudios de Cohortes , Femenino , Masculino , Estados Unidos/epidemiología , Persona de Mediana Edad , Anciano , Adulto , Factores de Riesgo , Aprendizaje AutomáticoRESUMEN
Background: Water, sanitation, hygiene (WSH), nutrition (N), and combined (N+WSH) interventions are often implemented by global health organizations, but WSH interventions may insufficiently reduce pathogen exposure, and nutrition interventions may be modified by environmental enteric dysfunction (EED), a condition of increased intestinal permeability and inflammation. This study investigated the heterogeneity of these treatments' effects based on individual pathogen and EED biomarker status with respect to child linear growth. Methods: We applied cross-validated targeted maximum likelihood estimation and super learner ensemble machine learning to assess the conditional treatment effects in subgroups defined by biomarker and pathogen status. We analyzed treatment (N+WSH, WSH, N, or control) randomly assigned in-utero, child pathogen and EED data at 14 months of age, and child LAZ at 28 months of age. We estimated the difference in mean child length for age Z-score (LAZ) under the treatment rule and the difference in stratified treatment effect (treatment effect difference) comparing children with high versus low pathogen/biomarker status while controlling for baseline covariates. Results: We analyzed data from 1,522 children, who had median LAZ of -1.56. We found that myeloperoxidase (N+WSH treatment effect difference 0.0007 LAZ, WSH treatment effect difference 0.1032 LAZ, N treatment effect difference 0.0037 LAZ) and Campylobacter infection (N+WSH treatment effect difference 0.0011 LAZ, WSH difference 0.0119 LAZ, N difference 0.0255 LAZ) were associated with greater effect of all interventions on growth. In other words, children with high myeloperoxidase or Campylobacter infection experienced a greater impact of the interventions on growth. We found that a treatment rule that assigned the N+WSH (LAZ difference 0.23, 95% CI (0.05, 0.41)) and WSH (LAZ difference 0.17, 95% CI (0.04, 0.30)) interventions based on EED biomarkers and pathogens increased predicted child growth compared to the randomly allocated intervention. Conclusions: These findings indicate that EED biomarker and pathogen status, particularly Campylobacter and myeloperoxidase (a measure of gut inflammation), may be related to impact of N+WSH, WSH, and N interventions on child linear growth.
RESUMEN
BACKGROUND: While many patients seem to recover from SARS-CoV-2 infections, many patients report experiencing SARS-CoV-2 symptoms for weeks or months after their acute COVID-19 ends, even developing new symptoms weeks after infection. These long-term effects are called post-acute sequelae of SARS-CoV-2 (PASC) or, more commonly, Long COVID. The overall prevalence of Long COVID is currently unknown, and tools are needed to help identify patients at risk for developing long COVID. METHODS: A working group of the Rapid Acceleration of Diagnostics-radical (RADx-rad) program, comprised of individuals from various NIH institutes and centers, in collaboration with REsearching COVID to Enhance Recovery (RECOVER) developed and organized the Long COVID Computational Challenge (L3C), a community challenge aimed at incentivizing the broader scientific community to develop interpretable and accurate methods for identifying patients at risk of developing Long COVID. From August 2022 to December 2022, participants developed Long COVID risk prediction algorithms using the National COVID Cohort Collaborative (N3C) data enclave, a harmonized data repository from over 75 healthcare institutions from across the United States (U.S.). FINDINGS: Over the course of the challenge, 74 teams designed and built 35 Long COVID prediction models using the N3C data enclave. The top 10 teams all scored above a 0.80 Area Under the Receiver Operator Curve (AUROC) with the highest scoring model achieving a mean AUROC of 0.895. Included in the top submission was a visualization dashboard that built timelines for each patient, updating the risk of a patient developing Long COVID in response to clinical events. INTERPRETATION: As a result of L3C, federal reviewers identified multiple machine learning models that can be used to identify patients at risk for developing Long COVID. Many of the teams used approaches in their submissions which can be applied to future clinical prediction questions. FUNDING: Research reported in this RADx® Rad publication was supported by the National Institutes of Health. Timothy Bergquist, Johanna Loomba, and Emily Pfaff were supported by Axle Subcontract: NCATS-STSS-P00438.