Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 190
Filtrar
1.
Res Sq ; 2024 Apr 26.
Artículo en Inglés | MEDLINE | ID: mdl-38746290

RESUMEN

Estimates of post-acute sequelae of SARS-CoV-2 infection (PASC) incidence, also known as Long COVID, have varied across studies and changed over time. We estimated PASC incidence among adult and pediatric populations in three nationwide research networks of electronic health records (EHR) participating in the RECOVER Initiative using different classification algorithms (computable phenotypes). Overall, 7% of children and 8.5%-26.4% of adults developed PASC, depending on computable phenotype used. Excess incidence among SARS-CoV-2 patients was 4% in children and ranged from 4-7% among adults, representing a lower-bound incidence estimation based on two control groups - contemporary COVID-19 negative and historical patients (2019). Temporal patterns were consistent across networks, with peaks associated with introduction of new viral variants. Our findings indicate that preventing and mitigating Long COVID remains a public health priority. Examining temporal patterns and risk factors of PASC incidence informs our understanding of etiology and can improve prevention and management.

2.
JMIR Mhealth Uhealth ; 12: e54622, 2024 May 02.
Artículo en Inglés | MEDLINE | ID: mdl-38696234

RESUMEN

BACKGROUND: Postpartum depression (PPD) poses a significant maternal health challenge. The current approach to detecting PPD relies on in-person postpartum visits, which contributes to underdiagnosis. Furthermore, recognizing PPD symptoms can be challenging. Therefore, we explored the potential of using digital biomarkers from consumer wearables for PPD recognition. OBJECTIVE: The main goal of this study was to showcase the viability of using machine learning (ML) and digital biomarkers related to heart rate, physical activity, and energy expenditure derived from consumer-grade wearables for the recognition of PPD. METHODS: Using the All of Us Research Program Registered Tier v6 data set, we performed computational phenotyping of women with and without PPD following childbirth. Intraindividual ML models were developed using digital biomarkers from Fitbit to discern between prepregnancy, pregnancy, postpartum without depression, and postpartum with depression (ie, PPD diagnosis) periods. Models were built using generalized linear models, random forest, support vector machine, and k-nearest neighbor algorithms and evaluated using the κ statistic and multiclass area under the receiver operating characteristic curve (mAUC) to determine the algorithm with the best performance. The specificity of our individualized ML approach was confirmed in a cohort of women who gave birth and did not experience PPD. Moreover, we assessed the impact of a previous history of depression on model performance. We determined the variable importance for predicting the PPD period using Shapley additive explanations and confirmed the results using a permutation approach. Finally, we compared our individualized ML methodology against a traditional cohort-based ML model for PPD recognition and compared model performance using sensitivity, specificity, precision, recall, and F1-score. RESULTS: Patient cohorts of women with valid Fitbit data who gave birth included <20 with PPD and 39 without PPD. Our results demonstrated that intraindividual models using digital biomarkers discerned among prepregnancy, pregnancy, postpartum without depression, and postpartum with depression (ie, PPD diagnosis) periods, with random forest (mAUC=0.85; κ=0.80) models outperforming generalized linear models (mAUC=0.82; κ=0.74), support vector machine (mAUC=0.75; κ=0.72), and k-nearest neighbor (mAUC=0.74; κ=0.62). Model performance decreased in women without PPD, illustrating the method's specificity. Previous depression history did not impact the efficacy of the model for PPD recognition. Moreover, we found that the most predictive biomarker of PPD was calories burned during the basal metabolic rate. Finally, individualized models surpassed the performance of a conventional cohort-based model for PPD detection. CONCLUSIONS: This research establishes consumer wearables as a promising tool for PPD identification and highlights personalized ML approaches, which could transform early disease detection strategies.


Asunto(s)
Biomarcadores , Depresión Posparto , Dispositivos Electrónicos Vestibles , Humanos , Depresión Posparto/diagnóstico , Depresión Posparto/psicología , Femenino , Adulto , Biomarcadores/análisis , Estudios Transversales , Dispositivos Electrónicos Vestibles/estadística & datos numéricos , Dispositivos Electrónicos Vestibles/normas , Aprendizaje Automático/normas , Embarazo , Estados Unidos , Conjuntos de Datos como Asunto , Curva ROC
3.
Int J Med Inform ; 187: 105461, 2024 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-38643701

RESUMEN

OBJECTIVE: Female reproductive disorders (FRDs) are common health conditions that may present with significant symptoms. Diet and environment are potential areas for FRD interventions. We utilized a knowledge graph (KG) method to predict factors associated with common FRDs (for example, endometriosis, ovarian cyst, and uterine fibroids). MATERIALS AND METHODS: We harmonized survey data from the Personalized Environment and Genes Study (PEGS) on internal and external environmental exposures and health conditions with biomedical ontology content. We merged the harmonized data and ontologies with supplemental nutrient and agricultural chemical data to create a KG. We analyzed the KG by embedding edges and applying a random forest for edge prediction to identify variables potentially associated with FRDs. We also conducted logistic regression analysis for comparison. RESULTS: Across 9765 PEGS respondents, the KG analysis resulted in 8535 significant or suggestive predicted links between FRDs and chemicals, phenotypes, and diseases. Amongst these links, 32 were exact matches when compared with the logistic regression results, including comorbidities, medications, foods, and occupational exposures. DISCUSSION: Mechanistic underpinnings of predicted links documented in the literature may support some of our findings. Our KG methods are useful for predicting possible associations in large, survey-based datasets with added information on directionality and magnitude of effect from logistic regression. These results should not be construed as causal but can support hypothesis generation. CONCLUSION: This investigation enabled the generation of hypotheses on a variety of potential links between FRDs and exposures. Future investigations should prospectively evaluate the variables hypothesized to impact FRDs.


Asunto(s)
Exposición a Riesgos Ambientales , Humanos , Femenino , Exposición a Riesgos Ambientales/efectos adversos , Enfermedades de los Genitales Femeninos , Modelos Logísticos , Estado Nutricional , Dieta , Adulto , Bosques Aleatorios
4.
Diabetes Ther ; 15(5): 1169-1186, 2024 May.
Artículo en Inglés | MEDLINE | ID: mdl-38536629

RESUMEN

INTRODUCTION: People with type 2 diabetes are at heightened risk for severe outcomes related to COVID-19 infection, including hospitalization, intensive care unit admission, and mortality. This study was designed to examine the impact of premorbid use of glucagon-like peptide-1 receptor agonist (GLP-1RA) monotherapy, sodium-glucose cotransporter-2 inhibitor (SGLT-2i) monotherapy, and concomitant GLP1-RA/SGLT-2i therapy on the severity of outcomes in individuals with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection. METHODS: Utilizing observational data from the National COVID Cohort Collaborative through September 2022, we compared outcomes in 78,806 individuals with a prescription of GLP-1RA and SGLT-2i versus a prescription of dipeptidyl peptidase 4 inhibitors (DPP-4i) within 24 months of a positive SARS-CoV-2 PCR test. We also compared concomitant GLP-1RA/SGLT-2i therapy to GLP-1RA and SGLT-2i monotherapy. The primary outcome was 60-day mortality, measured from the positive test date. Secondary outcomes included emergency room (ER) visits, hospitalization, and mechanical ventilation within 14 days. Using a super learner approach and accounting for baseline characteristics, associations were quantified with odds ratios (OR) estimated with targeted maximum likelihood estimation (TMLE). RESULTS: Use of GLP-1RA (OR 0.64, 95% confidence interval [CI] 0.56-0.72) and SGLT-2i (OR 0.62, 95% CI 0.57-0.68) were associated with lower odds of 60-day mortality compared to DPP-4i use. Additionally, the OR of ER visits and hospitalizations were similarly reduced with GLP1-RA and SGLT-2i use. Concomitant GLP-1RA/SGLT-2i use showed similar odds of 60-day mortality when compared to GLP-1RA or SGLT-2i use alone (OR 0.92, 95% CI 0.81-1.05 and OR 0.88, 95% CI 0.76-1.01, respectively). However, lower OR of all secondary outcomes were associated with concomitant GLP-1RA/SGLT-2i use when compared to SGLT-2i use alone. CONCLUSION: Among adults who tested positive for SARS-CoV-2, premorbid use of either GLP-1RA or SGLT-2i is associated with lower odds of mortality compared to DPP-4i. Furthermore, concomitant use of GLP-1RA and SGLT-2i is linked to lower odds of other severe COVID-19 outcomes, including ER visits, hospitalizations, and mechanical ventilation, compared to SGLT-2i use alone. Graphical abstract available for this article.

5.
Healthc (Amst) ; 12(2): 100738, 2024 Mar 25.
Artículo en Inglés | MEDLINE | ID: mdl-38531228

RESUMEN

The COVID-19 pandemic generated tremendous interest in using real world data (RWD). Many consortia across the public and private sectors formed in 2020 with the goal of rapidly producing high-quality evidence from RWD to guide medical decision-making, public health priorities, and more. Experiences were gathered from five large consortia on rapid multi-institutional evidence generation during the COVID-19 pandemic. Insights have been compiled across five dimensions: consortium composition, governance structure and alignment of priorities, data sharing, data analysis, and evidence dissemination. The purpose of this piece is to offer guidance on building large-scale multi-institutional RWD analysis pipelines for future public health issues. The composition of each consortium was largely influenced by existing collaborations. A central set of priorities for evidence generation guided each consortium, however different approaches to governance emerged. Challenges surrounding limited access to clinical data due to various contributors were overcome in unique ways. While all consortia used different methods to construct and analyze patient cohorts ranging from centralized to federated approaches, all proved effective for generating meaningful real-world evidence. Actionable recommendations for clinical practice and public health agencies were made from translating insights from consortium analyses. Each consortium was successful in rapidly answering questions about COVID-19 diagnosis and treatment despite all taking slightly different approaches to data sharing and analysis. Leveraging RWD, leveraged in a manner that applies scientific rigor and transparency, can complement higher-level evidence and serve as an important adjunct to clinical trials to quickly guide policy and critical care, especially for a pandemic response.

6.
medRxiv ; 2024 Apr 04.
Artículo en Inglés | MEDLINE | ID: mdl-38343863

RESUMEN

Preventing and treating post-acute sequelae of SARS-CoV-2 infection (PASC), commonly known as Long COVID, has become a public health priority. In this study, we examined whether treatment with Paxlovid in the acute phase of COVID-19 helps prevent the onset of PASC. We used electronic health records from the National Covid Cohort Collaborative (N3C) to define a cohort of 426,352 patients who had COVID-19 since April 1, 2022, and were eligible for Paxlovid treatment due to risk for progression to severe COVID-19. We used the target trial emulation (TTE) framework to estimate the effect of Paxlovid treatment on PASC incidence. We estimated overall PASC incidence using a computable phenotype. We also measured the onset of novel cognitive, fatigue, and respiratory symptoms in the post-acute period. Paxlovid treatment did not have a significant effect on overall PASC incidence (relative risk [RR] = 0.98, 95% confidence interval [CI] 0.95-1.01). However, it had a protective effect on cognitive (RR = 0.90, 95% CI 0.84-0.96) and fatigue (RR = 0.95, 95% CI 0.91-0.98) symptom clusters, which suggests that the etiology of these symptoms may be more closely related to viral load than that of respiratory symptoms.

7.
Open Forum Infect Dis ; 11(2): ofae019, 2024 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-38379569

RESUMEN

Background: Real-world evidence of coronavirus disease 2019 (COVID-19) messenger RNA (mRNA) booster effectiveness among patients with immune dysfunction are limited. Methods: We included data from patients in the United States National COVID Cohort Collaborative (N3C) who completed ≥2 doses of mRNA vaccination between 10 December 2020 and 27 May 2022. Immune dysfunction conditions included human immunodeficiency virus infection, solid organ or bone marrow transplant, autoimmune diseases, and cancer. We defined incident COVID-19 BTI as positive results from laboratory tests or diagnostic codes 14 days after at least 2 doses of mRNA vaccination; and severe COVID-19 BTI as hospitalization, invasive cardiopulmonary support, and/or death. We used propensity scores to match boosted versus nonboosted patients and evaluated hazards of incident and severe COVID-19 BTI using Cox regression after matching. Results: Among patients without immune dysfunction, the relative effectiveness of booster (3 doses) after 6 months from the primary (2 doses) vaccination against BTI ranged from 69% to 81% during the Delta-predominant period and from 33% to 39% during the Omicron-predominant period. Relative effectiveness against BTI was lower among patients with immune dysfunction but remained statistically significant in both periods. Boosted patients had lower risk of COVID-19-related hospitalization (hazard ratios [HR] ranged from 0.5 [95% confidence interval {CI}, .48-.53] to 0.63 [95% CI, .56-.70]), invasive cardiopulmonary support, or death (HRs ranged from 0.46 [95% CI, .41-.52] to 0.63 [95% CI, .50-.79]) during both periods. Conclusions: Booster vaccines remain effective against severe COVID-19 BTI throughout the Delta- and Omicron-predominant periods, regardless of patients' immune status.

9.
Bioinformatics ; 40(3)2024 Mar 04.
Artículo en Inglés | MEDLINE | ID: mdl-38383067

RESUMEN

MOTIVATION: Creating knowledge bases and ontologies is a time consuming task that relies on manual curation. AI/NLP approaches can assist expert curators in populating these knowledge bases, but current approaches rely on extensive training data, and are not able to populate arbitrarily complex nested knowledge schemas. RESULTS: Here we present Structured Prompt Interrogation and Recursive Extraction of Semantics (SPIRES), a Knowledge Extraction approach that relies on the ability of Large Language Models (LLMs) to perform zero-shot learning and general-purpose query answering from flexible prompts and return information conforming to a specified schema. Given a detailed, user-defined knowledge schema and an input text, SPIRES recursively performs prompt interrogation against an LLM to obtain a set of responses matching the provided schema. SPIRES uses existing ontologies and vocabularies to provide identifiers for matched elements. We present examples of applying SPIRES in different domains, including extraction of food recipes, multi-species cellular signaling pathways, disease treatments, multi-step drug mechanisms, and chemical to disease relationships. Current SPIRES accuracy is comparable to the mid-range of existing Relation Extraction methods, but greatly surpasses an LLM's native capability of grounding entities with unique identifiers. SPIRES has the advantage of easy customization, flexibility, and, crucially, the ability to perform new tasks in the absence of any new training data. This method supports a general strategy of leveraging the language interpreting capabilities of LLMs to assemble knowledge bases, assisting manual knowledge curation and acquisition while supporting validation with publicly-available databases and ontologies external to the LLM. AVAILABILITY AND IMPLEMENTATION: SPIRES is available as part of the open source OntoGPT package: https://github.com/monarch-initiative/ontogpt.


Asunto(s)
Bases del Conocimiento , Semántica , Bases de Datos Factuales
10.
BMC Med Inform Decis Mak ; 24(1): 30, 2024 Jan 31.
Artículo en Inglés | MEDLINE | ID: mdl-38297371

RESUMEN

OBJECTIVE: Clinical deep phenotyping and phenotype annotation play a critical role in both the diagnosis of patients with rare disorders as well as in building computationally-tractable knowledge in the rare disorders field. These processes rely on using ontology concepts, often from the Human Phenotype Ontology, in conjunction with a phenotype concept recognition task (supported usually by machine learning methods) to curate patient profiles or existing scientific literature. With the significant shift in the use of large language models (LLMs) for most NLP tasks, we examine the performance of the latest Generative Pre-trained Transformer (GPT) models underpinning ChatGPT as a foundation for the tasks of clinical phenotyping and phenotype annotation. MATERIALS AND METHODS: The experimental setup of the study included seven prompts of various levels of specificity, two GPT models (gpt-3.5-turbo and gpt-4.0) and two established gold standard corpora for phenotype recognition, one consisting of publication abstracts and the other clinical observations. RESULTS: The best run, using in-context learning, achieved 0.58 document-level F1 score on publication abstracts and 0.75 document-level F1 score on clinical observations, as well as a mention-level F1 score of 0.7, which surpasses the current best in class tool. Without in-context learning, however, performance is significantly below the existing approaches. CONCLUSION: Our experiments show that gpt-4.0 surpasses the state of the art performance if the task is constrained to a subset of the target ontology where there is prior knowledge of the terms that are expected to be matched. While the results are promising, the non-deterministic nature of the outcomes, the high cost and the lack of concordance between different runs using the same prompt and input make the use of these LLMs challenging for this particular task.


Asunto(s)
Conocimiento , Lenguaje , Humanos , Aprendizaje Automático , Fenotipo , Enfermedades Raras
11.
Nucleic Acids Res ; 52(D1): D938-D949, 2024 Jan 05.
Artículo en Inglés | MEDLINE | ID: mdl-38000386

RESUMEN

Bridging the gap between genetic variations, environmental determinants, and phenotypic outcomes is critical for supporting clinical diagnosis and understanding mechanisms of diseases. It requires integrating open data at a global scale. The Monarch Initiative advances these goals by developing open ontologies, semantic data models, and knowledge graphs for translational research. The Monarch App is an integrated platform combining data about genes, phenotypes, and diseases across species. Monarch's APIs enable access to carefully curated datasets and advanced analysis tools that support the understanding and diagnosis of disease for diverse applications such as variant prioritization, deep phenotyping, and patient profile-matching. We have migrated our system into a scalable, cloud-based infrastructure; simplified Monarch's data ingestion and knowledge graph integration systems; enhanced data mapping and integration standards; and developed a new user interface with novel search and graph navigation features. Furthermore, we advanced Monarch's analytic tools by developing a customized plugin for OpenAI's ChatGPT to increase the reliability of its responses about phenotypic data, allowing us to interrogate the knowledge in the Monarch graph using state-of-the-art Large Language Models. The resources of the Monarch Initiative can be found at monarchinitiative.org and its corresponding code repository at github.com/monarch-initiative/monarch-app.


Asunto(s)
Bases de Datos Factuales , Enfermedad , Genes , Fenotipo , Humanos , Internet , Bases de Datos Factuales/normas , Programas Informáticos , Genes/genética , Enfermedad/genética
12.
Med ; 4(12): 913-927.e3, 2023 Dec 08.
Artículo en Inglés | MEDLINE | ID: mdl-37963467

RESUMEN

BACKGROUND: Navigating the clinical literature to determine the optimal clinical management for rare diseases presents significant challenges. We introduce the Medical Action Ontology (MAxO), an ontology specifically designed to organize medical procedures, therapies, and interventions. METHODS: MAxO incorporates logical structures that link MAxO terms to numerous other ontologies within the OBO Foundry. Term development involves a blend of manual and semi-automated processes. Additionally, we have generated annotations detailing diagnostic modalities for specific phenotypic abnormalities defined by the Human Phenotype Ontology (HPO). We introduce a web application, POET, that facilitates MAxO annotations for specific medical actions for diseases using the Mondo Disease Ontology. FINDINGS: MAxO encompasses 1,757 terms spanning a wide range of biomedical domains, from human anatomy and investigations to the chemical and protein entities involved in biological processes. These terms annotate phenotypic features associated with specific disease (using HPO and Mondo). Presently, there are over 16,000 MAxO diagnostic annotations that target HPO terms. Through POET, we have created 413 MAxO annotations specifying treatments for 189 rare diseases. CONCLUSIONS: MAxO offers a computational representation of treatments and other actions taken for the clinical management of patients. Its development is closely coupled to Mondo and HPO, broadening the scope of our computational modeling of diseases and phenotypic features. We invite the community to contribute disease annotations using POET (https://poet.jax.org/). MAxO is available under the open-source CC-BY 4.0 license (https://github.com/monarch-initiative/MAxO). FUNDING: NHGRI 1U24HG011449-01A1 and NHGRI 5RM1HG010860-04.


Asunto(s)
Ontologías Biológicas , Humanos , Enfermedades Raras , Programas Informáticos , Simulación por Computador
13.
medRxiv ; 2023 Oct 14.
Artículo en Inglés | MEDLINE | ID: mdl-37873471

RESUMEN

Postpartum depression (PPD), afflicting one in seven women, poses a major challenge in maternal health. Existing approaches to detect PPD heavily depend on in-person postpartum visits, leading to cases of the condition being overlooked and untreated. We explored the potential of consumer wearable-derived digital biomarkers for PPD recognition to address this gap. Our study demonstrated that intra-individual machine learning (ML) models developed using these digital biomarkers can discern between pre-pregnancy, pregnancy, postpartum without depression, and postpartum with depression time periods (i.e., PPD diagnosis). When evaluating variable importance, calories burned from the basal metabolic rate (calories BMR) emerged as the digital biomarker most predictive of PPD. To confirm the specificity of our method, we demonstrated that models developed in women without PPD could not accurately classify the PPD-equivalent phase. Prior depression history did not alter model efficacy for PPD recognition. Furthermore, the individualized models demonstrated superior performance compared to a conventional cohort-based model for the detection of PPD, underscoring the effectiveness of our individualized ML approach. This work establishes consumer wearables as a promising avenue for PPD identification. More importantly, it also emphasizes the utility of individualized ML model methodology, potentially transforming early disease detection strategies.

14.
Injury ; 54(12): 111092, 2023 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-37871347

RESUMEN

BACKGROUND: The objective of this study was to investigate the outcomes of COVID-19-positive patients undergoing orthopaedic fracture surgery using data from a national database of U.S. adults with a COVID-19 test for SARS-CoV-2. METHODS: This is a retrospective cohort study using data from a national database to compare orthopaedic fracture surgery outcomes between COVID-19-positive and COVID-19-negative patients in the United States. Participants aged 18-99 with orthopaedic fracture surgery between March and December 2020 were included. The main exposure was COVID-19 status. Outcomes included perioperative complications, 30-day all-cause mortality, and overall all-cause mortality. Multivariable adjusted models were fitted to determine the association of COVID-positivity with all-cause mortality. RESULTS: The total population of 6.5 million patient records was queried, identifying 76,697 participants with a fracture. There were 7,628 participants in the National COVID Cohort who had a fracture and operative management. The Charlson Comorbidity Index was higher in the COVID-19-positive group (n = 476, 6.2 %) than the COVID-19-negative group (n = 7,152, 93.8 %) (2.2 vs 1.4, p<0.001). The COVID-19-positive group had higher mortality (13.2 % vs 5.2 %, p<0.001) than the COVID-19-negative group with higher odds of death in the fully adjusted model (Odds Ratio=1.59; 95 % Confidence Interval: 1.16-2.18). CONCLUSION: COVID-19-positive participants with a fracture requiring surgery had higher mortality and perioperative complications than COVID-19-negative patients in this national cohort of U.S. adults tested for COVID-19. The risks associated with COVID-19 can guide potential treatment options and counseling of patients and their families. Future studies can be conducted as data accumulates. LEVEL OF EVIDENCE: Level III.


Asunto(s)
COVID-19 , Fracturas de Cadera , Ortopedia , Adulto , Humanos , Estados Unidos/epidemiología , COVID-19/complicaciones , COVID-19/epidemiología , SARS-CoV-2 , Estudios Retrospectivos , Fracturas de Cadera/cirugía
15.
BMC Public Health ; 23(1): 2103, 2023 10 25.
Artículo en Inglés | MEDLINE | ID: mdl-37880596

RESUMEN

BACKGROUND: More than one-third of individuals experience post-acute sequelae of SARS-CoV-2 infection (PASC, which includes long-COVID). The objective is to identify risk factors associated with PASC/long-COVID diagnosis. METHODS: This was a retrospective case-control study including 31 health systems in the United States from the National COVID Cohort Collaborative (N3C). 8,325 individuals with PASC (defined by the presence of the International Classification of Diseases, version 10 code U09.9 or a long-COVID clinic visit) matched to 41,625 controls within the same health system and COVID index date within ± 45 days of the corresponding case's earliest COVID index date. Measurements of risk factors included demographics, comorbidities, treatment and acute characteristics related to COVID-19. Multivariable logistic regression, random forest, and XGBoost were used to determine the associations between risk factors and PASC. RESULTS: Among 8,325 individuals with PASC, the majority were > 50 years of age (56.6%), female (62.8%), and non-Hispanic White (68.6%). In logistic regression, middle-age categories (40 to 69 years; OR ranging from 2.32 to 2.58), female sex (OR 1.4, 95% CI 1.33-1.48), hospitalization associated with COVID-19 (OR 3.8, 95% CI 3.05-4.73), long (8-30 days, OR 1.69, 95% CI 1.31-2.17) or extended hospital stay (30 + days, OR 3.38, 95% CI 2.45-4.67), receipt of mechanical ventilation (OR 1.44, 95% CI 1.18-1.74), and several comorbidities including depression (OR 1.50, 95% CI 1.40-1.60), chronic lung disease (OR 1.63, 95% CI 1.53-1.74), and obesity (OR 1.23, 95% CI 1.16-1.3) were associated with increased likelihood of PASC diagnosis or care at a long-COVID clinic. Characteristics associated with a lower likelihood of PASC diagnosis or care at a long-COVID clinic included younger age (18 to 29 years), male sex, non-Hispanic Black race, and comorbidities such as substance abuse, cardiomyopathy, psychosis, and dementia. More doctors per capita in the county of residence was associated with an increased likelihood of PASC diagnosis or care at a long-COVID clinic. Our findings were consistent in sensitivity analyses using a variety of analytic techniques and approaches to select controls. CONCLUSIONS: This national study identified important risk factors for PASC diagnosis such as middle age, severe COVID-19 disease, and specific comorbidities. Further clinical and epidemiological research is needed to better understand underlying mechanisms and the potential role of vaccines and therapeutics in altering PASC course.


Asunto(s)
COVID-19 , SARS-CoV-2 , Persona de Mediana Edad , Femenino , Masculino , Humanos , Adulto , Anciano , Adolescente , Adulto Joven , COVID-19/epidemiología , Síndrome Post Agudo de COVID-19 , Estudios de Casos y Controles , Estudios Retrospectivos , Factores de Riesgo , Progresión de la Enfermedad
16.
JAMIA Open ; 6(3): ooad067, 2023 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-37600074

RESUMEN

Objectives: To define pregnancy episodes and estimate gestational age within electronic health record (EHR) data from the National COVID Cohort Collaborative (N3C). Materials and Methods: We developed a comprehensive approach, named Hierarchy and rule-based pregnancy episode Inference integrated with Pregnancy Progression Signatures (HIPPS), and applied it to EHR data in the N3C (January 1, 2018-April 7, 2022). HIPPS combines: (1) an extension of a previously published pregnancy episode algorithm, (2) a novel algorithm to detect gestational age-specific signatures of a progressing pregnancy for further episode support, and (3) pregnancy start date inference. Clinicians performed validation of HIPPS on a subset of episodes. We then generated pregnancy cohorts based on gestational age precision and pregnancy outcomes for assessment of accuracy and comparison of COVID-19 and other characteristics. Results: We identified 628 165 pregnant persons with 816 471 pregnancy episodes, of which 52.3% were live births, 24.4% were other outcomes (stillbirth, ectopic pregnancy, abortions), and 23.3% had unknown outcomes. Clinician validation agreed 98.8% with HIPPS-identified episodes. We were able to estimate start dates within 1 week of precision for 475 433 (58.2%) episodes. 62 540 (7.7%) episodes had incident COVID-19 during pregnancy. Discussion: HIPPS provides measures of support for pregnancy-related variables such as gestational age and pregnancy outcomes based on N3C data. Gestational age precision allows researchers to find time to events with reasonable confidence. Conclusion: We have developed a novel and robust approach for inferring pregnancy episodes and gestational age that addresses data inconsistency and missingness in EHR data.

17.
J Am Med Inform Assoc ; 30(12): 2036-2040, 2023 11 17.
Artículo en Inglés | MEDLINE | ID: mdl-37555837

RESUMEN

Despite recent methodology advancements in clinical natural language processing (NLP), the adoption of clinical NLP models within the translational research community remains hindered by process heterogeneity and human factor variations. Concurrently, these factors also dramatically increase the difficulty in developing NLP models in multi-site settings, which is necessary for algorithm robustness and generalizability. Here, we reported on our experience developing an NLP solution for Coronavirus Disease 2019 (COVID-19) signs and symptom extraction in an open NLP framework from a subset of sites participating in the National COVID Cohort (N3C). We then empirically highlight the benefits of multi-site data for both symbolic and statistical methods, as well as highlight the need for federated annotation and evaluation to resolve several pitfalls encountered in the course of these efforts.


Asunto(s)
COVID-19 , Procesamiento de Lenguaje Natural , Humanos , Registros Electrónicos de Salud , Algoritmos
18.
medRxiv ; 2023 Jul 16.
Artículo en Inglés | MEDLINE | ID: mdl-37502882

RESUMEN

Objective: Female reproductive disorders (FRDs) are common health conditions that may present with significant symptoms. Diet and environment are potential areas for FRD interventions. We utilized a knowledge graph (KG) method to predict factors associated with common FRDs (e.g., endometriosis, ovarian cyst, and uterine fibroids). Materials and Methods: We harmonized survey data from the Personalized Environment and Genes Study on internal and external environmental exposures and health conditions with biomedical ontology content. We merged the harmonized data and ontologies with supplemental nutrient and agricultural chemical data to create a KG. We analyzed the KG by embedding edges and applying a random forest for edge prediction to identify variables potentially associated with FRDs. We also conducted logistic regression analysis for comparison. Results: Across 9765 PEGS respondents, the KG analysis resulted in 8535 significant predicted links between FRDs and chemicals, phenotypes, and diseases. Amongst these links, 32 were exact matches when compared with the logistic regression results, including comorbidities, medications, foods, and occupational exposures. Discussion: Mechanistic underpinnings of predicted links documented in the literature may support some of our findings. Our KG methods are useful for predicting possible associations in large, survey-based datasets with added information on directionality and magnitude of effect from logistic regression. These results should not be construed as causal, but can support hypothesis generation. Conclusion: This investigation enabled the generation of hypotheses on a variety of potential links between FRDs and exposures. Future investigations should prospectively evaluate the variables hypothesized to impact FRDs.

19.
medRxiv ; 2023 Jul 13.
Artículo en Inglés | MEDLINE | ID: mdl-37503136

RESUMEN

Navigating the vast landscape of clinical literature to find optimal treatments and management strategies can be a challenging task, especially for rare diseases. To address this task, we introduce the Medical Action Ontology (MAxO), the first ontology specifically designed to organize medical procedures, therapies, and interventions in a structured way. Currently, MAxO contains 1757 medical action terms added through a combination of manual and semi-automated processes. MAxO was developed with logical structures that make it compatible with several other ontologies within the Open Biological and Biomedical Ontologies (OBO) Foundry. These cover a wide range of biomedical domains, from human anatomy and investigations to the chemical and protein entities involved in biological processes. We have created a database of over 16000 annotations that describe diagnostic modalities for specific phenotypic abnormalities as defined by the Human Phenotype Ontology (HPO). Additionally, 413 annotations are provided for medical actions for 189 rare diseases. We have developed a web application called POET (https://poet.jax.org/) for the community to use to contribute MAxO annotations. MAxO provides a computational representation of treatments and other actions taken for the clinical management of patients. The development of MAxO is closely coupled to the Mondo Disease Ontology (Mondo) and the Human Phenotype Ontology (HPO) and expands the scope of our computational modeling of diseases and phenotypic features to include diagnostics and therapeutic actions. MAxO is available under the open-source CC-BY 4.0 license (https://github.com/monarch-initiative/MAxO).

20.
Bioinformatics ; 39(7)2023 07 01.
Artículo en Inglés | MEDLINE | ID: mdl-37389415

RESUMEN

MOTIVATION: Knowledge graphs (KGs) are a powerful approach for integrating heterogeneous data and making inferences in biology and many other domains, but a coherent solution for constructing, exchanging, and facilitating the downstream use of KGs is lacking. RESULTS: Here we present KG-Hub, a platform that enables standardized construction, exchange, and reuse of KGs. Features include a simple, modular extract-transform-load pattern for producing graphs compliant with Biolink Model (a high-level data model for standardizing biological data), easy integration of any OBO (Open Biological and Biomedical Ontologies) ontology, cached downloads of upstream data sources, versioned and automatically updated builds with stable URLs, web-browsable storage of KG artifacts on cloud infrastructure, and easy reuse of transformed subgraphs across projects. Current KG-Hub projects span use cases including COVID-19 research, drug repurposing, microbial-environmental interactions, and rare disease research. KG-Hub is equipped with tooling to easily analyze and manipulate KGs. KG-Hub is also tightly integrated with graph machine learning (ML) tools which allow automated graph ML, including node embeddings and training of models for link prediction and node classification. AVAILABILITY AND IMPLEMENTATION: https://kghub.org.


Asunto(s)
Ontologías Biológicas , COVID-19 , Humanos , Reconocimiento de Normas Patrones Automatizadas , Enfermedades Raras , Aprendizaje Automático
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...