Búsqueda | BVS Bolivia

1.

Finding Long-COVID: Temporal Topic Modeling of Electronic Health Records from the N3C and RECOVER Programs.

O'Neil, Shawn T; Madlock-Brown, Charisse; Wilkins, Kenneth J; McGrath, Brenda M; Davis, Hannah E; Assaf, Gina S; Wei, Hannah; Zareie, Parya; French, Evan T; Loomba, Johanna; McMurry, Julie A; Zhou, Andrea; Chute, Christopher G; Moffitt, Richard A; Pfaff, Emily R; Yoo, Yun Jae; Leese, Peter; Chew, Robert F; Lieberman, Michael; Haendel, Melissa A.

medRxiv ; 2024 Jun 11.

Artículo en Inglés | MEDLINE | ID: mdl-38947087

RESUMEN

Post-Acute Sequelae of SARS-CoV-2 infection (PASC), also known as Long-COVID, encompasses a variety of complex and varied outcomes following COVID-19 infection that are still poorly understood. We clustered over 600 million condition diagnoses from 14 million patients available through the National COVID Cohort Collaborative (N3C), generating hundreds of highly detailed clinical phenotypes. Assessing patient clinical trajectories using these clusters allowed us to identify individual conditions and phenotypes strongly increased after acute infection. We found many conditions increased in COVID-19 patients compared to controls, and using a novel method to associate patients with clusters over time, we additionally found phenotypes specific to patient sex, age, wave of infection, and PASC diagnosis status. While many of these results reflect known PASC symptoms, the resolution provided by this unprecedented data scale suggests avenues for improved diagnostics and mechanistic understanding of this multifaceted disease.

2.

Association of post-COVID phenotypic manifestations with new-onset psychiatric disease.

Coleman, Ben; Casiraghi, Elena; Callahan, Tiffany J; Blau, Hannah; Chan, Lauren E; Laraway, Bryan; Clark, Kevin B; Re'em, Yochai; Gersing, Ken R; Wilkins, Kenneth J; Harris, Nomi L; Valentini, Giorgio; Haendel, Melissa A; Reese, Justin T; Robinson, Peter N.

Transl Psychiatry ; 14(1): 246, 2024 Jun 08.

Artículo en Inglés | MEDLINE | ID: mdl-38851761

RESUMEN

Acute COVID-19 infection can be followed by diverse clinical manifestations referred to as Post Acute Sequelae of SARS-CoV2 Infection (PASC). Studies have shown an increased risk of being diagnosed with new-onset psychiatric disease following a diagnosis of acute COVID-19. However, it was unclear whether non-psychiatric PASC-associated manifestations (PASC-AMs) are associated with an increased risk of new-onset psychiatric disease following COVID-19. A retrospective electronic health record (EHR) cohort study of 2,391,006 individuals with acute COVID-19 was performed to evaluate whether non-psychiatric PASC-AMs are associated with new-onset psychiatric disease. Data were obtained from the National COVID Cohort Collaborative (N3C), which has EHR data from 76 clinical organizations. EHR codes were mapped to 151 non-psychiatric PASC-AMs recorded 28-120 days following SARS-CoV-2 diagnosis and before diagnosis of new-onset psychiatric disease. Association of newly diagnosed psychiatric disease with age, sex, race, pre-existing comorbidities, and PASC-AMs in seven categories was assessed by logistic regression. There were significant associations between a diagnosis of any psychiatric disease and five categories of PASC-AMs with odds ratios highest for neurological, cardiovascular, and constitutional PASC-AMs with odds ratios of 1.31, 1.29, and 1.23 respectively. Secondary analysis revealed that the proportions of 50 individual clinical features significantly differed between patients diagnosed with different psychiatric diseases. Our study provides evidence for association between non-psychiatric PASC-AMs and the incidence of newly diagnosed psychiatric disease. Significant associations were found for features related to multiple organ systems. This information could prove useful in understanding risk stratification for new-onset psychiatric disease following COVID-19. Prospective studies are needed to corroborate these findings.

Asunto(s)

COVID-19 , Trastornos Mentales , SARS-CoV-2 , Humanos , COVID-19/psicología , COVID-19/complicaciones , COVID-19/epidemiología , Masculino , Femenino , Trastornos Mentales/epidemiología , Persona de Mediana Edad , Adulto , Estudios Retrospectivos , Anciano , Fenotipo , Síndrome Post Agudo de COVID-19 , Comorbilidad , Registros Electrónicos de Salud , Adulto Joven , Factores de Riesgo , Adolescente

3.

The Vertebrate Breed Ontology: Towards Effective Breed Data Standardization.

Mullen, Kathleen R; Tammen, Imke; Matentzoglu, Nicolas A; Mather, Marius; Mungall, Christopher J; Haendel, Melissa A; Nicholas, Frank W; Toro, Sabrina.

ArXiv ; 2024 Jun 03.

Artículo en Inglés | MEDLINE | ID: mdl-38883236

RESUMEN

Background : Limited universally adopted data standards in veterinary science hinders data interoperability and therefore integration and comparison; this ultimately impedes application of existing information-based tools to support advancement in veterinary diagnostics, treatments, and precision medicine. Hypothesis/Objectives : Creation of a Vertebrate Breed Ontology (VBO) as a single, coherent logic-based standard for documenting breed names in animal health, production and research-related records will improve data use capabilities in veterinary and comparative medicine. Animals : No live animals were used in this study. Methods : A list of breed names and related information was compiled from relevant sources, organizations, communities, and experts using manual and computational approaches to create VBO. Each breed is represented by a VBO term that includes all provenance and the breed's related information as metadata. VBO terms are classified using description logic to allow computational applications and Artificial Intelligence-readiness. Results : VBO is an open, community-driven ontology representing over 19,000 livestock and companion animal breeds covering 41 species. Breeds are classified based on community and expert conventions (e.g., horse breed, cattle breed). This classification is supported by relations to the breeds' genus and species indicated by NCBI Taxonomy terms. Relationships between VBO terms, e.g. relating breeds to their foundation stock, provide additional context to support advanced data analytics. VBO term metadata includes common names and synonyms, breed identifiers/codes, and attributed cross-references to other databases. Conclusion and clinical importance : Veterinary data interoperability and computability can be enhanced by the adoption of VBO as a source of standard breed names in databases and veterinary electronic health records.

4.

A corpus of GA4GH Phenopackets: case-level phenotyping for genomic diagnostics and discovery.

Danis, Daniel; Bamshad, Michael J; Bridges, Yasemin; Cacheiro, Pilar; Carmody, Leigh C; Chong, Jessica X; Coleman, Ben; Dalgleish, Raymond; Freeman, Peter J; Graefe, Adam S L; Groza, Tudor; Jacobsen, Julius O B; Klocperk, Adam; Kusters, Maaike; Ladewig, Markus S; Marcello, Anthony J; Mattina, Teresa; Mungall, Christopher J; Munoz-Torres, Monica C; Reese, Justin T; Rehburg, Filip; Reis, Bárbara C S; Schuetz, Catharina; Smedley, Damian; Strauss, Timmy; Sundaramurthi, Jagadish Chandrabose; Thun, Sylvia; Wissink, Kyran; Wagstaff, John F; Zocche, David; Haendel, Melissa A; Robinson, Peter N.

medRxiv ; 2024 May 29.

Artículo en Inglés | MEDLINE | ID: mdl-38854034

RESUMEN

The Global Alliance for Genomics and Health (GA4GH) Phenopacket Schema was released in 2022 and approved by ISO as a standard for sharing clinical and genomic information about an individual, including phenotypic descriptions, numerical measurements, genetic information, diagnoses, and treatments. A phenopacket can be used as an input file for software that supports phenotype-driven genomic diagnostics and for algorithms that facilitate patient classification and stratification for identifying new diseases and treatments. There has been a great need for a collection of phenopackets to test software pipelines and algorithms. Here, we present phenopacket-store. Version 0.1.12 of phenopacket-store includes 4916 phenopackets representing 277 Mendelian and chromosomal diseases associated with 236 genes, and 2872 unique pathogenic alleles curated from 605 different publications. This represents the first large-scale collection of case-level, standardized phenotypic information derived from case reports in the literature with detailed descriptions of the clinical data and will be useful for many purposes, including the development and testing of software for prioritizing genes and diseases in diagnostic genomics, machine learning analysis of clinical phenotype data, patient stratification, and genotype-phenotype correlations. This corpus also provides best-practice examples for curating literature-derived data using the GA4GH Phenopacket Schema.

5.

Harnessing Consumer Wearable Digital Biomarkers for Individualized Recognition of Postpartum Depression Using the All of Us Research Program Data Set: Cross-Sectional Study.

Hurwitz, Eric; Butzin-Dozier, Zachary; Master, Hiral; O'Neil, Shawn T; Walden, Anita; Holko, Michelle; Patel, Rena C; Haendel, Melissa A.

JMIR Mhealth Uhealth ; 12: e54622, 2024 May 02.

Artículo en Inglés | MEDLINE | ID: mdl-38696234

RESUMEN

BACKGROUND: Postpartum depression (PPD) poses a significant maternal health challenge. The current approach to detecting PPD relies on in-person postpartum visits, which contributes to underdiagnosis. Furthermore, recognizing PPD symptoms can be challenging. Therefore, we explored the potential of using digital biomarkers from consumer wearables for PPD recognition. OBJECTIVE: The main goal of this study was to showcase the viability of using machine learning (ML) and digital biomarkers related to heart rate, physical activity, and energy expenditure derived from consumer-grade wearables for the recognition of PPD. METHODS: Using the All of Us Research Program Registered Tier v6 data set, we performed computational phenotyping of women with and without PPD following childbirth. Intraindividual ML models were developed using digital biomarkers from Fitbit to discern between prepregnancy, pregnancy, postpartum without depression, and postpartum with depression (ie, PPD diagnosis) periods. Models were built using generalized linear models, random forest, support vector machine, and k-nearest neighbor algorithms and evaluated using the κ statistic and multiclass area under the receiver operating characteristic curve (mAUC) to determine the algorithm with the best performance. The specificity of our individualized ML approach was confirmed in a cohort of women who gave birth and did not experience PPD. Moreover, we assessed the impact of a previous history of depression on model performance. We determined the variable importance for predicting the PPD period using Shapley additive explanations and confirmed the results using a permutation approach. Finally, we compared our individualized ML methodology against a traditional cohort-based ML model for PPD recognition and compared model performance using sensitivity, specificity, precision, recall, and F1-score. RESULTS: Patient cohorts of women with valid Fitbit data who gave birth included <20 with PPD and 39 without PPD. Our results demonstrated that intraindividual models using digital biomarkers discerned among prepregnancy, pregnancy, postpartum without depression, and postpartum with depression (ie, PPD diagnosis) periods, with random forest (mAUC=0.85; κ=0.80) models outperforming generalized linear models (mAUC=0.82; κ=0.74), support vector machine (mAUC=0.75; κ=0.72), and k-nearest neighbor (mAUC=0.74; κ=0.62). Model performance decreased in women without PPD, illustrating the method's specificity. Previous depression history did not impact the efficacy of the model for PPD recognition. Moreover, we found that the most predictive biomarker of PPD was calories burned during the basal metabolic rate. Finally, individualized models surpassed the performance of a conventional cohort-based model for PPD detection. CONCLUSIONS: This research establishes consumer wearables as a promising tool for PPD identification and highlights personalized ML approaches, which could transform early disease detection strategies.

Asunto(s)

Biomarcadores , Depresión Posparto , Dispositivos Electrónicos Vestibles , Humanos , Depresión Posparto/diagnóstico , Depresión Posparto/psicología , Femenino , Adulto , Biomarcadores/análisis , Estudios Transversales , Dispositivos Electrónicos Vestibles/estadística & datos numéricos , Dispositivos Electrónicos Vestibles/normas , Aprendizaje Automático/normas , Embarazo , Estados Unidos , Conjuntos de Datos como Asunto , Curva ROC

6.

Predicting nutrition and environmental factors associated with female reproductive disorders using a knowledge graph and random forests.

Chan, Lauren E; Casiraghi, Elena; Reese, Justin; Harmon, Quaker E; Schaper, Kevin; Hegde, Harshad; Valentini, Giorgio; Schmitt, Charles; Motsinger-Reif, Alison; Hall, Janet E; Mungall, Christopher J; Robinson, Peter N; Haendel, Melissa A.

Int J Med Inform ; 187: 105461, 2024 Jul.

Artículo en Inglés | MEDLINE | ID: mdl-38643701

RESUMEN

OBJECTIVE: Female reproductive disorders (FRDs) are common health conditions that may present with significant symptoms. Diet and environment are potential areas for FRD interventions. We utilized a knowledge graph (KG) method to predict factors associated with common FRDs (for example, endometriosis, ovarian cyst, and uterine fibroids). MATERIALS AND METHODS: We harmonized survey data from the Personalized Environment and Genes Study (PEGS) on internal and external environmental exposures and health conditions with biomedical ontology content. We merged the harmonized data and ontologies with supplemental nutrient and agricultural chemical data to create a KG. We analyzed the KG by embedding edges and applying a random forest for edge prediction to identify variables potentially associated with FRDs. We also conducted logistic regression analysis for comparison. RESULTS: Across 9765 PEGS respondents, the KG analysis resulted in 8535 significant or suggestive predicted links between FRDs and chemicals, phenotypes, and diseases. Amongst these links, 32 were exact matches when compared with the logistic regression results, including comorbidities, medications, foods, and occupational exposures. DISCUSSION: Mechanistic underpinnings of predicted links documented in the literature may support some of our findings. Our KG methods are useful for predicting possible associations in large, survey-based datasets with added information on directionality and magnitude of effect from logistic regression. These results should not be construed as causal but can support hypothesis generation. CONCLUSION: This investigation enabled the generation of hypotheses on a variety of potential links between FRDs and exposures. Future investigations should prospectively evaluate the variables hypothesized to impact FRDs.

Asunto(s)

Exposición a Riesgos Ambientales , Humanos , Femenino , Exposición a Riesgos Ambientales/efectos adversos , Enfermedades de los Genitales Femeninos , Modelos Logísticos , Estado Nutricional , Dieta , Adulto , Bosques Aleatorios

7.

Structured Prompt Interrogation and Recursive Extraction of Semantics (SPIRES): a method for populating knowledge bases using zero-shot learning.

Caufield, J Harry; Hegde, Harshad; Emonet, Vincent; Harris, Nomi L; Joachimiak, Marcin P; Matentzoglu, Nicolas; Kim, HyeongSik; Moxon, Sierra; Reese, Justin T; Haendel, Melissa A; Robinson, Peter N; Mungall, Christopher J.

Bioinformatics ; 40(3)2024 Mar 04.

Artículo en Inglés | MEDLINE | ID: mdl-38383067

RESUMEN

MOTIVATION: Creating knowledge bases and ontologies is a time consuming task that relies on manual curation. AI/NLP approaches can assist expert curators in populating these knowledge bases, but current approaches rely on extensive training data, and are not able to populate arbitrarily complex nested knowledge schemas. RESULTS: Here we present Structured Prompt Interrogation and Recursive Extraction of Semantics (SPIRES), a Knowledge Extraction approach that relies on the ability of Large Language Models (LLMs) to perform zero-shot learning and general-purpose query answering from flexible prompts and return information conforming to a specified schema. Given a detailed, user-defined knowledge schema and an input text, SPIRES recursively performs prompt interrogation against an LLM to obtain a set of responses matching the provided schema. SPIRES uses existing ontologies and vocabularies to provide identifiers for matched elements. We present examples of applying SPIRES in different domains, including extraction of food recipes, multi-species cellular signaling pathways, disease treatments, multi-step drug mechanisms, and chemical to disease relationships. Current SPIRES accuracy is comparable to the mid-range of existing Relation Extraction methods, but greatly surpasses an LLM's native capability of grounding entities with unique identifiers. SPIRES has the advantage of easy customization, flexibility, and, crucially, the ability to perform new tasks in the absence of any new training data. This method supports a general strategy of leveraging the language interpreting capabilities of LLMs to assemble knowledge bases, assisting manual knowledge curation and acquisition while supporting validation with publicly-available databases and ontologies external to the LLM. AVAILABILITY AND IMPLEMENTATION: SPIRES is available as part of the open source OntoGPT package: https://github.com/monarch-initiative/ontogpt.

Asunto(s)

Bases del Conocimiento , Semántica , Bases de Datos Factuales

8.

Effectiveness of mRNA Booster Vaccine Against Coronavirus Disease 2019 Infection and Severe Outcomes Among Persons With and Without Immune Dysfunction: A Retrospective Cohort Study of National Electronic Medical Record Data in the United States.

Sun, Jing; Zheng, Qulu; Anzalone, Alfred J; Abraham, Alison G; Olex, Amy L; Zhang, Yifan; Mathew, Jomol; Safdar, Nasia; Haendel, Melissa A; Segev, Dorry; Islam, Jessica Y; Singh, Jasvinder A; Mannon, Roslyn B; Chute, Christopher G; Patel, Rena C; Kirk, Gregory D.

Open Forum Infect Dis ; 11(2): ofae019, 2024 Feb.

Artículo en Inglés | MEDLINE | ID: mdl-38379569

RESUMEN

Background: Real-world evidence of coronavirus disease 2019 (COVID-19) messenger RNA (mRNA) booster effectiveness among patients with immune dysfunction are limited. Methods: We included data from patients in the United States National COVID Cohort Collaborative (N3C) who completed ≥2 doses of mRNA vaccination between 10 December 2020 and 27 May 2022. Immune dysfunction conditions included human immunodeficiency virus infection, solid organ or bone marrow transplant, autoimmune diseases, and cancer. We defined incident COVID-19 BTI as positive results from laboratory tests or diagnostic codes 14 days after at least 2 doses of mRNA vaccination; and severe COVID-19 BTI as hospitalization, invasive cardiopulmonary support, and/or death. We used propensity scores to match boosted versus nonboosted patients and evaluated hazards of incident and severe COVID-19 BTI using Cox regression after matching. Results: Among patients without immune dysfunction, the relative effectiveness of booster (3 doses) after 6 months from the primary (2 doses) vaccination against BTI ranged from 69% to 81% during the Delta-predominant period and from 33% to 39% during the Omicron-predominant period. Relative effectiveness against BTI was lower among patients with immune dysfunction but remained statistically significant in both periods. Boosted patients had lower risk of COVID-19-related hospitalization (hazard ratios [HR] ranged from 0.5 [95% confidence interval {CI}, .48-.53] to 0.63 [95% CI, .56-.70]), invasive cardiopulmonary support, or death (HRs ranged from 0.46 [95% CI, .41-.52] to 0.63 [95% CI, .50-.79]) during both periods. Conclusions: Booster vaccines remain effective against severe COVID-19 BTI throughout the Delta- and Omicron-predominant periods, regardless of patients' immune status.

9.

An evaluation of GPT models for phenotype concept recognition.

Groza, Tudor; Caufield, Harry; Gration, Dylan; Baynam, Gareth; Haendel, Melissa A; Robinson, Peter N; Mungall, Christopher J; Reese, Justin T.

BMC Med Inform Decis Mak ; 24(1): 30, 2024 Jan 31.

Artículo en Inglés | MEDLINE | ID: mdl-38297371

RESUMEN

OBJECTIVE: Clinical deep phenotyping and phenotype annotation play a critical role in both the diagnosis of patients with rare disorders as well as in building computationally-tractable knowledge in the rare disorders field. These processes rely on using ontology concepts, often from the Human Phenotype Ontology, in conjunction with a phenotype concept recognition task (supported usually by machine learning methods) to curate patient profiles or existing scientific literature. With the significant shift in the use of large language models (LLMs) for most NLP tasks, we examine the performance of the latest Generative Pre-trained Transformer (GPT) models underpinning ChatGPT as a foundation for the tasks of clinical phenotyping and phenotype annotation. MATERIALS AND METHODS: The experimental setup of the study included seven prompts of various levels of specificity, two GPT models (gpt-3.5-turbo and gpt-4.0) and two established gold standard corpora for phenotype recognition, one consisting of publication abstracts and the other clinical observations. RESULTS: The best run, using in-context learning, achieved 0.58 document-level F1 score on publication abstracts and 0.75 document-level F1 score on clinical observations, as well as a mention-level F1 score of 0.7, which surpasses the current best in class tool. Without in-context learning, however, performance is significantly below the existing approaches. CONCLUSION: Our experiments show that gpt-4.0 surpasses the state of the art performance if the task is constrained to a subset of the target ontology where there is prior knowledge of the terms that are expected to be matched. While the results are promising, the non-deterministic nature of the outcomes, the high cost and the lack of concordance between different runs using the same prompt and input make the use of these LLMs challenging for this particular task.

Asunto(s)

Conocimiento , Lenguaje , Humanos , Aprendizaje Automático , Fenotipo , Enfermedades Raras

10.

The Monarch Initiative in 2024: an analytic platform integrating phenotypes, genes and diseases across species.

Putman, Tim E; Schaper, Kevin; Matentzoglu, Nicolas; Rubinetti, Vincent P; Alquaddoomi, Faisal S; Cox, Corey; Caufield, J Harry; Elsarboukh, Glass; Gehrke, Sarah; Hegde, Harshad; Reese, Justin T; Braun, Ian; Bruskiewich, Richard M; Cappelletti, Luca; Carbon, Seth; Caron, Anita R; Chan, Lauren E; Chute, Christopher G; Cortes, Katherina G; De Souza, Vinícius; Fontana, Tommaso; Harris, Nomi L; Hartley, Emily L; Hurwitz, Eric; Jacobsen, Julius O B; Krishnamurthy, Madan; Laraway, Bryan J; McLaughlin, James A; McMurry, Julie A; Moxon, Sierra A T; Mullen, Kathleen R; O'Neil, Shawn T; Shefchek, Kent A; Stefancsik, Ray; Toro, Sabrina; Vasilevsky, Nicole A; Walls, Ramona L; Whetzel, Patricia L; Osumi-Sutherland, David; Smedley, Damian; Robinson, Peter N; Mungall, Christopher J; Haendel, Melissa A; Munoz-Torres, Monica C.

Nucleic Acids Res ; 52(D1): D938-D949, 2024 Jan 05.

Artículo en Inglés | MEDLINE | ID: mdl-38000386

RESUMEN

Bridging the gap between genetic variations, environmental determinants, and phenotypic outcomes is critical for supporting clinical diagnosis and understanding mechanisms of diseases. It requires integrating open data at a global scale. The Monarch Initiative advances these goals by developing open ontologies, semantic data models, and knowledge graphs for translational research. The Monarch App is an integrated platform combining data about genes, phenotypes, and diseases across species. Monarch's APIs enable access to carefully curated datasets and advanced analysis tools that support the understanding and diagnosis of disease for diverse applications such as variant prioritization, deep phenotyping, and patient profile-matching. We have migrated our system into a scalable, cloud-based infrastructure; simplified Monarch's data ingestion and knowledge graph integration systems; enhanced data mapping and integration standards; and developed a new user interface with novel search and graph navigation features. Furthermore, we advanced Monarch's analytic tools by developing a customized plugin for OpenAI's ChatGPT to increase the reliability of its responses about phenotypic data, allowing us to interrogate the knowledge in the Monarch graph using state-of-the-art Large Language Models. The resources of the Monarch Initiative can be found at monarchinitiative.org and its corresponding code repository at github.com/monarch-initiative/monarch-app.

Asunto(s)

Bases de Datos Factuales , Enfermedad , Genes , Fenotipo , Humanos , Internet , Bases de Datos Factuales/normas , Programas Informáticos , Genes/genética , Enfermedad/genética

11.

The Medical Action Ontology: A tool for annotating and analyzing treatments and clinical management of human disease.

Carmody, Leigh C; Gargano, Michael A; Toro, Sabrina; Vasilevsky, Nicole A; Adam, Margaret P; Blau, Hannah; Chan, Lauren E; Gomez-Andres, David; Horvath, Rita; Kraus, Megan L; Ladewig, Markus S; Lewis-Smith, David; Lochmüller, Hanns; Matentzoglu, Nicolas A; Munoz-Torres, Monica C; Schuetz, Catharina; Seitz, Berthold; Similuk, Morgan N; Sparks, Teresa N; Strauss, Timmy; Swietlik, Emilia M; Thompson, Rachel; Zhang, Xingmin Aaron; Mungall, Christopher J; Haendel, Melissa A; Robinson, Peter N.

Med ; 4(12): 913-927.e3, 2023 Dec 08.

Artículo en Inglés | MEDLINE | ID: mdl-37963467

RESUMEN

BACKGROUND: Navigating the clinical literature to determine the optimal clinical management for rare diseases presents significant challenges. We introduce the Medical Action Ontology (MAxO), an ontology specifically designed to organize medical procedures, therapies, and interventions. METHODS: MAxO incorporates logical structures that link MAxO terms to numerous other ontologies within the OBO Foundry. Term development involves a blend of manual and semi-automated processes. Additionally, we have generated annotations detailing diagnostic modalities for specific phenotypic abnormalities defined by the Human Phenotype Ontology (HPO). We introduce a web application, POET, that facilitates MAxO annotations for specific medical actions for diseases using the Mondo Disease Ontology. FINDINGS: MAxO encompasses 1,757 terms spanning a wide range of biomedical domains, from human anatomy and investigations to the chemical and protein entities involved in biological processes. These terms annotate phenotypic features associated with specific disease (using HPO and Mondo). Presently, there are over 16,000 MAxO diagnostic annotations that target HPO terms. Through POET, we have created 413 MAxO annotations specifying treatments for 189 rare diseases. CONCLUSIONS: MAxO offers a computational representation of treatments and other actions taken for the clinical management of patients. Its development is closely coupled to Mondo and HPO, broadening the scope of our computational modeling of diseases and phenotypic features. We invite the community to contribute disease annotations using POET (https://poet.jax.org/). MAxO is available under the open-source CC-BY 4.0 license (https://github.com/monarch-initiative/MAxO). FUNDING: NHGRI 1U24HG011449-01A1 and NHGRI 5RM1HG010860-04.

Asunto(s)

Ontologías Biológicas , Humanos , Enfermedades Raras , Programas Informáticos , Simulación por Computador

12.

Harnessing consumer wearable digital biomarkers for individualized recognition of postpartum depression using the All of Us Research Program dataset.

Hurwitz, Eric; Butzin-Dozier, Zachary; Master, Hiral; O'Neil, Shawn T; Walden, Anita; Holko, Michelle; Patel, Rena C; Haendel, Melissa A.

medRxiv ; 2023 Oct 14.

Artículo en Inglés | MEDLINE | ID: mdl-37873471

RESUMEN

Postpartum depression (PPD), afflicting one in seven women, poses a major challenge in maternal health. Existing approaches to detect PPD heavily depend on in-person postpartum visits, leading to cases of the condition being overlooked and untreated. We explored the potential of consumer wearable-derived digital biomarkers for PPD recognition to address this gap. Our study demonstrated that intra-individual machine learning (ML) models developed using these digital biomarkers can discern between pre-pregnancy, pregnancy, postpartum without depression, and postpartum with depression time periods (i.e., PPD diagnosis). When evaluating variable importance, calories burned from the basal metabolic rate (calories BMR) emerged as the digital biomarker most predictive of PPD. To confirm the specificity of our method, we demonstrated that models developed in women without PPD could not accurately classify the PPD-equivalent phase. Prior depression history did not alter model efficacy for PPD recognition. Furthermore, the individualized models demonstrated superior performance compared to a conventional cohort-based model for the detection of PPD, underscoring the effectiveness of our individualized ML approach. This work establishes consumer wearables as a promising avenue for PPD identification. More importantly, it also emphasizes the utility of individualized ML model methodology, potentially transforming early disease detection strategies.

13.

Risk factors associated with post-acute sequelae of SARS-CoV-2: an N3C and NIH RECOVER study.

Hill, Elaine L; Mehta, Hemalkumar B; Sharma, Suchetha; Mane, Klint; Singh, Sharad Kumar; Xie, Catherine; Cathey, Emily; Loomba, Johanna; Russell, Seth; Spratt, Heidi; DeWitt, Peter E; Ammar, Nariman; Madlock-Brown, Charisse; Brown, Donald; McMurry, Julie A; Chute, Christopher G; Haendel, Melissa A; Moffitt, Richard; Pfaff, Emily R; Bennett, Tellen D.

BMC Public Health ; 23(1): 2103, 2023 10 25.

Artículo en Inglés | MEDLINE | ID: mdl-37880596

RESUMEN

BACKGROUND: More than one-third of individuals experience post-acute sequelae of SARS-CoV-2 infection (PASC, which includes long-COVID). The objective is to identify risk factors associated with PASC/long-COVID diagnosis. METHODS: This was a retrospective case-control study including 31 health systems in the United States from the National COVID Cohort Collaborative (N3C). 8,325 individuals with PASC (defined by the presence of the International Classification of Diseases, version 10 code U09.9 or a long-COVID clinic visit) matched to 41,625 controls within the same health system and COVID index date within ± 45 days of the corresponding case's earliest COVID index date. Measurements of risk factors included demographics, comorbidities, treatment and acute characteristics related to COVID-19. Multivariable logistic regression, random forest, and XGBoost were used to determine the associations between risk factors and PASC. RESULTS: Among 8,325 individuals with PASC, the majority were > 50 years of age (56.6%), female (62.8%), and non-Hispanic White (68.6%). In logistic regression, middle-age categories (40 to 69 years; OR ranging from 2.32 to 2.58), female sex (OR 1.4, 95% CI 1.33-1.48), hospitalization associated with COVID-19 (OR 3.8, 95% CI 3.05-4.73), long (8-30 days, OR 1.69, 95% CI 1.31-2.17) or extended hospital stay (30 + days, OR 3.38, 95% CI 2.45-4.67), receipt of mechanical ventilation (OR 1.44, 95% CI 1.18-1.74), and several comorbidities including depression (OR 1.50, 95% CI 1.40-1.60), chronic lung disease (OR 1.63, 95% CI 1.53-1.74), and obesity (OR 1.23, 95% CI 1.16-1.3) were associated with increased likelihood of PASC diagnosis or care at a long-COVID clinic. Characteristics associated with a lower likelihood of PASC diagnosis or care at a long-COVID clinic included younger age (18 to 29 years), male sex, non-Hispanic Black race, and comorbidities such as substance abuse, cardiomyopathy, psychosis, and dementia. More doctors per capita in the county of residence was associated with an increased likelihood of PASC diagnosis or care at a long-COVID clinic. Our findings were consistent in sensitivity analyses using a variety of analytic techniques and approaches to select controls. CONCLUSIONS: This national study identified important risk factors for PASC diagnosis such as middle age, severe COVID-19 disease, and specific comorbidities. Further clinical and epidemiological research is needed to better understand underlying mechanisms and the potential role of vaccines and therapeutics in altering PASC course.

Asunto(s)

COVID-19 , SARS-CoV-2 , Persona de Mediana Edad , Femenino , Masculino , Humanos , Adulto , Anciano , Adolescente , Adulto Joven , COVID-19/epidemiología , Síndrome Post Agudo de COVID-19 , Estudios de Casos y Controles , Estudios Retrospectivos , Factores de Riesgo , Progresión de la Enfermedad

14.

Outcomes of SARS-CoV-2 infection among patients with orthopaedic fracture surgery in the National COVID Cohort Collaborative (N3C).

Levitt, Eli B; Patch, David A; Hess, Matthew C; Terrero, Alfredo; Jaeger, Byron; Haendel, Melissa A; Chute, Christopher G; Yeager, Matthew T; Ponce, Brent A; Theiss, Steven M; Spitler, Clay A; Johnson, Joey P.

Injury ; 54(12): 111092, 2023 Dec.

Artículo en Inglés | MEDLINE | ID: mdl-37871347

RESUMEN

BACKGROUND: The objective of this study was to investigate the outcomes of COVID-19-positive patients undergoing orthopaedic fracture surgery using data from a national database of U.S. adults with a COVID-19 test for SARS-CoV-2. METHODS: This is a retrospective cohort study using data from a national database to compare orthopaedic fracture surgery outcomes between COVID-19-positive and COVID-19-negative patients in the United States. Participants aged 18-99 with orthopaedic fracture surgery between March and December 2020 were included. The main exposure was COVID-19 status. Outcomes included perioperative complications, 30-day all-cause mortality, and overall all-cause mortality. Multivariable adjusted models were fitted to determine the association of COVID-positivity with all-cause mortality. RESULTS: The total population of 6.5 million patient records was queried, identifying 76,697 participants with a fracture. There were 7,628 participants in the National COVID Cohort who had a fracture and operative management. The Charlson Comorbidity Index was higher in the COVID-19-positive group (n = 476, 6.2 %) than the COVID-19-negative group (n = 7,152, 93.8 %) (2.2 vs 1.4, p<0.001). The COVID-19-positive group had higher mortality (13.2 % vs 5.2 %, p<0.001) than the COVID-19-negative group with higher odds of death in the fully adjusted model (Odds Ratio=1.59; 95 % Confidence Interval: 1.16-2.18). CONCLUSION: COVID-19-positive participants with a fracture requiring surgery had higher mortality and perioperative complications than COVID-19-negative patients in this national cohort of U.S. adults tested for COVID-19. The risks associated with COVID-19 can guide potential treatment options and counseling of patients and their families. Future studies can be conducted as data accumulates. LEVEL OF EVIDENCE: Level III.

Asunto(s)

COVID-19 , Fracturas de Cadera , Ortopedia , Adulto , Humanos , Estados Unidos/epidemiología , COVID-19/complicaciones , COVID-19/epidemiología , SARS-CoV-2 , Estudios Retrospectivos , Fracturas de Cadera/cirugía

15.

Who is pregnant? Defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C).

Jones, Sara E; Bradwell, Katie R; Chan, Lauren E; McMurry, Julie A; Olson-Chen, Courtney; Tarleton, Jessica; Wilkins, Kenneth J; Ly, Victoria; Ljazouli, Saad; Qin, Qiuyuan; Faherty, Emily Groene; Lau, Yan Kwan; Xie, Catherine; Kao, Yu-Han; Liebman, Michael N; Mariona, Federico; Challa, Anup P; Li, Li; Ratcliffe, Sarah J; Haendel, Melissa A; Patel, Rena C; Hill, Elaine L.

JAMIA Open ; 6(3): ooad067, 2023 Oct.

Artículo en Inglés | MEDLINE | ID: mdl-37600074

RESUMEN

Objectives: To define pregnancy episodes and estimate gestational age within electronic health record (EHR) data from the National COVID Cohort Collaborative (N3C). Materials and Methods: We developed a comprehensive approach, named Hierarchy and rule-based pregnancy episode Inference integrated with Pregnancy Progression Signatures (HIPPS), and applied it to EHR data in the N3C (January 1, 2018-April 7, 2022). HIPPS combines: (1) an extension of a previously published pregnancy episode algorithm, (2) a novel algorithm to detect gestational age-specific signatures of a progressing pregnancy for further episode support, and (3) pregnancy start date inference. Clinicians performed validation of HIPPS on a subset of episodes. We then generated pregnancy cohorts based on gestational age precision and pregnancy outcomes for assessment of accuracy and comparison of COVID-19 and other characteristics. Results: We identified 628â165 pregnant persons with 816â471 pregnancy episodes, of which 52.3% were live births, 24.4% were other outcomes (stillbirth, ectopic pregnancy, abortions), and 23.3% had unknown outcomes. Clinician validation agreed 98.8% with HIPPS-identified episodes. We were able to estimate start dates within 1 week of precision for 475â433 (58.2%) episodes. 62â540 (7.7%) episodes had incident COVID-19 during pregnancy. Discussion: HIPPS provides measures of support for pregnancy-related variables such as gestational age and pregnancy outcomes based on N3C data. Gestational age precision allows researchers to find time to events with reasonable confidence. Conclusion: We have developed a novel and robust approach for inferring pregnancy episodes and gestational age that addresses data inconsistency and missingness in EHR data.

16.

An open natural language processing (NLP) framework for EHR-based clinical research: a case demonstration using the National COVID Cohort Collaborative (N3C).

Liu, Sijia; Wen, Andrew; Wang, Liwei; He, Huan; Fu, Sunyang; Miller, Robert; Williams, Andrew; Harris, Daniel; Kavuluru, Ramakanth; Liu, Mei; Abu-El-Rub, Noor; Schutte, Dalton; Zhang, Rui; Rouhizadeh, Masoud; Osborne, John D; He, Yongqun; Topaloglu, Umit; Hong, Stephanie S; Saltz, Joel H; Schaffter, Thomas; Pfaff, Emily; Chute, Christopher G; Duong, Tim; Haendel, Melissa A; Fuentes, Rafael; Szolovits, Peter; Xu, Hua; Liu, Hongfang.

J Am Med Inform Assoc ; 30(12): 2036-2040, 2023 11 17.

Artículo en Inglés | MEDLINE | ID: mdl-37555837

RESUMEN

Despite recent methodology advancements in clinical natural language processing (NLP), the adoption of clinical NLP models within the translational research community remains hindered by process heterogeneity and human factor variations. Concurrently, these factors also dramatically increase the difficulty in developing NLP models in multi-site settings, which is necessary for algorithm robustness and generalizability. Here, we reported on our experience developing an NLP solution for Coronavirus Disease 2019 (COVID-19) signs and symptom extraction in an open NLP framework from a subset of sites participating in the National COVID Cohort (N3C). We then empirically highlight the benefits of multi-site data for both symbolic and statistical methods, as well as highlight the need for federated annotation and evaluation to resolve several pitfalls encountered in the course of these efforts.

Asunto(s)

COVID-19 , Procesamiento de Lenguaje Natural , Humanos , Registros Electrónicos de Salud , Algoritmos

17.

Predicting nutrition and environmental factors associated with female reproductive disorders using a knowledge graph and random forests.

Chan, Lauren E; Casiraghi, Elena; Putman, Tim; Reese, Justin; Harmon, Quaker E; Schaper, Kevin; Hedge, Harshad; Valentini, Giorgio; Schmitt, Charles; Motsinger-Reif, Alison; Hall, Janet E; Mungall, Christopher J; Robinson, Peter N; Haendel, Melissa A.

medRxiv ; 2023 Jul 16.

Artículo en Inglés | MEDLINE | ID: mdl-37502882

RESUMEN

Objective: Female reproductive disorders (FRDs) are common health conditions that may present with significant symptoms. Diet and environment are potential areas for FRD interventions. We utilized a knowledge graph (KG) method to predict factors associated with common FRDs (e.g., endometriosis, ovarian cyst, and uterine fibroids). Materials and Methods: We harmonized survey data from the Personalized Environment and Genes Study on internal and external environmental exposures and health conditions with biomedical ontology content. We merged the harmonized data and ontologies with supplemental nutrient and agricultural chemical data to create a KG. We analyzed the KG by embedding edges and applying a random forest for edge prediction to identify variables potentially associated with FRDs. We also conducted logistic regression analysis for comparison. Results: Across 9765 PEGS respondents, the KG analysis resulted in 8535 significant predicted links between FRDs and chemicals, phenotypes, and diseases. Amongst these links, 32 were exact matches when compared with the logistic regression results, including comorbidities, medications, foods, and occupational exposures. Discussion: Mechanistic underpinnings of predicted links documented in the literature may support some of our findings. Our KG methods are useful for predicting possible associations in large, survey-based datasets with added information on directionality and magnitude of effect from logistic regression. These results should not be construed as causal, but can support hypothesis generation. Conclusion: This investigation enabled the generation of hypotheses on a variety of potential links between FRDs and exposures. Future investigations should prospectively evaluate the variables hypothesized to impact FRDs.

18.

The Medical Action Ontology: A Tool for Annotating and Analyzing Treatments and Clinical Management of Human Disease.

Carmody, Leigh C; Gargano, Michael A; Toro, Sabrina; Vasilevsky, Nicole A; Adam, Margaret P; Blau, Hannah; Chan, Lauren E; Gomez-Andres, David; Horvath, Rita; Kraus, Megan L; Ladewig, Markus S; Lewis-Smith, David; Lochmüller, Hanns; Matentzoglu, Nicolas A; Munoz-Torres, Monica C; Schuetz, Catharina; Seitz, Berthold; Similuk, Morgan N; Sparks, Teresa N; Strauss, Timmy; Swietlik, Emilia M; Thompson, Rachel; Zhang, Xingmin Aaron; Mungall, Christopher J; Haendel, Melissa A; Robinson, Peter N.

medRxiv ; 2023 Jul 13.

Artículo en Inglés | MEDLINE | ID: mdl-37503136

RESUMEN

Navigating the vast landscape of clinical literature to find optimal treatments and management strategies can be a challenging task, especially for rare diseases. To address this task, we introduce the Medical Action Ontology (MAxO), the first ontology specifically designed to organize medical procedures, therapies, and interventions in a structured way. Currently, MAxO contains 1757 medical action terms added through a combination of manual and semi-automated processes. MAxO was developed with logical structures that make it compatible with several other ontologies within the Open Biological and Biomedical Ontologies (OBO) Foundry. These cover a wide range of biomedical domains, from human anatomy and investigations to the chemical and protein entities involved in biological processes. We have created a database of over 16000 annotations that describe diagnostic modalities for specific phenotypic abnormalities as defined by the Human Phenotype Ontology (HPO). Additionally, 413 annotations are provided for medical actions for 189 rare diseases. We have developed a web application called POET (https://poet.jax.org/) for the community to use to contribute MAxO annotations. MAxO provides a computational representation of treatments and other actions taken for the clinical management of patients. The development of MAxO is closely coupled to the Mondo Disease Ontology (Mondo) and the Human Phenotype Ontology (HPO) and expands the scope of our computational modeling of diseases and phenotypic features to include diagnostics and therapeutic actions. MAxO is available under the open-source CC-BY 4.0 license (https://github.com/monarch-initiative/MAxO).

19.

KG-Hub-building and exchanging biological knowledge graphs.

Caufield, J Harry; Putman, Tim; Schaper, Kevin; Unni, Deepak R; Hegde, Harshad; Callahan, Tiffany J; Cappelletti, Luca; Moxon, Sierra A T; Ravanmehr, Vida; Carbon, Seth; Chan, Lauren E; Cortes, Katherina; Shefchek, Kent A; Elsarboukh, Glass; Balhoff, Jim; Fontana, Tommaso; Matentzoglu, Nicolas; Bruskiewich, Richard M; Thessen, Anne E; Harris, Nomi L; Munoz-Torres, Monica C; Haendel, Melissa A; Robinson, Peter N; Joachimiak, Marcin P; Mungall, Christopher J; Reese, Justin T.

Bioinformatics ; 39(7)2023 07 01.

Artículo en Inglés | MEDLINE | ID: mdl-37389415

RESUMEN

MOTIVATION: Knowledge graphs (KGs) are a powerful approach for integrating heterogeneous data and making inferences in biology and many other domains, but a coherent solution for constructing, exchanging, and facilitating the downstream use of KGs is lacking. RESULTS: Here we present KG-Hub, a platform that enables standardized construction, exchange, and reuse of KGs. Features include a simple, modular extract-transform-load pattern for producing graphs compliant with Biolink Model (a high-level data model for standardizing biological data), easy integration of any OBO (Open Biological and Biomedical Ontologies) ontology, cached downloads of upstream data sources, versioned and automatically updated builds with stable URLs, web-browsable storage of KG artifacts on cloud infrastructure, and easy reuse of transformed subgraphs across projects. Current KG-Hub projects span use cases including COVID-19 research, drug repurposing, microbial-environmental interactions, and rare disease research. KG-Hub is equipped with tooling to easily analyze and manipulate KGs. KG-Hub is also tightly integrated with graph machine learning (ML) tools which allow automated graph ML, including node embeddings and training of models for link prediction and node classification. AVAILABILITY AND IMPLEMENTATION: https://kghub.org.

Asunto(s)

Ontologías Biológicas , COVID-19 , Humanos , Reconocimiento de Normas Patrones Automatizadas , Enfermedades Raras , Aprendizaje Automático

20.

Ontologizing health systems data at scale: making translational discovery a reality.

Callahan, Tiffany J; Stefanski, Adrianne L; Wyrwa, Jordan M; Zeng, Chenjie; Ostropolets, Anna; Banda, Juan M; Baumgartner, William A; Boyce, Richard D; Casiraghi, Elena; Coleman, Ben D; Collins, Janine H; Deakyne Davies, Sara J; Feinstein, James A; Lin, Asiyah Y; Martin, Blake; Matentzoglu, Nicolas A; Meeker, Daniella; Reese, Justin; Sinclair, Jessica; Taneja, Sanya B; Trinkley, Katy E; Vasilevsky, Nicole A; Williams, Andrew E; Zhang, Xingmin A; Denny, Joshua C; Ryan, Patrick B; Hripcsak, George; Bennett, Tellen D; Haendel, Melissa A; Robinson, Peter N; Hunter, Lawrence E; Kahn, Michael G.

NPJ Digit Med ; 6(1): 89, 2023 May 19.

Artículo en Inglés | MEDLINE | ID: mdl-37208468

RESUMEN

Common data models solve many challenges of standardizing electronic health record (EHR) data but are unable to semantically integrate all of the resources needed for deep phenotyping. Open Biological and Biomedical Ontology (OBO) Foundry ontologies provide computable representations of biological knowledge and enable the integration of heterogeneous data. However, mapping EHR data to OBO ontologies requires significant manual curation and domain expertise. We introduce OMOP2OBO, an algorithm for mapping Observational Medical Outcomes Partnership (OMOP) vocabularies to OBO ontologies. Using OMOP2OBO, we produced mappings for 92,367 conditions, 8611 drug ingredients, and 10,673 measurement results, which covered 68-99% of concepts used in clinical practice when examined across 24 hospitals. When used to phenotype rare disease patients, the mappings helped systematically identify undiagnosed patients who might benefit from genetic testing. By aligning OMOP vocabularies to OBO ontologies our algorithm presents new opportunities to advance EHR-based deep phenotyping.

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA