Search | Nursing VHL Search Portal

1.

The Monarch Initiative in 2024: an analytic platform integrating phenotypes, genes and diseases across species.

Putman, Tim E; Schaper, Kevin; Matentzoglu, Nicolas; Rubinetti, Vincent P; Alquaddoomi, Faisal S; Cox, Corey; Caufield, J Harry; Elsarboukh, Glass; Gehrke, Sarah; Hegde, Harshad; Reese, Justin T; Braun, Ian; Bruskiewich, Richard M; Cappelletti, Luca; Carbon, Seth; Caron, Anita R; Chan, Lauren E; Chute, Christopher G; Cortes, Katherina G; De Souza, Vinícius; Fontana, Tommaso; Harris, Nomi L; Hartley, Emily L; Hurwitz, Eric; Jacobsen, Julius O B; Krishnamurthy, Madan; Laraway, Bryan J; McLaughlin, James A; McMurry, Julie A; Moxon, Sierra A T; Mullen, Kathleen R; O'Neil, Shawn T; Shefchek, Kent A; Stefancsik, Ray; Toro, Sabrina; Vasilevsky, Nicole A; Walls, Ramona L; Whetzel, Patricia L; Osumi-Sutherland, David; Smedley, Damian; Robinson, Peter N; Mungall, Christopher J; Haendel, Melissa A; Munoz-Torres, Monica C.

Nucleic Acids Res ; 52(D1): D938-D949, 2024 Jan 05.

Article in English | MEDLINE | ID: mdl-38000386

ABSTRACT

Bridging the gap between genetic variations, environmental determinants, and phenotypic outcomes is critical for supporting clinical diagnosis and understanding mechanisms of diseases. It requires integrating open data at a global scale. The Monarch Initiative advances these goals by developing open ontologies, semantic data models, and knowledge graphs for translational research. The Monarch App is an integrated platform combining data about genes, phenotypes, and diseases across species. Monarch's APIs enable access to carefully curated datasets and advanced analysis tools that support the understanding and diagnosis of disease for diverse applications such as variant prioritization, deep phenotyping, and patient profile-matching. We have migrated our system into a scalable, cloud-based infrastructure; simplified Monarch's data ingestion and knowledge graph integration systems; enhanced data mapping and integration standards; and developed a new user interface with novel search and graph navigation features. Furthermore, we advanced Monarch's analytic tools by developing a customized plugin for OpenAI's ChatGPT to increase the reliability of its responses about phenotypic data, allowing us to interrogate the knowledge in the Monarch graph using state-of-the-art Large Language Models. The resources of the Monarch Initiative can be found at monarchinitiative.org and its corresponding code repository at github.com/monarch-initiative/monarch-app.

Subject(s)

Databases, Factual , Disease , Genes , Phenotype , Humans , Internet , Databases, Factual/standards , Software , Genes/genetics , Disease/genetics

2.

The Human Phenotype Ontology in 2024: phenotypes around the world.

Gargano, Michael A; Matentzoglu, Nicolas; Coleman, Ben; Addo-Lartey, Eunice B; Anagnostopoulos, Anna V; Anderton, Joel; Avillach, Paul; Bagley, Anita M; Bakstein, Eduard; Balhoff, James P; Baynam, Gareth; Bello, Susan M; Berk, Michael; Bertram, Holli; Bishop, Somer; Blau, Hannah; Bodenstein, David F; Botas, Pablo; Boztug, Kaan; Cady, Jolana; Callahan, Tiffany J; Cameron, Rhiannon; Carbon, Seth J; Castellanos, Francisco; Caufield, J Harry; Chan, Lauren E; Chute, Christopher G; Cruz-Rojo, Jaime; Dahan-Oliel, Noémi; Davids, Jon R; de Dieuleveult, Maud; de Souza, Vinicius; de Vries, Bert B A; de Vries, Esther; DePaulo, J Raymond; Derfalvi, Beata; Dhombres, Ferdinand; Diaz-Byrd, Claudia; Dingemans, Alexander J M; Donadille, Bruno; Duyzend, Michael; Elfeky, Reem; Essaid, Shahim; Fabrizzi, Carolina; Fico, Giovanna; Firth, Helen V; Freudenberg-Hua, Yun; Fullerton, Janice M; Gabriel, Davera L; Gilmour, Kimberly.

Nucleic Acids Res ; 52(D1): D1333-D1346, 2024 Jan 05.

Article in English | MEDLINE | ID: mdl-37953324

ABSTRACT

The Human Phenotype Ontology (HPO) is a widely used resource that comprehensively organizes and defines the phenotypic features of human disease, enabling computational inference and supporting genomic and phenotypic analyses through semantic similarity and machine learning algorithms. The HPO has widespread applications in clinical diagnostics and translational research, including genomic diagnostics, gene-disease discovery, and cohort analytics. In recent years, groups around the world have developed translations of the HPO from English to other languages, and the HPO browser has been internationalized, allowing users to view HPO term labels and in many cases synonyms and definitions in ten languages in addition to English. Since our last report, a total of 2239 new HPO terms and 49235 new HPO annotations were developed, many in collaboration with external groups in the fields of psychiatry, arthrogryposis, immunology and cardiology. The Medical Action Ontology (MAxO) is a new effort to model treatments and other measures taken for clinical management. Finally, the HPO consortium is contributing to efforts to integrate the HPO and the GA4GH Phenopacket Schema into electronic health records (EHRs) with the goal of more standardized and computable integration of rare disease data in EHRs.
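The semantic-similarity idea mentioned in this abstract can be illustrated with a minimal sketch. The toy hierarchy, its `T:` term identifiers, and the choice of a Jaccard measure over ancestor sets are all assumptions made for illustration; none of them come from the HPO itself or from the paper.

```python
# Toy is-a hierarchy: term -> list of parent terms.
# These identifiers are hypothetical and are NOT real HPO term IDs.
PARENTS = {
    "T:root": [],
    "T:nervous": ["T:root"],
    "T:seizure": ["T:nervous"],
    "T:focal_seizure": ["T:seizure"],
    "T:tremor": ["T:nervous"],
    "T:cardiac": ["T:root"],
}


def ancestors(term):
    """Return the term itself plus every ancestor reachable via is-a links."""
    seen = set()
    stack = [term]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(PARENTS[current])
    return seen


def jaccard_similarity(term_a, term_b):
    """Shared ancestors divided by all ancestors of either term."""
    anc_a, anc_b = ancestors(term_a), ancestors(term_b)
    return len(anc_a & anc_b) / len(anc_a | anc_b)


if __name__ == "__main__":
    print(jaccard_similarity("T:focal_seizure", "T:tremor"))
    print(jaccard_similarity("T:focal_seizure", "T:cardiac"))
```

In this toy graph, two terms under the same branch score higher than terms that share only the root, which is the intuition behind phenotype matching. Production tools typically use information-content-based measures rather than plain Jaccard.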

Subject(s)

Biological Ontologies , Humans , Phenotype , Genomics , Algorithms , Rare Diseases

3.

Predictive Utility of Polygenic Risk Scores for Coronary Heart Disease in Three Major Racial and Ethnic Groups.

Dikilitas, Ozan; Schaid, Daniel J; Kosel, Matthew L; Carroll, Robert J; Chute, Christopher G; Denny, Joshua A; Fedotov, Alex; Feng, QiPing; Hakonarson, Hakon; Jarvik, Gail P; Lee, Ming Ta Michael; Pacheco, Jennifer A; Rowley, Robb; Sleiman, Patrick M; Stein, C Michael; Sturm, Amy C; Wei, Wei-Qi; Wiesner, Georgia L; Williams, Marc S; Zhang, Yanfei; Manolio, Teri A; Kullo, Iftikhar J.

Am J Hum Genet ; 106(5): 707-716, 2020 05 07.

Article in English | MEDLINE | ID: mdl-32386537

ABSTRACT

Because polygenic risk scores (PRSs) for coronary heart disease (CHD) are derived mainly from European ancestry (EA) cohorts, their validity in African ancestry (AA) and Hispanic ethnicity (HE) individuals is unclear. We investigated associations of "restricted" and genome-wide PRSs with CHD in three major racial and ethnic groups in the U.S. The eMERGE cohort (mean age 48 ± 14 years, 58% female) included 45,645 EA, 7,597 AA, and 2,493 HE individuals. We assessed two restricted PRSs (PRS_Tikkanen and PRS_Tada; 28 and 50 variants, respectively) and two genome-wide PRSs (PRS_metaGRS and PRS_LDPred; 1.7 M and 6.6 M variants, respectively) derived from EA cohorts. Over a median follow-up of 11.1 years, 2,652 incident CHD events occurred. Hazard and odds ratios for the association of PRSs with CHD were similar in EA and HE cohorts but lower in AA cohorts. Genome-wide PRSs were more strongly associated with CHD than restricted PRSs were. PRS_metaGRS, the best-performing PRS, was associated with CHD in all three cohorts; hazard ratios (95% CI) per 1 SD increase were 1.53 (1.46-1.60), 1.53 (1.23-1.90), and 1.27 (1.13-1.43) for incident CHD in EA, HE, and AA individuals, respectively. The hazard ratios were comparable in the EA and HE cohorts (p_interaction = 0.77) but were significantly attenuated in AA individuals (p_interaction = 2.9 × 10⁻³). These results highlight the potential clinical utility of PRSs for CHD as well as the need to assemble diverse cohorts to generate ancestry- and ethnicity-specific PRSs.
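In its simplest form, a PRS of the kind evaluated here is a weighted sum of risk-allele dosages. The sketch below uses hypothetical effect weights and dosages, and it assumes a log-linear hazard model when converting a per-SD hazard ratio into a relative hazard. It is an illustration of the arithmetic only, not the eMERGE analysis pipeline.

```python
import statistics


def polygenic_risk_score(dosages, weights):
    """Weighted sum of allele dosages (0, 1, or 2 copies per variant)."""
    if len(dosages) != len(weights):
        raise ValueError("dosages and weights must have the same length")
    return sum(d * w for d, w in zip(dosages, weights))


def standardize(scores):
    """Convert raw scores to z-scores so effects can be read 'per 1 SD'."""
    mean = statistics.fmean(scores)
    sd = statistics.pstdev(scores)
    return [(s - mean) / sd for s in scores]


def relative_hazard(z_score, hr_per_sd):
    """Relative hazard versus the cohort mean, assuming log-linearity."""
    return hr_per_sd ** z_score


if __name__ == "__main__":
    # Hypothetical per-variant weights (log effect sizes) and three individuals.
    weights = [0.10, 0.20, 0.30]
    cohort = [[0, 1, 2], [1, 1, 0], [2, 0, 1]]
    raw = [polygenic_risk_score(person, weights) for person in cohort]
    print(raw, standardize(raw))
    # Under this assumption, a score 2 SD above the mean with HR 1.53 per SD:
    print(relative_hazard(2, 1.53))
```

Under the log-linear assumption, the reported hazard ratio of 1.53 per SD implies a relative hazard of 1.53 squared for an individual two SDs above the mean.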

Subject(s)

Black or African American/genetics , Coronary Disease/genetics , Genetic Predisposition to Disease , Hispanic or Latino/genetics , Multifactorial Inheritance/genetics , White People/genetics , Cohort Studies , Female , Humans , Male , Middle Aged , Odds Ratio

4.

Coding long COVID: characterizing a new disease through an ICD-10 lens.

Pfaff, Emily R; Madlock-Brown, Charisse; Baratta, John M; Bhatia, Abhishek; Davis, Hannah; Girvin, Andrew; Hill, Elaine; Kelly, Elizabeth; Kostka, Kristin; Loomba, Johanna; McMurry, Julie A; Wong, Rachel; Bennett, Tellen D; Moffitt, Richard; Chute, Christopher G; Haendel, Melissa.

BMC Med ; 21(1): 58, 2023 02 16.

Article in English | MEDLINE | ID: mdl-36793086

ABSTRACT

BACKGROUND: Naming a newly discovered disease is a difficult process; in the context of the COVID-19 pandemic and the existence of post-acute sequelae of SARS-CoV-2 infection (PASC), which includes long COVID, it has proven especially challenging. Disease definitions and assignment of a diagnosis code are often asynchronous and iterative. The clinical definition and our understanding of the underlying mechanisms of long COVID are still in flux, and the deployment of an ICD-10-CM code for long COVID in the USA took nearly 2 years after patients had begun to describe their condition. Here, we leverage the largest publicly available HIPAA-limited dataset about patients with COVID-19 in the US to examine the heterogeneity of adoption and use of U09.9, the ICD-10-CM code for "Post COVID-19 condition, unspecified." METHODS: We undertook a number of analyses to characterize the N3C population with a U09.9 diagnosis code (n = 33,782), including assessing person-level demographics and a number of area-level social determinants of health; diagnoses commonly co-occurring with U09.9, clustered using the Louvain algorithm; and quantifying medications and procedures recorded within 60 days of U09.9 diagnosis. We stratified all analyses by age group in order to discern differing patterns of care across the lifespan. RESULTS: We established the diagnoses most commonly co-occurring with U09.9 and algorithmically clustered them into four major categories: cardiopulmonary, neurological, gastrointestinal, and comorbid conditions. Importantly, we discovered that the population of patients diagnosed with U09.9 is demographically skewed toward female, White, non-Hispanic individuals, as well as individuals living in areas with low poverty and low unemployment. Our results also include a characterization of common procedures and medications associated with U09.9-coded patients. 
CONCLUSIONS: This work offers insight into potential subtypes and current practice patterns around long COVID and speaks to the existence of disparities in the diagnosis of patients with long COVID. This latter finding in particular requires further research and urgent remediation.

Subject(s)

COVID-19 , Post-Acute COVID-19 Syndrome , Humans , Female , International Classification of Diseases , Pandemics , COVID-19/diagnosis , COVID-19/epidemiology , SARS-CoV-2

5.

The Human Phenotype Ontology in 2021.

Köhler, Sebastian; Gargano, Michael; Matentzoglu, Nicolas; Carmody, Leigh C; Lewis-Smith, David; Vasilevsky, Nicole A; Danis, Daniel; Balagura, Ganna; Baynam, Gareth; Brower, Amy M; Callahan, Tiffany J; Chute, Christopher G; Est, Johanna L; Galer, Peter D; Ganesan, Shiva; Griese, Matthias; Haimel, Matthias; Pazmandi, Julia; Hanauer, Marc; Harris, Nomi L; Hartnett, Michael J; Hastreiter, Maximilian; Hauck, Fabian; He, Yongqun; Jeske, Tim; Kearney, Hugh; Kindle, Gerhard; Klein, Christoph; Knoflach, Katrin; Krause, Roland; Lagorce, David; McMurry, Julie A; Miller, Jillian A; Munoz-Torres, Monica C; Peters, Rebecca L; Rapp, Christina K; Rath, Ana M; Rind, Shahmir A; Rosenberg, Avi Z; Segal, Michael M; Seidel, Markus G; Smedley, Damian; Talmy, Tomer; Thomas, Yarlalu; Wiafe, Samuel A; Xian, Julie; Yüksel, Zafer; Helbig, Ingo; Mungall, Christopher J; Haendel, Melissa A.

Nucleic Acids Res ; 49(D1): D1207-D1217, 2021 01 08.

Article in English | MEDLINE | ID: mdl-33264411

ABSTRACT

The Human Phenotype Ontology (HPO, https://hpo.jax.org) was launched in 2008 to provide a comprehensive logical standard to describe and computationally analyze phenotypic abnormalities found in human disease. The HPO is now a worldwide standard for phenotype exchange. The HPO has grown steadily since its inception due to considerable contributions from clinical experts and researchers from a diverse range of disciplines. Here, we present recent major extensions of the HPO for neurology, nephrology, immunology, pulmonology, newborn screening, and other areas. For example, the seizure subontology now reflects the International League Against Epilepsy (ILAE) guidelines and these enhancements have already shown clinical validity. We present new efforts to harmonize computational definitions of phenotypic abnormalities across the HPO and multiple phenotype ontologies used for animal models of disease. These efforts will benefit software such as Exomiser by improving the accuracy and scope of cross-species phenotype matching. The computational modeling strategy used by the HPO to define disease entities and phenotypic features and distinguish between them is explained in detail. We also report on recent efforts to translate the HPO into indigenous languages. Finally, we summarize recent advances in the use of HPO in electronic health record systems.

Subject(s)

Biological Ontologies , Computational Biology/methods , Databases, Factual , Disease/genetics , Genome , Phenotype , Software , Animals , Disease Models, Animal , Genotype , Humans , Infant, Newborn , International Cooperation , Internet , Neonatal Screening/methods , Pharmacogenetics/methods , Terminology as Topic

6.

Risk factors associated with post-acute sequelae of SARS-CoV-2: an N3C and NIH RECOVER study.

Hill, Elaine L; Mehta, Hemalkumar B; Sharma, Suchetha; Mane, Klint; Singh, Sharad Kumar; Xie, Catherine; Cathey, Emily; Loomba, Johanna; Russell, Seth; Spratt, Heidi; DeWitt, Peter E; Ammar, Nariman; Madlock-Brown, Charisse; Brown, Donald; McMurry, Julie A; Chute, Christopher G; Haendel, Melissa A; Moffitt, Richard; Pfaff, Emily R; Bennett, Tellen D.

BMC Public Health ; 23(1): 2103, 2023 10 25.

Article in English | MEDLINE | ID: mdl-37880596

ABSTRACT

BACKGROUND: More than one-third of individuals experience post-acute sequelae of SARS-CoV-2 infection (PASC, which includes long-COVID). The objective is to identify risk factors associated with PASC/long-COVID diagnosis. METHODS: This was a retrospective case-control study including 31 health systems in the United States from the National COVID Cohort Collaborative (N3C). A total of 8,325 individuals with PASC (defined by the presence of the International Classification of Diseases, version 10 code U09.9 or a long-COVID clinic visit) were matched to 41,625 controls within the same health system, with a COVID index date within ± 45 days of the corresponding case's earliest COVID index date. Measurements of risk factors included demographics, comorbidities, treatment and acute characteristics related to COVID-19. Multivariable logistic regression, random forest, and XGBoost were used to determine the associations between risk factors and PASC. RESULTS: Among 8,325 individuals with PASC, the majority were > 50 years of age (56.6%), female (62.8%), and non-Hispanic White (68.6%). In logistic regression, middle-age categories (40 to 69 years; OR ranging from 2.32 to 2.58), female sex (OR 1.4, 95% CI 1.33-1.48), hospitalization associated with COVID-19 (OR 3.8, 95% CI 3.05-4.73), long (8-30 days, OR 1.69, 95% CI 1.31-2.17) or extended hospital stay (30 + days, OR 3.38, 95% CI 2.45-4.67), receipt of mechanical ventilation (OR 1.44, 95% CI 1.18-1.74), and several comorbidities including depression (OR 1.50, 95% CI 1.40-1.60), chronic lung disease (OR 1.63, 95% CI 1.53-1.74), and obesity (OR 1.23, 95% CI 1.16-1.3) were associated with increased likelihood of PASC diagnosis or care at a long-COVID clinic. Characteristics associated with a lower likelihood of PASC diagnosis or care at a long-COVID clinic included younger age (18 to 29 years), male sex, non-Hispanic Black race, and comorbidities such as substance abuse, cardiomyopathy, psychosis, and dementia. 
More doctors per capita in the county of residence was associated with an increased likelihood of PASC diagnosis or care at a long-COVID clinic. Our findings were consistent in sensitivity analyses using a variety of analytic techniques and approaches to select controls. CONCLUSIONS: This national study identified important risk factors for PASC diagnosis such as middle age, severe COVID-19 disease, and specific comorbidities. Further clinical and epidemiological research is needed to better understand underlying mechanisms and the potential role of vaccines and therapeutics in altering PASC course.
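The odds ratios and 95% confidence intervals reported in this abstract come from multivariable models. As a simpler illustration of what such figures express, the sketch below computes an unadjusted odds ratio with a Wald interval from a 2×2 table. The counts are hypothetical and the function name is an assumption; the study's adjusted estimates cannot be reproduced this way. Note also that 41,625 controls for 8,325 cases corresponds to five controls per case.

```python
import math


def odds_ratio_ci(exposed_cases, unexposed_cases,
                  exposed_controls, unexposed_controls, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval.

    Returns (odds_ratio, lower, upper). All four cells must be positive.
    """
    cells = (exposed_cases, unexposed_cases, exposed_controls, unexposed_controls)
    if min(cells) <= 0:
        raise ValueError("all cell counts must be positive")
    odds_ratio = (exposed_cases * unexposed_controls) / (
        unexposed_cases * exposed_controls
    )
    # Standard error of log(OR) is the root of the summed reciprocal counts.
    se_log_or = math.sqrt(sum(1 / count for count in cells))
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


if __name__ == "__main__":
    # Hypothetical counts, chosen only to exercise the function.
    print(odds_ratio_ci(20, 80, 10, 90))
    # Matching ratio implied by the abstract's case and control counts.
    print(41625 / 8325)
```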

Subject(s)

COVID-19 , SARS-CoV-2 , Middle Aged , Female , Male , Humans , Adult , Aged , Adolescent , Young Adult , COVID-19/epidemiology , Post-Acute COVID-19 Syndrome , Case-Control Studies , Retrospective Studies , Risk Factors , Disease Progression

7.

ICD-11: A catalyst for advancing patient safety surveillance globally.

Forster, Alan J; Chute, Christopher G; Pincus, Harold Alan; Ghali, William A.

BMC Med Inform Decis Mak ; 21(Suppl 6): 383, 2023 03 09.

Article in English | MEDLINE | ID: mdl-36894925

ABSTRACT

The World Health Organization's (WHO) International Classification of Diseases, 11th Revision (ICD-11) contains several features that enable improved classification of patient safety events. We have identified three suggestions to facilitate adoption of ICD-11 from the patient safety perspective. First, health system leaders at national, regional, and local levels should incorporate ICD-11 into all approaches to monitor patient safety. This will allow them to take advantage of the innovative patient safety classification methods embedded in ICD-11 to overcome several limitations related to existing patient safety surveillance methods. Second, application developers should incorporate ICD-11 into software solutions. This will accelerate adoption and utility of software-enabled clinical and administrative workflows relevant to patient safety management. This is made possible by the ICD-11 application programming interface (API) developed by the WHO. Third, health system leaders should adopt ICD-11 using a continuous improvement framework. This will help leaders at national, regional and local levels to take advantage of specific existing initiatives which will be strengthened by ICD-11, including peer review comparisons, clinician engagement, and alignment of front-line safety efforts with post-marketing surveillance of medical technologies. While the investment to adopt ICD-11 will be considerable, it will be offset by reducing the ongoing costs related to a lack of accurate routine information.

Subject(s)

International Classification of Diseases , Patient Safety , Humans , Global Health , Patients , Software

8.

NSAID use and clinical outcomes in COVID-19 patients: a 38-center retrospective cohort study.

Reese, Justin T; Coleman, Ben; Chan, Lauren; Blau, Hannah; Callahan, Tiffany J; Cappelletti, Luca; Fontana, Tommaso; Bradwell, Katie R; Harris, Nomi L; Casiraghi, Elena; Valentini, Giorgio; Karlebach, Guy; Deer, Rachel; McMurry, Julie A; Haendel, Melissa A; Chute, Christopher G; Pfaff, Emily; Moffitt, Richard; Spratt, Heidi; Singh, Jasvinder A; Mungall, Christopher J; Williams, Andrew E; Robinson, Peter N.

Virol J ; 19(1): 84, 2022 05 15.

Article in English | MEDLINE | ID: mdl-35570298

ABSTRACT

BACKGROUND: Non-steroidal anti-inflammatory drugs (NSAIDs) are commonly used to reduce pain, fever, and inflammation but have been associated with complications in community-acquired pneumonia. Observations shortly after the start of the COVID-19 pandemic in 2020 suggested that ibuprofen was associated with an increased risk of adverse events in COVID-19 patients, but subsequent observational studies failed to demonstrate increased risk and in one case showed reduced risk associated with NSAID use. METHODS: A 38-center retrospective cohort study was performed that leveraged the harmonized, high-granularity electronic health record data of the National COVID Cohort Collaborative. A propensity-matched cohort of 19,746 COVID-19 inpatients was constructed by matching cases (treated with NSAIDs at the time of admission) and 19,746 controls (not treated) from 857,061 patients with COVID-19 available for analysis. The primary outcome of interest was COVID-19 severity in hospitalized patients, which was classified as: moderate, severe, or mortality/hospice. Secondary outcomes were acute kidney injury (AKI), extracorporeal membrane oxygenation (ECMO), invasive ventilation, and all-cause mortality at any time following COVID-19 diagnosis. RESULTS: Logistic regression showed that NSAID use was not associated with increased COVID-19 severity (OR: 0.57 95% CI: 0.53-0.61). Analysis of secondary outcomes using logistic regression showed that NSAID use was not associated with increased risk of all-cause mortality (OR 0.51 95% CI: 0.47-0.56), invasive ventilation (OR: 0.59 95% CI: 0.55-0.64), AKI (OR: 0.67 95% CI: 0.63-0.72), or ECMO (OR: 0.51 95% CI: 0.36-0.7). In contrast, the odds ratios indicate reduced risk of these outcomes, but our quantitative bias analysis showed E-values of between 1.9 and 3.3 for these associations, indicating that comparatively weak or moderate confounder associations could explain away the observed associations. 
CONCLUSIONS: Study interpretation is limited by the observational design. Recording of NSAID use may have been incomplete. Our study demonstrates that NSAID use is not associated with increased COVID-19 severity, all-cause mortality, invasive ventilation, AKI, or ECMO in COVID-19 inpatients. A conservative interpretation in light of the quantitative bias analysis is that there is no evidence that NSAID use is associated with risk of increased severity or the other measured outcomes. Our results confirm and extend analogous findings in previous observational studies using a large cohort of patients drawn from 38 centers in a nationally representative multicenter database.
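The E-values cited in the RESULTS can be illustrated with the standard formula of VanderWeele and Ding, E = RR + sqrt(RR × (RR − 1)), applied to the reciprocal of the estimate when it is below 1. The sketch below treats each odds ratio as an approximation of a risk ratio, which is an assumption that holds only for rare outcomes; the abstract does not state which conversion the authors used.

```python
import math


def e_value(risk_ratio):
    """E-value for a point estimate on the risk-ratio scale.

    Protective estimates (below 1) are inverted first, so the result is
    always at least 1.
    """
    if risk_ratio <= 0:
        raise ValueError("risk ratio must be positive")
    if risk_ratio < 1:
        risk_ratio = 1 / risk_ratio
    return risk_ratio + math.sqrt(risk_ratio * (risk_ratio - 1))


if __name__ == "__main__":
    # Odds ratios taken from the abstract, used here as approximate risk ratios.
    for label, estimate in [("severity", 0.57), ("mortality", 0.51), ("AKI", 0.67)]:
        print(label, round(e_value(estimate), 2))
```

With this approximation, the all-cause mortality estimate (OR 0.51) gives an E-value of about 3.3, which matches the upper end of the range reported in the abstract. The lower end of the reported range may reflect a different conversion or the confidence limits rather than the point estimates.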

Subject(s)

Acute Kidney Injury , COVID-19 , Anti-Inflammatory Agents, Non-Steroidal/adverse effects , COVID-19 Testing , Cohort Studies , Humans , Pandemics , Retrospective Studies

9.

Developing an ETL tool for converting the PCORnet CDM into the OMOP CDM to facilitate the COVID-19 data integration.

Yu, Yue; Zong, Nansu; Wen, Andrew; Liu, Sijia; Stone, Daniel J; Knaack, David; Chamberlain, Alanna M; Pfaff, Emily; Gabriel, Davera; Chute, Christopher G; Shah, Nilay; Jiang, Guoqian.

J Biomed Inform ; 127: 104002, 2022 03.

Article in English | MEDLINE | ID: mdl-35077901

ABSTRACT

OBJECTIVE: The large-scale collection of observational data and digital technologies could help curb the COVID-19 pandemic. However, the coexistence of multiple Common Data Models (CDMs) and the lack of extract, transform, and load (ETL) tools between different CDMs cause potential interoperability issues between different data systems. The objective of this study is to design, develop, and evaluate an ETL tool that transforms PCORnet CDM format data into the OMOP CDM. METHODS: We developed an open-source ETL tool to facilitate data conversion from the PCORnet CDM to the OMOP CDM. The ETL tool was evaluated using a dataset with 1000 patients randomly selected from the PCORnet CDM at Mayo Clinic. Information loss, data mapping accuracy, and gap analysis approaches were used to assess the performance of the ETL tool. We designed an experiment to conduct a real-world COVID-19 surveillance task to assess the feasibility of the ETL tool. We also assessed the capacity of the ETL tool for COVID-19 data surveillance using data collection criteria of the MN EHR Consortium COVID-19 project. RESULTS: After the ETL process, all the records of 1000 patients from 18 PCORnet CDM tables were successfully transformed into 12 OMOP CDM tables. The information loss for all the concept mapping was less than 0.61%. The string mapping process for the unit concepts lost 2.84% of records. Almost all the fields in the manual mapping process achieved 0% information loss, except the specialty concept mapping. Moreover, the mapping accuracy for all the fields was 100%. The COVID-19 surveillance task collected almost the same set of cases (99.3% overlap) from the original PCORnet CDM and the target OMOP CDM separately. Finally, all the data elements for the MN EHR Consortium COVID-19 project could be captured from both the PCORnet CDM and the OMOP CDM. 
CONCLUSION: We demonstrated that our ETL tool could satisfy the data conversion requirements between the PCORnet CDM and the OMOP CDM. The outcome of the work would facilitate data retrieval, communication, sharing, and analysis between different institutions, not only for COVID-19-related projects but also for other real-world evidence-based observational studies.
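The kind of concept mapping and information-loss measurement described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' tool: the function names, the two-entry lookup table, and the definition of information loss as the share of rows left without a matching concept are all assumptions. The field names follow the PCORnet DEMOGRAPHIC and OMOP person tables, and concept ID 0 is OMOP's conventional "no matching concept" value.

```python
from datetime import date

# Assumed lookup from PCORnet DEMOGRAPHIC.SEX codes to OMOP gender concept IDs.
# Codes outside this table fall back to 0 (no matching concept).
SEX_TO_GENDER_CONCEPT = {"F": 8532, "M": 8507}


def demographic_to_person(row, person_id):
    """Map one PCORnet DEMOGRAPHIC row (a dict) to an OMOP person row."""
    birth = date.fromisoformat(row["BIRTH_DATE"])
    return {
        "person_id": person_id,
        "gender_concept_id": SEX_TO_GENDER_CONCEPT.get(row["SEX"], 0),
        "year_of_birth": birth.year,
        "month_of_birth": birth.month,
        "day_of_birth": birth.day,
        # Source values are kept so that unmapped codes remain traceable.
        "person_source_value": row["PATID"],
        "gender_source_value": row["SEX"],
    }


def information_loss(records, field):
    """Fraction of records whose concept field was left unmapped (0)."""
    unmapped = sum(1 for record in records if record[field] == 0)
    return unmapped / len(records)


if __name__ == "__main__":
    # Hypothetical source rows.
    source = [
        {"PATID": "P1", "BIRTH_DATE": "1980-05-17", "SEX": "F"},
        {"PATID": "P2", "BIRTH_DATE": "1975-11-02", "SEX": "M"},
        {"PATID": "P3", "BIRTH_DATE": "1990-01-30", "SEX": "UN"},
        {"PATID": "P4", "BIRTH_DATE": "2001-08-09", "SEX": "F"},
    ]
    persons = [demographic_to_person(r, i) for i, r in enumerate(source, start=1)]
    print(information_loss(persons, "gender_concept_id"))
```

Keeping the source value alongside the mapped concept ID is what makes a loss figure like this auditable after the transformation.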

Subject(s)

COVID-19 , COVID-19/epidemiology , Databases, Factual , Electronic Health Records , Humans , Information Storage and Retrieval , Pandemics , SARS-CoV-2

10.

FHIR-Ontop-OMOP: Building clinical knowledge graphs in FHIR RDF with the OMOP Common data Model.

Xiao, Guohui; Pfaff, Emily; Prud'hommeaux, Eric; Booth, David; Sharma, Deepak K; Huo, Nan; Yu, Yue; Zong, Nansu; Ruddy, Kathryn J; Chute, Christopher G; Jiang, Guoqian.

J Biomed Inform ; 134: 104201, 2022 10.

Article in English | MEDLINE | ID: mdl-36089199

ABSTRACT

BACKGROUND: Knowledge graphs (KGs) play a key role in enabling explainable artificial intelligence (AI) applications in healthcare. Constructing clinical knowledge graphs (CKGs) against heterogeneous electronic health records (EHRs) has been desired by the research and healthcare AI communities. From the standardization perspective, community-based standards such as the Fast Healthcare Interoperability Resources (FHIR) and the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) are increasingly used to represent and standardize EHR data for clinical data analytics; however, the potential of such standards for building CKGs has not been well investigated. OBJECTIVE: To develop and evaluate methods and tools that expose OMOP CDM-based clinical data repositories as virtual clinical KGs that are compliant with the FHIR Resource Description Framework (RDF) specification. METHODS: We developed a system called FHIR-Ontop-OMOP to generate virtual clinical KGs from OMOP relational databases. We leveraged an OMOP CDM-based Medical Information Mart for Intensive Care (MIMIC-III) data repository to evaluate the FHIR-Ontop-OMOP system in terms of the faithfulness of data transformation and the conformance of the generated CKGs to the FHIR RDF specification. RESULTS: A beta version of the system has been released. A total of more than 100 data element mappings from 11 OMOP CDM clinical data, health system and vocabulary tables were implemented in the system, covering 11 FHIR resources. The generated virtual CKG from MIMIC-III contains 46,520 instances of FHIR Patient, 716,595 instances of Condition, 1,063,525 instances of Procedure, 24,934,751 instances of MedicationStatement, 365,181,104 instances of Observation, and 4,779,672 instances of CodeableConcept. Patient counts identified by five pairs of SQL (over the MIMIC database) and SPARQL (over the virtual CKG) queries were identical, ensuring the faithfulness of the data transformation. 
The generated CKG in RDF triples for 100 patients was fully conformant with the FHIR RDF specification. CONCLUSION: The FHIR-Ontop-OMOP system can expose an OMOP database as a FHIR-compliant RDF graph. It provides a meaningful use case demonstrating the potential enabled by interoperability between FHIR and the OMOP CDM. Generated clinical KGs in FHIR RDF provide a semantic foundation to enable explainable AI applications in healthcare.

Subject(s)

Artificial Intelligence , Pattern Recognition, Automated , Data Warehousing , Delivery of Health Care , Electronic Health Records , Humans

11.

Use of Hydroxychloroquine, Remdesivir, and Dexamethasone Among Adults Hospitalized With COVID-19 in the United States : A Retrospective Cohort Study.

Mehta, Hemalkumar B; An, Huijun; Andersen, Kathleen M; Mansour, Omar; Madhira, Vithal; Rashidi, Emaan S; Bates, Benjamin; Setoguchi, Soko; Joseph, Corey; Kocis, Paul T; Moffitt, Richard; Bennett, Tellen D; Chute, Christopher G; Garibaldi, Brian T; Alexander, G Caleb.

Ann Intern Med ; 174(10): 1395-1403, 2021 10.

Article in English | MEDLINE | ID: mdl-34399060

ABSTRACT

BACKGROUND: Relatively little is known about the use patterns of potential pharmacologic treatments of COVID-19 in the United States. OBJECTIVE: To use the National COVID Cohort Collaborative (N3C), a large, multicenter, longitudinal cohort, to characterize the use of hydroxychloroquine, remdesivir, and dexamethasone, overall as well as across individuals, health systems, and time. DESIGN: Retrospective cohort study. SETTING: 43 health systems in the United States. PARTICIPANTS: 137 870 adults hospitalized with COVID-19 between 1 February 2020 and 28 February 2021. MEASUREMENTS: Inpatient use of hydroxychloroquine, remdesivir, or dexamethasone. RESULTS: Among 137 870 persons hospitalized with confirmed or suspected COVID-19, 8754 (6.3%) received hydroxychloroquine, 29 272 (21.2%) remdesivir, and 53 909 (39.1%) dexamethasone during the study period. Since the release of results from the RECOVERY (Randomised Evaluation of COVID-19 Therapy) trial in mid-June, approximately 78% to 84% of people who have had invasive mechanical ventilation have received dexamethasone or other glucocorticoids. The use of hydroxychloroquine increased during March 2020, peaking at 42%, and started declining by April 2020. By contrast, remdesivir and dexamethasone use gradually increased over the study period. Dexamethasone and remdesivir use varied substantially across health centers (intraclass correlation coefficient, 14.2% for dexamethasone and 84.6% for remdesivir). LIMITATION: Because most N3C data contributors are academic medical centers, findings may not reflect the experience of community hospitals. CONCLUSION: Dexamethasone, an evidence-based treatment of COVID-19, may be underused among persons who are mechanically ventilated. The use of remdesivir and dexamethasone varied across health systems, suggesting variation in patient case mix, drug access, treatment protocols, and quality of care. 
PRIMARY FUNDING SOURCE: National Center for Advancing Translational Sciences; National Heart, Lung, and Blood Institute; and National Institute on Aging.

Subject(s)

Adenosine Monophosphate/analogs & derivatives , Alanine/analogs & derivatives , Antiviral Agents/therapeutic use , COVID-19 Drug Treatment , Dexamethasone/therapeutic use , Hydroxychloroquine/therapeutic use , Practice Patterns, Physicians' , Adenosine Monophosphate/therapeutic use , Adolescent , Adult , Aged , Alanine/therapeutic use , Anti-Inflammatory Agents/therapeutic use , COVID-19/therapy , Female , Humans , Male , Middle Aged , Pandemics , Respiration, Artificial , Retrospective Studies , SARS-CoV-2 , United States , Young Adult

12.

Overview of ICD-11 architecture and structure.

Chute, Christopher G; Çelik, Can.

BMC Med Inform Decis Mak ; 21(Suppl 6): 378, 2022 05 16.

Article in English | MEDLINE | ID: mdl-35578335

ABSTRACT

BACKGROUND: The International Classification of Diseases (ICD) has progressed from a short list of causes of death to become the predominant classification of human diseases, syndromes, and conditions around the world. The World Health Organization has now explored how the ICD could be revised to leverage the advances in computer science, ontology, and knowledge representation that had accelerated in the twentieth and early twenty-first centuries. METHODS: Many teams of clinical specialists and domain leaders worked to fundamentally revise the science and knowledge base of ICD-11. Architecturally, the development of ICD-11 was also a fundamental revision. The architecture for ICD-11 proposed in 2007 included three layers: a semantic network of biomedical concepts (Foundation), a traditional tabulation of hierarchical codes that would derive from that network (Linearization), and a formal ontology that would anchor the meaning of terms in the semantic network. Additionally, each entry in the semantic network would have an associated information model of required and optional content (Content Model). RESULTS: This paper describes the innovative architecture developed for ICD-11. CONCLUSION: ICD-11 is a revolutionary transformation of a century-long medical classification that retains its historical rendering and interface while expanding the opportunity for multiple linearizations and underpinning its content with a formally constructed semantic network. The new artifact can enable modern data science and analyses with content encoded with ICD-11.
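The relationship between the Foundation layer and a Linearization can be sketched as a graph-to-tree reduction: the semantic network allows an entity to have several parents, while a tabulation needs exactly one. The entity names, the `PRIMARY_PARENT` choices, and the function names below are hypothetical and are not drawn from ICD-11 content; the sketch only illustrates the derivation idea described in the abstract.

```python
# Foundation-style network: entity -> list of parents (multiple allowed).
# Entity names are hypothetical placeholders.
FOUNDATION = {
    "root": [],
    "infectious": ["root"],
    "nervous_system": ["root"],
    "infection_of_nervous_system": ["infectious", "nervous_system"],
}

# For entities with several parents, a linearization records which one
# is used for tabulation (assumed editorial choice).
PRIMARY_PARENT = {"infection_of_nervous_system": "infectious"}


def linearize(foundation, primary_parent):
    """Derive a single-parent tree from a multi-parent network."""
    tree = {}
    for entity, parents in foundation.items():
        if not parents:
            tree[entity] = None
        elif len(parents) == 1:
            tree[entity] = parents[0]
        else:
            chosen = primary_parent[entity]
            if chosen not in parents:
                raise ValueError(f"{chosen} is not a parent of {entity}")
            tree[entity] = chosen
    return tree


def path_to_root(tree, entity):
    """Follow single parents upward; unique because the result is a tree."""
    path = [entity]
    while tree[path[-1]] is not None:
        path.append(tree[path[-1]])
    return path


if __name__ == "__main__":
    tree = linearize(FOUNDATION, PRIMARY_PARENT)
    print(path_to_root(tree, "infection_of_nervous_system"))
```

A different `PRIMARY_PARENT` table over the same network would yield a different tabulation, which is the sense in which one Foundation can support multiple linearizations.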

Subject(s)

International Classification of Diseases , Knowledge Bases , Humans

13.

Postcoordination of codes in ICD-11.

Mabon, Kristy; Steinum, Olafr; Chute, Christopher G.

BMC Med Inform Decis Mak ; 21(Suppl 6): 379, 2022 05 17.

Article in English | MEDLINE | ID: mdl-35581649

ABSTRACT

A new coding feature introduced with ICD-11, the 11th revision of the International Classification of Diseases (ICD), is postcoordination, which supports combining (linking) two or more codes into a cluster that describes a clinical concept. Postcoordination allows for coded data to be reported to a greater level of specificity than was possible in previous versions of ICD. The linked codes are kept together in a cluster when submitted for reporting. This article presents background detail on the postcoordination feature in ICD and the postcoordination tool. Also presented are several examples that demonstrate the flexibility that ICD-11 provides for enriching coded health information.

Subject(s)

International Classification of Diseases , Humans

14.

ICD-11 extension codes support detailed clinical abstraction and comprehensive classification.

Drösler, Saskia E; Weber, Stefanie; Chute, Christopher G.

BMC Med Inform Decis Mak ; 21(Suppl 6): 278, 2021 11 09.

Article in English | MEDLINE | ID: mdl-34753461

ABSTRACT

BACKGROUND: The new International Classification of Diseases-11th revision (ICD-11) succeeds ICD-10. In the three decades since ICD-10 was released, demands for detailed information on the clinical history of a morbid patient have increased. METHODS: ICD-11 has now implemented an addendum chapter X called "Extension Codes". This chapter contains numerous codes containing information on concepts including disease stage, severity, histopathology, medicaments, and anatomical details. When linked to a stem code representing a clinical state, the extension codes add significant detail and allow for multidimensional coding. RESULTS: This paper discusses the purposes and uses of extension codes and presents three examples of how extension codes can be used in coding clinical detail. CONCLUSION: ICD-11 with its extension codes implemented has the potential to improve precision and evidence-based health care worldwide.

Subject(s)

International Classification of Diseases , Humans

15.

ICD-11: an international classification of diseases for the twenty-first century.

Harrison, James E; Weber, Stefanie; Jakob, Robert; Chute, Christopher G.

BMC Med Inform Decis Mak ; 21(Suppl 6): 206, 2021 11 09.

Article in English | MEDLINE | ID: mdl-34753471

ABSTRACT

BACKGROUND: The International Classification of Diseases (ICD) has long been the main basis for comparability of statistics on causes of mortality and morbidity between places and over time. This paper provides an overview of the recently completed 11th revision of the ICD, focusing on the main innovations and their implications. MAIN TEXT: Changes in content reflect knowledge and perspectives on diseases and their causes that have emerged since ICD-10 was developed about 30 years ago. Changes in design and structure reflect the arrival of the networked digital era, for which ICD-11 has been prepared. ICD-11's information framework comprises a semantic knowledge base (the Foundation), a biomedical ontology linked to the Foundation and classifications derived from the Foundation. ICD-11 for Mortality and Morbidity Statistics (ICD-11-MMS) is the primary derived classification and the main successor to ICD-10. Innovations enabled by the new architecture include an online coding tool (replacing the index and providing additional functions), an application program interface to enable remote access to ICD-11 content and services, enhanced capability to capture and combine clinically relevant characteristics of cases and integrated support for multiple languages. CONCLUSIONS: ICD-11 was adopted by the World Health Assembly in May 2019. Transition to implementation is in progress. ICD-11 can be accessed at icd.who.int.

Subject(s)

Biological Ontologies , International Classification of Diseases , Global Health , Humans , Knowledge Bases

16.

The eMERGE genotype set of 83,717 subjects imputed to ~40 million variants genome wide and association with the herpes zoster medical record phenotype.

Stanaway, Ian B; Hall, Taryn O; Rosenthal, Elisabeth A; Palmer, Melody; Naranbhai, Vivek; Knevel, Rachel; Namjou-Khales, Bahram; Carroll, Robert J; Kiryluk, Krzysztof; Gordon, Adam S; Linder, Jodell; Howell, Kayla Marie; Mapes, Brandy M; Lin, Frederick T J; Joo, Yoonjung Yoonie; Hayes, M Geoffrey; Gharavi, Ali G; Pendergrass, Sarah A; Ritchie, Marylyn D; de Andrade, Mariza; Croteau-Chonka, Damien C; Raychaudhuri, Soumya; Weiss, Scott T; Lebo, Matt; Amr, Sami S; Carrell, David; Larson, Eric B; Chute, Christopher G; Rasmussen-Torvik, Laura Jarmila; Roy-Puckelwartz, Megan J; Sleiman, Patrick; Hakonarson, Hakon; Li, Rongling; Karlson, Elizabeth W; Peterson, Josh F; Kullo, Iftikhar J; Chisholm, Rex; Denny, Joshua Charles; Jarvik, Gail P; Crosslin, David R.

Genet Epidemiol ; 43(1): 63-81, 2019 02.

Article in English | MEDLINE | ID: mdl-30298529

ABSTRACT

The Electronic Medical Records and Genomics (eMERGE) network is a network of medical centers with electronic medical records linked to existing biorepository samples for genomic discovery and genomic medicine research. The network sought to unify the genetic results from 78 Illumina and Affymetrix genotype array batches from 12 contributing medical centers for joint association analysis of 83,717 human participants. In this report, we describe the imputation of eMERGE results and methods to create the unified imputed merged set of genome-wide variant genotype data. We imputed the data using the Michigan Imputation Server, which provides a missing single-nucleotide variant genotype imputation service using the minimac3 imputation algorithm with the Haplotype Reference Consortium genotype reference set. We describe the quality control and filtering steps used in the generation of this data set and suggest generalizable quality thresholds for imputation and phenotype association studies. To test the merged imputed genotype set, we replicated a previously reported chromosome 6 HLA-B herpes zoster (shingles) association and discovered a novel zoster-associated locus in an epigenetic binding site near the terminus of chromosome 3 (3p29).

Subject(s)

Electronic Health Records , Genetic Predisposition to Disease , Genome-Wide Association Study , Herpes Zoster/genetics , Algorithms , Black People/genetics , Chromosomes, Human/genetics , Female , Haplotypes/genetics , Homozygote , Humans , Male , Phenotype , Polymorphism, Single Nucleotide/genetics , Principal Component Analysis , White People/genetics

17.

Probing the Virtual Proteome to Identify Novel Disease Biomarkers.

Mosley, Jonathan D; Benson, Mark D; Smith, J Gustav; Melander, Olle; Ngo, Debby; Shaffer, Christian M; Ferguson, Jane F; Herzig, Matthew S; McCarty, Catherine A; Chute, Christopher G; Jarvik, Gail P; Gordon, Adam S; Palmer, Melody R; Crosslin, David R; Larson, Eric B; Carrell, David S; Kullo, Iftikhar J; Pacheco, Jennifer A; Peissig, Peggy L; Brilliant, Murray H; Kitchner, Terrie E; Linneman, James G; Namjou, Bahram; Williams, Marc S; Ritchie, Marylyn D; Borthwick, Kenneth M; Kiryluk, Krzysztof; Mentch, Frank D; Sleiman, Patrick M; Karlson, Elizabeth W; Verma, Shefali S; Zhu, Yineng; Vasan, Ramachandran S; Yang, Qiong; Denny, Josh C; Roden, Dan M; Gerszten, Robert E; Wang, Thomas J.

Circulation ; 138(22): 2469-2481, 2018 11 27.

Article in English | MEDLINE | ID: mdl-30571344

ABSTRACT

BACKGROUND: Proteomic approaches allow measurement of thousands of proteins in a single specimen, which can accelerate biomarker discovery. However, applying these technologies to massive biobanks is not currently feasible because of the practical barriers and costs of implementing such assays at scale. To overcome these challenges, we used a "virtual proteomic" approach, linking genetically predicted protein levels to clinical diagnoses in >40 000 individuals. METHODS: We used genome-wide association data from the Framingham Heart Study (n=759) to construct genetic predictors for 1129 plasma protein levels. We validated the genetic predictors for 268 proteins and used them to compute predicted protein levels in 41 288 genotyped individuals in the Electronic Medical Records and Genomics (eMERGE) cohort. We tested associations for each predicted protein with 1128 clinical phenotypes. Lead associations were validated with directly measured protein levels and either low-density lipoprotein cholesterol or subclinical atherosclerosis in the MDCS (Malmö Diet and Cancer Study; n=651). RESULTS: In the virtual proteomic analysis in eMERGE, 55 proteins were associated with 89 distinct diagnoses at a false discovery rate q<0.1. Among these, 13 associations involved lipid (n=7) or atherosclerosis (n=6) phenotypes. We tested each association for validation in MDCS using directly measured protein levels. At Bonferroni-adjusted significance thresholds, levels of apolipoprotein E isoforms were associated with hyperlipidemia, and circulating C-type lectin domain family 1 member B and platelet-derived growth factor receptor-β predicted subclinical atherosclerosis. Odds ratios for carotid atherosclerosis were 1.31 (95% CI, 1.08-1.58; P=0.006) per 1-SD increment in C-type lectin domain family 1 member B and 0.79 (0.66-0.94; P=0.008) per 1-SD increment in platelet-derived growth factor receptor-β. CONCLUSIONS: We demonstrate a biomarker discovery paradigm to identify candidate biomarkers of cardiovascular and other diseases.

Subject(s)

Biomarkers/blood , Carotid Artery Diseases/diagnosis , Genome-Wide Association Study , Proteome/analysis , Adult , Aged , Aged, 80 and over , Carotid Artery Diseases/genetics , Female , Genotype , Humans , Lectins, C-Type/analysis , Male , Middle Aged , Odds Ratio , Phenotype , Polymorphism, Single Nucleotide , Proteomics , Receptor, Platelet-Derived Growth Factor beta/blood

18.

Response to Biesecker et al.

Hamosh, Ada; Amberger, Joanna S; Bocchini, Carol A; Bodurtha, Joann; Bult, Carol J; Chute, Christopher G; Cutting, Garry R; Dietz, Harry C; Firth, Helen V; Gibbs, Richard A; Grody, Wayne W; Haendel, Melissa A; Lupski, James R; Posey, Jennifer E; Robinson, Peter N; Schriml, Lynn M; Scott, Alan F; Sobreira, Nara L; Valle, David; Wu, Nan; Rasmussen, Sonja A.

Am J Hum Genet ; 108(9): 1807-1808, 2021 09 02.

Article in English | MEDLINE | ID: mdl-34478655

Subject(s)

Blepharophimosis , Facies , Humans

19.

On beyond Gruber: "Ontologies" in today's biomedical information systems and the limits of OWL.

Rector, Alan; Schulz, Stefan; Rodrigues, Jean Marie; Chute, Christopher G; Solbrig, Harold.

J Biomed Inform ; 100S: 100002, 2019.

Article in English | MEDLINE | ID: mdl-34384571

ABSTRACT

The word "ontology" was introduced to information systems when only closed-world reasoning systems were available. It was "borrowed" from philosophy, but literal links to its philosophical meaning were explicitly disavowed. Since then, open-world reasoning systems based on description logics have been developed, OWL has become a standard, and philosophical issues have been raised. The result has too often been confusion. The question "What statements are ontological" receives a variety of answers. A clearer vocabulary that is better suited to today's information systems is needed. The project to base ICD-11 on a "Common Ontology" required addressing this confusion. This paper sets out to systematise the lessons of that experience and subsequent discussions. We explore the semantics of open-world and closed-world systems. For specifying knowledge bases and software, we propose "invariants" or, more fully, "the first order invariant part of the background domain knowledge base" as an alternative to the words "ontology" and "ontological." We discuss the role and limitations of OWL and description logics and how they are complementary to closed world systems such as frames and to less formal "knowledge organisation systems". We illustrate why the conventions of classifications such as ICD cannot be formulated directly in OWL, but can be linked to OWL knowledge bases by queries. We contend that while OWL and description logics are major advances for representing invariants and terminologies, they must be combined with other technologies to represent broader background knowledge faithfully. The ICD-11 architecture is one approach. We argue that such hybrid architectures can and should be developed further.

20.

Sex, obesity, diabetes, and exposure to particulate matter among patients with severe asthma: Scientific insights from a comparative analysis of open clinical data sources during a five-day hackathon.

Fecho, Karamarie; Ahalt, Stanley C; Arunachalam, Saravanan; Champion, James; Chute, Christopher G; Davis, Sarah; Gersing, Kenneth; Glusman, Gustavo; Hadlock, Jennifer; Lee, Jewel; Pfaff, Emily; Robinson, Max; Sid, Eric; Ta, Casey; Xu, Hao; Zhu, Richard; Zhu, Qian; Peden, David B.

J Biomed Inform ; 100: 103325, 2019 12.

Article in English | MEDLINE | ID: mdl-31676459

ABSTRACT

This special communication describes activities, products, and lessons learned from a recent hackathon that was funded by the National Center for Advancing Translational Sciences via the Biomedical Data Translator program ('Translator'). Specifically, Translator team members self-organized and worked together to conceptualize and execute, over a five-day period, a multi-institutional clinical research study that aimed to examine, using open clinical data sources, relationships between sex, obesity, diabetes, and exposure to airborne fine particulate matter among patients with severe asthma. The goal was to develop a proof of concept that this new model of collaboration and data sharing could effectively produce meaningful scientific results and generate new scientific hypotheses. Three Translator Clinical Knowledge Sources, each of which provides open access (via Application Programming Interfaces) to data derived from the electronic health record systems of major academic institutions, served as the source of study data. Jupyter Python notebooks, shared in GitHub repositories, were used to call the knowledge sources and analyze and integrate the results. The results replicated established or suspected relationships between sex, obesity, diabetes, exposure to airborne fine particulate matter, and severe asthma. In addition, the results demonstrated specific differences across the three Translator Clinical Knowledge Sources, suggesting cohort- and/or environment-specific factors related to the services themselves or the catchment area from which each service derives patient data. Collectively, this special communication demonstrates the power and utility of intense, team-oriented hackathons and offers general technical, organizational, and scientific lessons learned.

Subject(s)

Asthma/physiopathology , Diabetes Mellitus/physiopathology , Environmental Exposure , Information Storage and Retrieval , Obesity/physiopathology , Particulate Matter/toxicity , Sex Factors , Asthma/complications , Female , Humans , Male , Obesity/complications , Severity of Illness Index
