Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
1.
AI Mag ; 43(1): 46-58, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36093122

RESUMO

Knowledge representation and reasoning (KR&R) has been successfully implemented in many fields to enable computers to solve complex problems with AI methods. However, its application to biomedicine has been lagging in part due to the daunting complexity of molecular and cellular pathways that govern human physiology and pathology. In this article we describe concrete uses of SPOKE, an open knowledge network that connects curated information from 37 specialized and human-curated databases into a single property graph, with 3 million nodes and 15 million edges to date. Applications discussed in this article include drug discovery, COVID-19 research and chronic disease diagnosis and management.

2.
Bioinformatics ; 35(21): 4515-4518, 2019 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-31214700

RESUMO

MOTIVATION: Electronic health records (EHRs) are quickly becoming omnipresent in healthcare, but interoperability issues and technical demands limit their use for biomedical and clinical research. Interactive and flexible software that interfaces directly with EHR data structured around a common data model (CDM) could accelerate more EHR-based research by making the data more accessible to researchers who lack computational expertise and/or domain knowledge. RESULTS: We present PatientExploreR, an extensible application built on the R/Shiny framework that interfaces with a relational database of EHR data in the Observational Medical Outcomes Partnership CDM format. PatientExploreR produces patient-level interactive and dynamic reports and facilitates visualization of clinical data without any programming required. It allows researchers to easily construct and export patient cohorts from the EHR for analysis with other software. This application could enable easier exploration of patient-level data for physicians and researchers. PatientExploreR can incorporate EHR data from any institution that employs the CDM for users with approved access. The software code is free and open source under the MIT license, enabling institutions to install and users to expand and modify the application for their own purposes. AVAILABILITY AND IMPLEMENTATION: PatientExploreR can be freely obtained from GitHub: https://github.com/BenGlicksberg/PatientExploreR. We provide instructions for how researchers with approved access to their institutional EHR can use this package. We also release an open sandbox server of synthesized patient data for users without EHR access to explore: http://patientexplorer.ucsf.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Registros Eletrônicos de Saúde , Software , Computadores , Bases de Dados Factuais , Humanos , Estudos Observacionais como Assunto
3.
Cell Rep Med ; 5(1): 101350, 2024 01 16.
Artigo em Inglês | MEDLINE | ID: mdl-38134931

RESUMO

Every year, 11% of infants are born preterm with significant health consequences, with the vaginal microbiome a risk factor for preterm birth. We crowdsource models to predict (1) preterm birth (PTB; <37 weeks) or (2) early preterm birth (ePTB; <32 weeks) from 9 vaginal microbiome studies representing 3,578 samples from 1,268 pregnant individuals, aggregated from public raw data via phylogenetic harmonization. The predictive models are validated on two independent unpublished datasets representing 331 samples from 148 pregnant individuals. The top-performing models (among 148 and 121 submissions from 318 teams) achieve area under the receiver operator characteristic (AUROC) curve scores of 0.69 and 0.87 predicting PTB and ePTB, respectively. Alpha diversity, VALENCIA community state types, and composition are important features in the top-performing models, most of which are tree-based methods. This work is a model for translation of microbiome data into clinically relevant predictive models and to better understand preterm birth.


Assuntos
Crowdsourcing , Microbiota , Nascimento Prematuro , Gravidez , Feminino , Recém-Nascido , Humanos , Filogenia , Vagina , Microbiota/genética
4.
JAMIA Open ; 6(3): ooad045, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-37416449

RESUMO

Objectives: Clinical notes are a veritable treasure trove of information on a patient's disease progression, medical history, and treatment plans, yet are locked in secured databases accessible for research only after extensive ethics review. Removing personally identifying and protected health information (PII/PHI) from the records can reduce the need for additional Institutional Review Boards (IRB) reviews. In this project, our goals were to: (1) develop a robust and scalable clinical text de-identification pipeline that is compliant with the Health Insurance Portability and Accountability Act (HIPAA) Privacy Rule for de-identification standards and (2) share routinely updated de-identified clinical notes with researchers. Materials and Methods: Building on our open-source de-identification software called Philter, we added features to: (1) make the algorithm and the de-identified data HIPAA compliant, which also implies type 2 error-free redaction, as certified via external audit; (2) reduce over-redaction errors; and (3) normalize and shift date PHI. We also established a streamlined de-identification pipeline using MongoDB to automatically extract clinical notes and provide truly de-identified notes to researchers with periodic monthly refreshes at our institution. Results: To the best of our knowledge, the Philter V1.0 pipeline is currently the first and only certified, de-identified redaction pipeline that makes clinical notes available to researchers for nonhuman subjects' research, without further IRB approval needed. To date, we have made over 130 million certified de-identified clinical notes available to over 600 UCSF researchers. These notes were collected over the past 40 years, and represent data from 2757016 UCSF patients.

5.
medRxiv ; 2023 Apr 05.
Artigo em Inglês | MEDLINE | ID: mdl-36993193

RESUMO

The vaginal microbiome has been shown to be associated with pregnancy outcomes including preterm birth (PTB) risk. Here we present VMAP: Vaginal Microbiome Atlas during Pregnancy (http://vmapapp.org), an application to visualize features of 3,909 vaginal microbiome samples of 1,416 pregnant individuals from 11 studies, aggregated from raw public and newly generated sequences via an open-source tool, MaLiAmPi. Our visualization tool (http://vmapapp.org) includes microbial features such as various measures of diversity, VALENCIA community state types (CST), and composition (via phylotypes and taxonomy). This work serves as a resource for the research community to further analyze and visualize vaginal microbiome data in order to better understand both healthy term pregnancies and those associated with adverse outcomes.

6.
Genome Biol ; 23(1): 15, 2022 01 10.
Artigo em Inglês | MEDLINE | ID: mdl-35012625

RESUMO

BACKGROUND: Angiotensin-converting enzyme 2 (ACE2) is the cell-entry receptor for SARS-CoV-2. It plays critical roles in both the transmission and the pathogenesis of COVID-19. Comprehensive profiling of ACE2 expression patterns could reveal risk factors of severe COVID-19 illness. While the expression of ACE2 in healthy human tissues has been well characterized, it is not known which diseases and drugs might be associated with ACE2 expression. RESULTS: We develop GENEVA (GENe Expression Variance Analysis), a semi-automated framework for exploring massive amounts of RNA-seq datasets. We apply GENEVA to 286,650 publicly available RNA-seq samples to identify any previously studied experimental conditions that could be directly or indirectly associated with ACE2 expression. We identify multiple drugs, genetic perturbations, and diseases that are associated with the expression of ACE2, including cardiomyopathy, HNF1A overexpression, and drug treatments with RAD140 and itraconazole. Our joint analysis of seven datasets confirms ACE2 upregulation in all cardiomyopathy categories. Using electronic health records data from 3936 COVID-19 patients, we demonstrate that patients with pre-existing cardiomyopathy have an increased mortality risk than age-matched patients with other cardiovascular conditions. GENEVA is applicable to any genes of interest and is freely accessible at http://genevatool.org . CONCLUSIONS: This study identifies multiple diseases and drugs that are associated with the expression of ACE2. The effect of these conditions should be carefully studied in COVID-19 patients. In particular, our analysis identifies cardiomyopathy patients as a high-risk group, with increased ACE2 expression in the heart and increased mortality after SARS-COV-2 infection.


Assuntos
Enzima de Conversão de Angiotensina 2/isolamento & purificação , Enzima de Conversão de Angiotensina 2/metabolismo , COVID-19/mortalidade , Cardiomiopatias/metabolismo , Doenças Cardiovasculares , Adulto , Idoso , Enzima de Conversão de Angiotensina 2/genética , Feminino , Expressão Gênica , Humanos , Masculino , Pessoa de Meia-Idade , Mortalidade , RNA-Seq , Fatores de Risco , SARS-CoV-2
7.
Nat Commun ; 13(1): 675, 2022 02 03.
Artigo em Inglês | MEDLINE | ID: mdl-35115528

RESUMO

Alzheimer's Disease (AD) is a neurodegenerative disorder that is still not fully understood. Sex modifies AD vulnerability, but the reasons for this are largely unknown. We utilize two independent electronic medical record (EMR) systems across 44,288 patients to perform deep clinical phenotyping and network analysis to gain insight into clinical characteristics and sex-specific clinical associations in AD. Embeddings and network representation of patient diagnoses demonstrate greater comorbidity interactions in AD in comparison to matched controls. Enrichment analysis identifies multiple known and new diagnostic, medication, and lab result associations across the whole cohort and in a sex-stratified analysis. With this data-driven method of phenotyping, we can represent AD complexity and generate hypotheses of clinical factors that can be followed-up for further diagnostic and predictive analyses, mechanistic understanding, or drug repurposing and therapeutic approaches.


Assuntos
Doença de Alzheimer/diagnóstico , Doença de Alzheimer/tratamento farmacológico , Bases de Dados Factuais/estatística & dados numéricos , Registros Eletrônicos de Saúde/estatística & dados numéricos , Idoso , Idoso de 80 Anos ou mais , Doença de Alzheimer/epidemiologia , California/epidemiologia , Distribuição de Qui-Quadrado , Estudos de Coortes , Comorbidade , Feminino , Humanos , Masculino , Transtornos Mentais/epidemiologia , Doenças Musculoesqueléticas/epidemiologia , Doenças do Sistema Nervoso/epidemiologia , New York/epidemiologia , Fenótipo , Fatores Sexuais , Doenças Vasculares/epidemiologia
8.
JAMA Netw Open ; 4(11): e2133090, 2021 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-34779847

RESUMO

Importance: Antidepressant use may be associated with reduced levels of several proinflammatory cytokines suggested to be involved with the development of severe COVID-19. An association between the use of selective serotonin reuptake inhibitors (SSRIs)-specifically fluoxetine hydrochloride and fluvoxamine maleate-with decreased mortality among patients with COVID-19 has been reported in recent studies; however, these studies had limited power due to their small size. Objective: To investigate the association of SSRIs with outcomes in patients with COVID-19 by analyzing electronic health records (EHRs). Design, Setting, and Participants: This retrospective cohort study used propensity score matching by demographic characteristics, comorbidities, and medication indication to compare SSRI-treated patients with matched control patients not treated with SSRIs within a large EHR database representing a diverse population of 83 584 patients diagnosed with COVID-19 from January to September 2020 and with a duration of follow-up of as long as 8 months in 87 health care centers across the US. Exposures: Selective serotonin reuptake inhibitors and specifically (1) fluoxetine, (2) fluoxetine or fluvoxamine, and (3) other SSRIs (ie, not fluoxetine or fluvoxamine). Main Outcomes and Measures: Death. Results: A total of 3401 adult patients with COVID-19 prescribed SSRIs (2033 women [59.8%]; mean [SD] age, 63.8 [18.1] years) were identified, with 470 receiving fluoxetine only (280 women [59.6%]; mean [SD] age, 58.5 [18.1] years), 481 receiving fluoxetine or fluvoxamine (285 women [59.3%]; mean [SD] age, 58.7 [18.0] years), and 2898 receiving other SSRIs (1733 women [59.8%]; mean [SD] age, 64.7 [18.0] years) within a defined time frame. When compared with matched untreated control patients, relative risk (RR) of mortality was reduced among patients prescribed any SSRI (497 of 3401 [14.6%] vs 1130 of 6802 [16.6%]; RR, 0.92 [95% CI, 0.85-0.99]; adjusted P = .03); fluoxetine (46 of 470 [9.8%] vs 937 of 7050 [13.3%]; RR, 0.72 [95% CI, 0.54-0.97]; adjusted P = .03); and fluoxetine or fluvoxamine (48 of 481 [10.0%] vs 956 of 7215 [13.3%]; RR, 0.74 [95% CI, 0.55-0.99]; adjusted P = .04). The association between receiving any SSRI that is not fluoxetine or fluvoxamine and risk of death was not statistically significant (447 of 2898 [15.4%] vs 1474 of 8694 [17.0%]; RR, 0.92 [95% CI, 0.84-1.00]; adjusted P = .06). Conclusions and Relevance: These results support evidence that SSRIs may be associated with reduced severity of COVID-19 reflected in the reduced RR of mortality. Further research and randomized clinical trials are needed to elucidate the effect of SSRIs generally, or more specifically of fluoxetine and fluvoxamine, on the severity of COVID-19 outcomes.


Assuntos
Antidepressivos , COVID-19/mortalidade , Fluoxetina , Fluvoxamina , Inibidores Seletivos de Recaptação de Serotonina , Índice de Gravidade de Doença , Adulto , Idoso , Antidepressivos/farmacologia , COVID-19/metabolismo , Citalopram/farmacologia , Citocinas/metabolismo , Feminino , Fluoxetina/farmacologia , Fluvoxamina/farmacologia , Humanos , Masculino , Pessoa de Meia-Idade , Medicamentos sob Prescrição , Estudos Retrospectivos , Risco , SARS-CoV-2 , Inibidores Seletivos de Recaptação de Serotonina/farmacologia , Sertralina , Estados Unidos
9.
medRxiv ; 2020 May 02.
Artigo em Inglês | MEDLINE | ID: mdl-32511606

RESUMO

Management of the COVID-19 pandemic has proven to be a significant challenge to policy makers. This is in large part due to uneven reporting and the absence of open-access visualization tools to present local trends and infer healthcare needs. Here we report the development of CovidCounties.org, an interactive web application that depicts daily disease trends at the level of US counties using time series plots and maps. This application is accompanied by a manually curated dataset that catalogs all major public policy actions made at the state-level, as well as technical validation of the primary data. Finally, the underlying code for the site is also provided as open source, enabling others to validate and learn from this work.

10.
Sci Data ; 7(1): 405, 2020 11 16.
Artigo em Inglês | MEDLINE | ID: mdl-33199721

RESUMO

Management of the COVID-19 pandemic has proven to be a significant challenge to policy makers. This is in large part due to uneven reporting and the absence of open-access visualization tools to present local trends and infer healthcare needs. Here we report the development of CovidCounties.org, an interactive web application that depicts daily disease trends at the level of US counties using time series plots and maps. This application is accompanied by a manually curated dataset that catalogs all major public policy actions made at the state-level, as well as technical validation of the primary data. Finally, the underlying code for the site is also provided as open source, enabling others to validate and learn from this work.


Assuntos
Infecções por Coronavirus/epidemiologia , Pneumonia Viral/epidemiologia , Software , Betacoronavirus , COVID-19 , Curadoria de Dados/métodos , Conjuntos de Dados como Assunto , Humanos , Internet , Pandemias , SARS-CoV-2 , Estados Unidos/epidemiologia
11.
NPJ Digit Med ; 3: 57, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32337372

RESUMO

There is a great and growing need to ascertain what exactly is the state of a patient, in terms of disease progression, actual care practices, pathology, adverse events, and much more, beyond the paucity of data available in structured medical record data. Ascertaining these harder-to-reach data elements is now critical for the accurate phenotyping of complex traits, detection of adverse outcomes, efficacy of off-label drug use, and longitudinal patient surveillance. Clinical notes often contain the most detailed and relevant digital information about individual patients, the nuances of their diseases, the treatment strategies selected by physicians, and the resulting outcomes. However, notes remain largely unused for research because they contain Protected Health Information (PHI), which is synonymous with individually identifying data. Previous clinical note de-identification approaches have been rigid and still too inaccurate to see any substantial real-world use, primarily because they have been trained with too small medical text corpora. To build a new de-identification tool, we created the largest manually annotated clinical note corpus for PHI and develop a customizable open-source de-identification software called Philter ("Protected Health Information filter"). Here we describe the design and evaluation of Philter, and show how it offers substantial real-world improvements over prior methods.

12.
JAMA Netw Open ; 2(3): e190606, 2019 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-30874779

RESUMO

Importance: Knowing the future condition of a patient would enable a physician to customize current therapeutic options to prevent disease worsening, but predicting that future condition requires sophisticated modeling and information. If artificial intelligence models were capable of forecasting future patient outcomes, they could be used to aid practitioners and patients in prognosticating outcomes or simulating potential outcomes under different treatment scenarios. Objective: To assess the ability of an artificial intelligence system to prognosticate the state of disease activity of patients with rheumatoid arthritis (RA) at their next clinical visit. Design, Setting, and Participants: This prognostic study included 820 patients with RA from rheumatology clinics at 2 distinct health care systems with different electronic health record platforms: a university hospital (UH) and a public safety-net hospital (SNH). The UH and SNH had substantially different patient populations and treatment patterns. The UH has records on approximately 1 million total patients starting in January 2012. The UH data for this study were accessed on July 1, 2017. The SNH has records on 65 000 unique individuals starting in January 2013. The SNH data for the study were collected on February 27, 2018. Exposures: Structured data were extracted from the electronic health record, including exposures (medications), patient demographics, laboratories, and prior measures of disease activity. A longitudinal deep learning model was used to predict disease activity for patients with RA at their next rheumatology clinic visit and to evaluate interhospital performance and model interoperability strategies. Main Outcomes and Measures: Model performance was quantified using the area under the receiver operating characteristic curve (AUROC). Disease activity in RA was measured using a composite index score. Results: A total of 578 UH patients (mean [SD] age, 57 [15] years; 477 [82.5%] female; 296 [51.2%] white) and 242 SNH patients (mean [SD] age, 60 [15] years; 195 [80.6%] female; 30 [12.4%] white) were included in the study. Patients at the UH compared with those at the SNH were seen more frequently (median time between visits, 100 vs 180 days) and were more frequently prescribed higher-class medications (biologics) (364 [63.0%] vs 70 [28.9%]). At the UH, the model reached an AUROC of 0.91 (95% CI, 0.86-0.96) in a test cohort of 116 patients. The UH-trained model had an AUROC of 0.74 (95% CI, 0.65-0.83) in the SNH test cohort (n = 117) despite marked differences in the patient populations. In both settings, baseline prediction using each patients' most recent disease activity score had statistically random performance. Conclusions and Relevance: The findings suggest that building accurate models to forecast complex disease outcomes using electronic health record data is possible and these models can be shared across hospitals with diverse patient populations.


Assuntos
Artrite Reumatoide/diagnóstico , Artrite Reumatoide/epidemiologia , Aprendizado Profundo , Diagnóstico por Computador/métodos , Registros Eletrônicos de Saúde/classificação , Adulto , Idoso , Estudos de Coortes , Feminino , Previsões , Humanos , Masculino , Pessoa de Meia-Idade , Prognóstico
13.
JAMIA Open ; 2(1): 10-14, 2019 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-31633087

RESUMO

OBJECTIVES: Electronic health record (EHR) data are increasingly used for biomedical discoveries. The nature of the data, however, requires expertise in both data science and EHR structure. The Observational Medical Out-comes Partnership (OMOP) common data model (CDM) standardizes the language and structure of EHR data to promote interoperability of EHR data for research. While the OMOP CDM is valuable and more attuned to research purposes, it still requires extensive domain knowledge to utilize effectively, potentially limiting more widespread adoption of EHR data for research and quality improvement. MATERIALS AND METHODS: We have created ROMOP: an R package for direct interfacing with EHR data in the OMOP CDM format. RESULTS: ROMOP streamlines typical EHR-related data processes. Its functions include exploration of data types, extraction and summarization of patient clinical and demographic data, and patient searches using any CDM vocabulary concept. CONCLUSION: ROMOP is freely available under the Massachusetts Institute of Technology (MIT) license and can be obtained from GitHub (http://github.com/BenGlicksberg/ROMOP). We detail instructions for setup and use in the Supplementary Materials. Additionally, we provide a public sandbox server containing synthesized clinical data for users to explore OMOP data and ROMOP (http://romop.ucsf.edu).

14.
J Appl Lab Med ; 3(3): 366-377, 2018 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-33636914

RESUMO

BACKGROUND: The results of clinical laboratory tests are an essential component of medical decision-making. To guide interpretation, test results are returned with reference intervals defined by the range in which the central 95% of values occur in healthy individuals. Clinical laboratories often set their own reference intervals to accommodate variation in local population and instrumentation. For some tests, reference intervals change as a function of sex, age, and self-identified race and ethnicity. METHODS: In this work, we develop a novel approach, which leverages electronic health record data, to identify healthy individuals and tests for differences in laboratory test values between populations. RESULTS: We found that the distributions of >50% of laboratory tests with currently fixed reference intervals differ among self-identified racial and ethnic groups (SIREs) in healthy individuals. CONCLUSIONS: Our results confirm the known SIRE-specific differences in creatinine and suggest that more research needs to be done to determine the clinical implications of using one-size-fits-all reference intervals for other tests with SIRE-specific distributions.

15.
Nat Commun ; 8(1): 1077, 2017 10 20.
Artigo em Inglês | MEDLINE | ID: mdl-29057876

RESUMO

Histologically normal tissue adjacent to the tumor (NAT) is commonly used as a control in cancer studies. However, little is known about the transcriptomic profile of NAT, how it is influenced by the tumor, and how the profile compares with non-tumor-bearing tissues. Here, we integrate data from the Genotype-Tissue Expression project and The Cancer Genome Atlas to comprehensively analyze the transcriptomes of healthy, NAT, and tumor tissues in 6506 samples across eight tissues and corresponding tumor types. Our analysis shows that NAT presents a unique intermediate state between healthy and tumor. Differential gene expression and protein-protein interaction analyses reveal altered pathways shared among NATs across tissue types. We characterize a set of 18 genes that are specifically activated in NATs. By applying pathway and tissue composition analyses, we suggest a pan-cancer mechanism of pro-inflammatory signals from the tumor stimulates an inflammatory response in the adjacent endothelium.


Assuntos
Neoplasias/genética , Transcriptoma/genética , Perfilação da Expressão Gênica , Regulação da Expressão Gênica de Plantas/genética , Humanos , RNA Interferente Pequeno/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA