Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 80
Filtrar
Más filtros

Bases de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
J Med Syst ; 47(1): 65, 2023 May 17.
Artículo en Inglés | MEDLINE | ID: mdl-37195430

RESUMEN

Graph data models are an emerging approach to structure clinical and biomedical information. These models offer intriguing opportunities for novel approaches in healthcare, such as disease phenotyping, risk prediction, and personalized precision care. The combination of data and information in a graph model to create knowledge graphs has rapidly expanded in biomedical research, but the integration of real-world data from the electronic health record has been limited. To broadly apply knowledge graphs to EHR and other real-world data, a deeper understanding of how to represent these data in a standardized graph model is needed. We provide an overview of the state-of-the-art research for clinical and biomedical data integration and summarize the potential to accelerate healthcare and precision medicine research through insight generation from integrated knowledge graphs.


Asunto(s)
Algoritmos , Investigación Biomédica , Humanos , Reconocimiento de Normas Patrones Automatizadas , Fenotipo , Medicina de Precisión
2.
J Manipulative Physiol Ther ; 45(9): 615-622, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-37294219

RESUMEN

OBJECTIVE: The purpose of this study was to determine whether patient characteristics were associated with face-to-face (F2F) and telehealth visits for those receiving chiropractic care for musculoskeletal conditions in the US Veterans Health Administration (VHA) during the COVID-19 pandemic. METHODS: A retrospective cross-sectional analysis of all patients (veterans, dependents, and spouses) who received chiropractic care nationwide at the VHA from March 1, 2020, to February 28, 2021, was performed. Patients were allocated into 1 of the following 3 groups: only telehealth visits, only F2F visits, and combined F2F and telehealth visits. Patient characteristics included age, sex, race, ethnicity, marital status, and Charlson Comorbidity Index. Multinomial logistic regression estimated associations of these variables with visit type. RESULTS: The total number of unique patients seen by chiropractors between March 2020 and February 2021 was 62 658. Key findings were that patients of non-White race and Hispanic or Latino ethnicity were more likely to attend telehealth-only visits (Black [odds ratio 1.20, 95% confidence interval {1.10-1.31}], other races [1.36 {1.16-1.59}], and Hispanic or Latino [1.35 {1.20-1.52}]) and combination telehealth and F2F care (Black [1.32 {1.25-1.40}], other races [1.37 {1.23-1.52}], and Hispanic or Latino [1.63 {1.51-1.76}]). Patients younger than 40 years of age were more likely to choose telehealth visits ([1.13 {1.02-1.26}], 66-75 years [1.17 {1.01-1.35}], and >75 years [1.26 {1.06-1.51}] vs those 40-55 years of age). Sex, visit frequency, and Charlson Comorbidity Index showed significant relationships as well, while marital status did not. CONCLUSION: During the COVID-19 pandemic, VHA patients with musculoskeletal complaints using chiropractic telehealth were more ethnically and racially diverse than those using F2F care alone.


Asunto(s)
COVID-19 , Quiropráctica , Telemedicina , Humanos , Anciano , COVID-19/epidemiología , Estudios Transversales , Pandemias , Estudios Retrospectivos , Salud de los Veteranos
3.
BMC Bioinformatics ; 22(Suppl 9): 105, 2021 Aug 25.
Artículo en Inglés | MEDLINE | ID: mdl-34433410

RESUMEN

BACKGROUND: Many systems biology studies leverage the integration of multiple data types (across different data sources) to offer a more comprehensive view of the biological system being studied. While SQL (Structured Query Language) databases are popular in the biomedical domain, NoSQL database technologies have been used as a more relationship-based, flexible and scalable method of data integration. RESULTS: We have created a graph database integrating data from multiple sources. In addition to using a graph-based query language (Cypher) for data retrieval, we have developed a web-based dashboard that allows users to easily browse and plot data without the need to learn Cypher. We have also implemented a visual graph query interface for users to browse graph data. Finally, we have built a prototype to allow the user to query the graph database in natural language. CONCLUSION: We have demonstrated the feasibility and flexibility of using a graph database for storing and querying immunological data with complex biological relationships. Querying a graph database through such relationships has the potential to discover novel relationships among heterogeneous biological data and metadata.


Asunto(s)
Almacenamiento y Recuperación de la Información , Web Semántica , Bases de Datos Factuales , Lenguaje , Biología de Sistemas
4.
Am J Med Genet B Neuropsychiatr Genet ; 183(3): 181-194, 2020 04.
Artículo en Inglés | MEDLINE | ID: mdl-31872970

RESUMEN

Cognitive impairment is a frequent and serious problem in patients with various forms of severe mental illnesses (SMI), including schizophrenia (SZ) and bipolar disorder (BP). Recent research suggests genetic links to several cognitive phenotypes in both SMI and in the general population. Our goal in this study was to identify potential genomic signatures of cognitive functioning in veterans with severe mental illness and compare them to previous findings for cognition across different populations. Veterans Affairs (VA) Cooperative Studies Program (CSP) Study #572 evaluated cognitive and functional capacity measures among SZ and BP patients. In conjunction with the VA Million Veteran Program, 3,959 European American (1,095 SZ, 2,864 BP) and 2,601 African American (1,095 SZ, 2,864 BP) patients were genotyped using a custom Affymetrix Axiom Biobank array. We performed a genome-wide association study of global cognitive functioning, constructed polygenic scores for SZ and cognition in the general population, and examined genetic correlations with 2,626 UK Biobank traits. Although no single locus attained genome-wide significance, observed allelic effects were strongly consistent with previous studies. We observed robust associations between global cognitive functioning and polygenic scores for cognitive performance, intelligence, and SZ risk. We also identified significant genetic correlations with several cognition-related traits in UK Biobank. In a diverse cohort of U.S. veterans with SZ or BP, we demonstrate broad overlap of common genetic effects on cognition in the general population, and find that greater polygenic loading for SZ risk is associated with poorer cognitive performance.


Asunto(s)
Trastorno Bipolar/genética , Trastornos del Conocimiento/genética , Cognición , Estudio de Asociación del Genoma Completo , Esquizofrenia/genética , Adulto , Anciano , Alelos , Trastorno Bipolar/fisiopatología , Trastornos del Conocimiento/fisiopatología , Femenino , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Pruebas Neuropsicológicas , Análisis de Secuencia por Matrices de Oligonucleótidos , Polimorfismo de Nucleótido Simple , Esquizofrenia/fisiopatología , Estados Unidos , United States Department of Veterans Affairs , Veteranos
5.
BMC Bioinformatics ; 20(Suppl 5): 182, 2019 Apr 25.
Artículo en Inglés | MEDLINE | ID: mdl-31272390

RESUMEN

BACKGROUND: Human immunology studies often rely on the isolation and quantification of cell populations from an input sample based on flow cytometry and related techniques. Such techniques classify cells into populations based on the detection of a pattern of markers. The description of the cell populations targeted in such experiments typically have two complementary components: the description of the cell type targeted (e.g. 'T cells'), and the description of the marker pattern utilized (e.g. CD14-, CD3+). RESULTS: We here describe our attempts to use ontologies to cross-compare cell types and marker patterns (also referred to as gating definitions). We used a large set of such gating definitions and corresponding cell types submitted by different investigators into ImmPort, a central database for immunology studies, to examine the ability to parse gating definitions using terms from the Protein Ontology (PRO) and cell type descriptions, using the Cell Ontology (CL). We then used logical axioms from CL to detect discrepancies between the two. CONCLUSIONS: We suggest adoption of our proposed format for describing gating and cell type definitions to make comparisons easier. We also suggest a number of new terms to describe gating definitions in flow cytometry that are not based on molecular markers captured in PRO, but on forward- and side-scatter of light during data acquisition, which is more appropriate to capture in the Ontology for Biomedical Investigations (OBI). Finally, our approach results in suggestions on what logical axioms and new cell types could be considered for addition to the Cell Ontology.


Asunto(s)
Ontologías Biológicas , Bases de Datos Factuales , Humanos , Sistema Inmunológico/metabolismo , Subunidades de Proteína/metabolismo , Proteínas/metabolismo
6.
Am J Public Health ; 109(1): 113-115, 2019 01.
Artículo en Inglés | MEDLINE | ID: mdl-30496002

RESUMEN

The Share Project (TSP), a US health justice initiative, convened key stakeholders to advance the use of inclusive research methods and data sharing to engage groups that are typically marginalized from research. TSP trained justice-involved patients, community health workers, policymakers, and researchers in participatory research and the use of a data-sharing platform developed with justice-involved patients. The platform allowed users to analyze health and criminal justice data to develop new research that is patient driven and responsive to the needs of providers.


Asunto(s)
Creación de Capacidad , Investigación Participativa Basada en la Comunidad , Atención a la Salud/organización & administración , Prisioneros/psicología , Humanos , Difusión de la Información , Formulación de Políticas
7.
Epilepsy Behav ; 97: 197-205, 2019 08.
Artículo en Inglés | MEDLINE | ID: mdl-31252279

RESUMEN

OBJECTIVES: Coordination of multidisciplinary care is critical to address the complex needs of people with neurological disorders; however, quality improvement and research tools to measure coordination of neurological care are not well-developed. This study explored and compared the value of social network analysis (SNA) and relational coordination (RC) in measuring coordination of care in a neurology setting. The Department of Veterans Affairs Healthcare System (VA) established an Epilepsy Centers of Excellence (ECOE) hub and spoke model of care, which provides a setting to measure coordination of care across networks of providers. METHODS: In a parallel mixed methods approach, we compared coordination of care of VA providers who formally engage the ECOE system to VA providers outside the ECOE system using SNA and RC. Coordination of care scores were compiled from provider teams across 66 VA facilities, and key informant interviews of 80 epilepsy care team members were conducted concurrently to describe the quality of epilepsy care coordinating in the VA healthcare system. RESULTS: On average, members of healthcare teams affiliated with the ECOE program rated quality of communication and respect higher than non-ECOE physicians. Connectivity between neurologist and primary care providers as well as between neurologists and mental health providers were higher within ECOE hub facilities compared to spoke referring facilities. Key informant interviews reported the important role of formal and informal programming, social support and social capital, and social influence on epilepsy care networks. CONCLUSION: For quality improvement and research purposes, SNA and RC can be used to measure coordination of neurological care; RC provides a detailed assessment of the quality of communication within and across healthcare teams but is difficult to administer and analyze; SNA provides large scale coordination of care maps and metrics to compare across large healthcare systems. The two measures provide complimentary coordination of care data at a local as well as population level. Interviews describe the mechanisms of developing and sustaining health professional networks that are not captured in either SNA or RC measures.


Asunto(s)
Epilepsia/terapia , Grupo de Atención al Paciente/organización & administración , Red Social , Prestación Integrada de Atención de Salud/organización & administración , Personal de Salud , Servicios de Salud , Hospitales de Veteranos , Humanos , Modelos Organizacionales , Neurólogos , Derivación y Consulta , Apoyo Social , Encuestas y Cuestionarios , Estados Unidos , United States Department of Veterans Affairs
8.
BMC Bioinformatics ; 19(1): 268, 2018 07 16.
Artículo en Inglés | MEDLINE | ID: mdl-30012108

RESUMEN

BACKGROUND: Public biomedical data repositories often provide web-based interfaces to collect experimental metadata. However, these interfaces typically reflect the ad hoc metadata specification practices of the associated repositories, leading to a lack of standardization in the collected metadata. This lack of standardization limits the ability of the source datasets to be broadly discovered, reused, and integrated with other datasets. To increase reuse, discoverability, and reproducibility of the described experiments, datasets should be appropriately annotated by using agreed-upon terms, ideally from ontologies or other controlled term sources. RESULTS: This work presents "CEDAR OnDemand", a browser extension powered by the NCBO (National Center for Biomedical Ontology) BioPortal that enables users to seamlessly enter ontology-based metadata through existing web forms native to individual repositories. CEDAR OnDemand analyzes the web page contents to identify the text input fields and associate them with relevant ontologies which are recommended automatically based upon input fields' labels (using the NCBO ontology recommender) and a pre-defined list of ontologies. These field-specific ontologies are used for controlling metadata entry. CEDAR OnDemand works for any web form designed in the HTML format. We demonstrate how CEDAR OnDemand works through the NCBI (National Center for Biotechnology Information) BioSample web-based metadata entry. CONCLUSION: CEDAR OnDemand helps lower the barrier of incorporating ontologies into standardized metadata entry for public data repositories. CEDAR OnDemand is available freely on the Google Chrome store https://chrome.google.com/webstore/search/CEDAROnDemand.


Asunto(s)
Ontologías Biológicas , Internet , Metadatos , Programas Informáticos , Algoritmos , Humanos
9.
Genet Epidemiol ; 41(2): 152-162, 2017 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-28019059

RESUMEN

A key step in genomic studies is to assess high throughput measurements across millions of markers for each participant's DNA, either using microarrays or sequencing techniques. Accurate genotype calling is essential for downstream statistical analysis of genotype-phenotype associations, and next generation sequencing (NGS) has recently become a more common approach in genomic studies. How the accuracy of variant calling in NGS-based studies affects downstream association analysis has not, however, been studied using empirical data in which both microarrays and NGS were available. In this article, we investigate the impact of variant calling errors on the statistical power to identify associations between single nucleotides and disease, and on associations between multiple rare variants and disease. Both differential and nondifferential genotyping errors are considered. Our results show that the power of burden tests for rare variants is strongly influenced by the specificity in variant calling, but is rather robust with regard to sensitivity. By using the variant calling accuracies estimated from a substudy of a Cooperative Studies Program project conducted by the Department of Veterans Affairs, we show that the power of association tests is mostly retained with commonly adopted variant calling pipelines. An R package, GWAS.PC, is provided to accommodate power analysis that takes account of genotyping errors (http://zhaocenter.org/software/).


Asunto(s)
Trastorno Bipolar/genética , Interpretación Estadística de Datos , Estudios de Asociación Genética , Genómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/normas , Polimorfismo de Nucleótido Simple/genética , Esquizofrenia/genética , Algoritmos , Estudios de Casos y Controles , Marcadores Genéticos/genética , Genotipo , Humanos , Control de Calidad
10.
Epilepsy Behav ; 73: 31-35, 2017 08.
Artículo en Inglés | MEDLINE | ID: mdl-28605631

RESUMEN

OBJECTIVE: The study sought to quantify coordination of epilepsy care, over time, between neurologists and other health care providers using social network analysis (SNA). METHODS: The Veterans Health Administration (VA) instituted an Epilepsy Center of Excellence (ECOE) model in 2008 to enhance care coordination between neurologists and other health care providers. Provider networks in the 16 VA ECOE facilities (hub sites) were compared to a subset of 33 VA facilities formally affiliated (consortium sites) and 14 unaffiliated VA facilities. The number of connections between neurologists and each provider (node degree) was measured by shared epilepsy patients and tallied to generate estimates at the facility level separately within and across facilities. Mixed models were used to compare change of facility-level node degree over time across the three facility types, adjusted for number of providers per facility. RESULTS: Over the time period 2000-2013, epilepsy care coordination both within and across facilities significantly increased. These increases were seen in all three types of facilities namely hub, consortium, and unaffiliated site, relatively equally. The increase in connectivity was more dramatic with providers across facilities compared to providers within the same facilities. CONCLUSION: Establishment of the ECOE hub and spoke model contributed to an increase in epilepsy care coordination both within and across facilities from 2000 to 2013, but there was substantial variation across different facilities. SNA is a tool that may help measure coordination of specialty care.


Asunto(s)
Epilepsia/terapia , Personal de Salud/estadística & datos numéricos , Servicios de Salud/estadística & datos numéricos , Neurólogos/estadística & datos numéricos , Red Social , Humanos , Estados Unidos , United States Department of Veterans Affairs
11.
Epilepsy Behav ; 64(Pt A): 4-8, 2016 11.
Artículo en Inglés | MEDLINE | ID: mdl-27723497

RESUMEN

Management of psychogenic nonepileptic seizures (PNES) requires collaboration among and between health care professionals. Although criteria are established for diagnosis of PNES, miscommunication between neurologists, primary care providers, and mental health professionals may occur if the clinical impression is not clearly articulated. We extracted progress notes from the Department of Veterans Affairs (VA) electronic health record (EHR) nationally to study veterans who were evaluated for PNES. Of the 750 patients being worked up for PNES, the majority of patients did not meet criteria for PNES (64.6%). Of those who were thought to suffer from PNES, 147 (19.6%) met International League Against Epilepsy (ILAE) criteria for documented PNES, 14 (1.9%) for clinically established PNES, and 104 (13.9%) for probable or possible PNES. Neurologists tended to use ambiguous language, such as "thought to be" or "suggestive of" to describe their impressions of patients overall, even those with definitive PNES. Ambiguous language may lead to miscommunication across providers and inappropriate health care.


Asunto(s)
Comunicación , Documentación , Registros Electrónicos de Salud , Grupo de Atención al Paciente , Trastornos Psicofisiológicos/diagnóstico , Convulsiones/diagnóstico , Humanos , Trastornos Psicofisiológicos/psicología , Convulsiones/psicología , Veteranos
12.
BMC Bioinformatics ; 15: 86, 2014 Mar 26.
Artículo en Inglés | MEDLINE | ID: mdl-24669769

RESUMEN

BACKGROUND: It is well known that the development of cancer is caused by the accumulation of somatic mutations within the genome. For oncogenes specifically, current research suggests that there is a small set of "driver" mutations that are primarily responsible for tumorigenesis. Further, due to recent pharmacological successes in treating these driver mutations and their resulting tumors, a variety of approaches have been developed to identify potential driver mutations using methods such as machine learning and mutational clustering. We propose a novel methodology that increases our power to identify mutational clusters by taking into account protein tertiary structure via a graph theoretical approach. RESULTS: We have designed and implemented GraphPAC (Graph Protein Amino acid Clustering) to identify mutational clustering while considering protein spatial structure. Using GraphPAC, we are able to detect novel clusters in proteins that are known to exhibit mutation clustering as well as identify clusters in proteins without evidence of prior clustering based on current methods. Specifically, by utilizing the spatial information available in the Protein Data Bank (PDB) along with the mutational data in the Catalogue of Somatic Mutations in Cancer (COSMIC), GraphPAC identifies new mutational clusters in well known oncogenes such as EGFR and KRAS. Further, by utilizing graph theory to account for the tertiary structure, GraphPAC discovers clusters in DPP4, NRP1 and other proteins not identified by existing methods. The R package is available at: http://bioconductor.org/packages/release/bioc/html/GraphPAC.html. CONCLUSION: GraphPAC provides an alternative to iPAC and an extension to current methodology when identifying potential activating driver mutations by utilizing a graph theoretic approach when considering protein tertiary structure.


Asunto(s)
Mutación , Estructura Terciaria de Proteína/genética , Análisis por Conglomerados , Genes Relacionados con las Neoplasias , Proteínas/genética
13.
BMC Bioinformatics ; 15: 231, 2014 Jul 03.
Artículo en Inglés | MEDLINE | ID: mdl-24990767

RESUMEN

BACKGROUND: Current research suggests that a small set of "driver" mutations are responsible for tumorigenesis while a larger body of "passenger" mutations occur in the tumor but do not progress the disease. Due to recent pharmacological successes in treating cancers caused by driver mutations, a variety of methodologies that attempt to identify such mutations have been developed. Based on the hypothesis that driver mutations tend to cluster in key regions of the protein, the development of cluster identification algorithms has become critical. RESULTS: We have developed a novel methodology, SpacePAC (Spatial Protein Amino acid Clustering), that identifies mutational clustering by considering the protein tertiary structure directly in 3D space. By combining the mutational data in the Catalogue of Somatic Mutations in Cancer (COSMIC) and the spatial information in the Protein Data Bank (PDB), SpacePAC is able to identify novel mutation clusters in many proteins such as FGFR3 and CHRM2. In addition, SpacePAC is better able to localize the most significant mutational hotspots as demonstrated in the cases of BRAF and ALK. The R package is available on Bioconductor at: http://www.bioconductor.org/packages/release/bioc/html/SpacePAC.html. CONCLUSION: SpacePAC adds a valuable tool to the identification of mutational clusters while considering protein tertiary structure.


Asunto(s)
Biología Computacional/métodos , Mutación , Proteínas/química , Proteínas/genética , Algoritmos , Análisis por Conglomerados , Bases de Datos de Proteínas , Genes Relacionados con las Neoplasias/genética , Humanos , Neoplasias/genética , Estructura Terciaria de Proteína
14.
Methods ; 61(3): 287-98, 2013 Jun 15.
Artículo en Inglés | MEDLINE | ID: mdl-23702368

RESUMEN

Selective or Multiple Reaction monitoring (SRM/MRM) is a liquid-chromatography (LC)/tandem-mass spectrometry (MS/MS) method that enables the quantitation of specific proteins in a sample by analyzing precursor ions and the fragment ions of their selected tryptic peptides. Instrumentation software has advanced to the point that thousands of transitions (pairs of primary and secondary m/z values) can be measured in a triple quadrupole instrument coupled to an LC, by a well-designed scheduling and selection of m/z windows. The design of a good MRM assay relies on the availability of peptide spectra from previous discovery-phase LC-MS/MS studies. The tedious aspect of manually developing and processing MRM assays involving thousands of transitions has spurred to development of software tools to automate this process. Software packages have been developed for project management, assay development, assay validation, data export, peak integration, quality assessment, and biostatistical analysis. No single tool provides a complete end-to-end solution, thus this article reviews the current state and discusses future directions of these software tools in order to enable researchers to combine these tools for a comprehensive targeted proteomics workflow.


Asunto(s)
Cromatografía Liquida/estadística & datos numéricos , Fragmentos de Péptidos/análisis , Proteínas/análisis , Proteómica/estadística & datos numéricos , Programas Informáticos , Espectrometría de Masas en Tándem/estadística & datos numéricos , Animales , Cromatografía Liquida/métodos , Cromatografía Liquida/normas , Humanos , Iones , Proteómica/métodos , Proteómica/normas , Espectrometría de Masas en Tándem/métodos , Espectrometría de Masas en Tándem/normas
15.
J Investig Med ; 72(1): 139-150, 2024 01.
Artículo en Inglés | MEDLINE | ID: mdl-37668313

RESUMEN

Affecting an estimated 88 million Americans, prediabetes increases the risk for developing type 2 diabetes mellitus (T2DM), and independently, cardiovascular disease, retinopathy, nephropathy, and neuropathy. Nevertheless, little is known about the use of metformin for diabetes prevention among patients in the Veterans Health Administration, the largest integrated healthcare system in the U.S. This is a retrospective observational cohort study of the proportion of Veterans with incident prediabetes who were prescribed metformin at the Veterans Health Administration from October 2010 to September 2019. Among 1,059,605 Veterans with incident prediabetes, 12,009 (1.1%) were prescribed metformin during an average 3.4 years of observation after diagnosis. Metformin prescribing was marginally higher (1.6%) among those with body mass index (BMI) ≥35 kg/m2, age <60 years, HbA1c≥6.0%, or those with a history of gestational diabetes, all subgroups at a higher risk for progression to T2DM. In a multivariable model, metformin was more likely to be prescribed for those with BMI ≥35 kg/m2 incidence rate ratio [IRR] 2.6 [95% confidence intervals (CI): 2.1-3.3], female sex IRR, 2.4 [95% CI: 1.8-3.3], HbA1c≥6% IRR, 1.93 [95% CI: 1.5-2.4], age <60 years IRR, 1.7 [95% CI: 1.3-2.3], hypertriglyceridemia IRR, 1.5 [95% CI: 1.2-1.9], hypertension IRR, 1.5 [95% CI: 1.1-2.1], Major Depressive Disorder IRR, 1.5 [95% CI: 1.1-2.0], or schizophrenia IRR, 2.1 [95% CI: 1.2-3.8]. Over 20% of Veterans with prediabetes attended a comprehensive structured lifestyle modification clinic or program. Among Veterans with prediabetes, metformin was prescribed to 1.1% overall, a proportion that marginally increased to 1.6% in the subset of individuals at highest risk for progression to T2DM.


Asunto(s)
Trastorno Depresivo Mayor , Diabetes Mellitus Tipo 2 , Metformina , Estado Prediabético , Veteranos , Femenino , Humanos , Persona de Mediana Edad , Estudios de Cohortes , Trastorno Depresivo Mayor/tratamiento farmacológico , Diabetes Mellitus Tipo 2/tratamiento farmacológico , Diabetes Mellitus Tipo 2/epidemiología , Diabetes Mellitus Tipo 2/diagnóstico , Hipoglucemiantes/uso terapéutico , Metformina/uso terapéutico , Estado Prediabético/tratamiento farmacológico , Estado Prediabético/epidemiología , Prescripciones , Estudios Retrospectivos
16.
BMC Bioinformatics ; 14: 190, 2013 Jun 13.
Artículo en Inglés | MEDLINE | ID: mdl-23758891

RESUMEN

BACKGROUND: Human cancer is caused by the accumulation of somatic mutations in tumor suppressors and oncogenes within the genome. In the case of oncogenes, recent theory suggests that there are only a few key "driver" mutations responsible for tumorigenesis. As there have been significant pharmacological successes in developing drugs that treat cancers that carry these driver mutations, several methods that rely on mutational clustering have been developed to identify them. However, these methods consider proteins as a single strand without taking their spatial structures into account. We propose an extension to current methodology that incorporates protein tertiary structure in order to increase our power when identifying mutation clustering. RESULTS: We have developed iPAC (identification of Protein Amino acid Clustering), an algorithm that identifies non-random somatic mutations in proteins while taking into account the three dimensional protein structure. By using the tertiary information, we are able to detect both novel clusters in proteins that are known to exhibit mutation clustering as well as identify clusters in proteins without evidence of clustering based on existing methods. For example, by combining the data in the Protein Data Bank (PDB) and the Catalogue of Somatic Mutations in Cancer, our algorithm identifies new mutational clusters in well known cancer proteins such as KRAS and PI3KC α. Further, by utilizing the tertiary structure, our algorithm also identifies clusters in EGFR, EIF2AK2, and other proteins that are not identified by current methodology. The R package is available at: http://www.bioconductor.org/packages/2.12/bioc/html/iPAC.html. CONCLUSION: Our algorithm extends the current methodology to identify oncogenic activating driver mutations by utilizing tertiary protein structure when identifying nonrandom somatic residue mutation clusters.


Asunto(s)
Algoritmos , Mutación , Proteínas de Neoplasias/genética , Estructura Terciaria de Proteína , Análisis por Conglomerados , Humanos , Proteínas de Neoplasias/química
17.
Res Sq ; 2023 Feb 16.
Artículo en Inglés | MEDLINE | ID: mdl-36824778

RESUMEN

Background: Identifying chemical mentions within the Alzheimer's and dementia literature can provide a powerful tool to further therapeutic research. Leveraging the Chemical Entities of Biological Interest (ChEBI) ontology, which is rich in hierarchical and other relationship types, for entity normalization can provide an advantage for future downstream applications. We provide a reproducible hybrid approach that combines an ontology-enhanced PubMedBERT model for disambiguation with a dictionary-based method for candidate selection. Results: There were 56,553 chemical mentions in the titles of 44,812 unique PubMed article abstracts. Based on our gold standard, our method of disambiguation improved entity normalization by 25.3 percentage points compared to using only the dictionary-based approach with fuzzy-string matching for disambiguation. For our Alzheimer's and dementia cohort, we were able to add 47.1% more potential mappings between MeSH and ChEBI when compared to BioPortal. Conclusion: Use of natural language models like PubMedBERT and resources such as ChEBI and PubChem provide a beneficial way to link entity mentions to ontology terms, while further supporting downstream tasks like filtering ChEBI mentions based on roles and assertions to find beneficial therapies for Alzheimer's and dementia.

18.
BMJ Health Care Inform ; 30(1)2023 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-37730251

RESUMEN

OBJECTIVE: The study aimed to measure the validity of International Classification of Diseases, 10th Edition (ICD-10) code F44.5 for functional seizure disorder (FSD) in the Veterans Affairs Connecticut Healthcare System electronic health record (VA EHR). METHODS: The study used an informatics search tool, a natural language processing algorithm and a chart review to validate FSD coding. RESULTS: The positive predictive value (PPV) for code F44.5 was calculated to be 44%. DISCUSSION: ICD-10 introduced a specific code for FSD to improve coding validity. However, results revealed a meager (44%) PPV for code F44.5. Evaluation of the low diagnostic precision of FSD identified inconsistencies in the ICD-10 and VA EHR systems. CONCLUSION: Information system improvements may increase the precision of diagnostic coding by clinicians. Specifically, the EHR problem list should include commonly used diagnostic codes and an appropriately curated ICD-10 term list for 'seizure disorder,' and a single ICD code for FSD should be classified under neurology and psychiatry.


Asunto(s)
Epilepsia , Clasificación Internacional de Enfermedades , Humanos , Algoritmos , Registros Electrónicos de Salud , Epilepsia/diagnóstico , Procesamiento de Lenguaje Natural
19.
BMC Bioinformatics ; 13 Suppl 1: S10, 2012 01 25.
Artículo en Inglés | MEDLINE | ID: mdl-22373303

RESUMEN

BACKGROUND: The RDF triple provides a simple linguistic means of describing limitless types of information. Triples can be flexibly combined into a unified data source we call a semantic model. Semantic models open new possibilities for the integration of variegated biological data. We use Semantic Web technology to explicate high throughput clinical data in the context of fundamental biological knowledge. We have extended Corvus, a data warehouse which provides a uniform interface to various forms of Omics data, by providing a SPARQL endpoint. With the querying and reasoning tools made possible by the Semantic Web, we were able to explore quantitative semantic models retrieved from Corvus in the light of systematic biological knowledge. RESULTS: For this paper, we merged semantic models containing genomic, transcriptomic and epigenomic data from melanoma samples with two semantic models of functional data - one containing Gene Ontology (GO) data, the other, regulatory networks constructed from transcription factor binding information. These two semantic models were created in an ad hoc manner but support a common interface for integration with the quantitative semantic models. Such combined semantic models allow us to pose significant translational medicine questions. Here, we study the interplay between a cell's molecular state and its response to anti-cancer therapy by exploring the resistance of cancer cells to Decitabine, a demethylating agent. CONCLUSIONS: We were able to generate a testable hypothesis to explain how Decitabine fights cancer - namely, that it targets apoptosis-related gene promoters predominantly in Decitabine-sensitive cell lines, thus conveying its cytotoxic effect by activating the apoptosis pathway. Our research provides a framework whereby similar hypotheses can be developed easily.


Asunto(s)
Epigenómica , Perfilación de la Expresión Génica , Internet , Melanoma/genética , Semántica , Azacitidina/análogos & derivados , Azacitidina/farmacología , Sistemas de Administración de Bases de Datos , Decitabina , Resistencia a Antineoplásicos/genética , Ontología de Genes , Redes Reguladoras de Genes , Humanos , Melanoma/tratamiento farmacológico , Melanoma/patología , Factores de Transcripción/metabolismo , Investigación Biomédica Traslacional
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA