RESUMO
MOTIVATION: Methods for concept recognition (CR) in clinical texts have largely been tested on abstracts or articles from the medical literature. However, texts from electronic health records (EHRs) frequently contain spelling errors, abbreviations, and other nonstandard ways of representing clinical concepts. RESULTS: Here, we present a method inspired by the BLAST algorithm for biosequence alignment that screens texts for potential matches on the basis of matching k-mer counts and scores candidates based on conformance to typical patterns of spelling errors derived from 2.9 million clinical notes. Our method, the Term-BLAST-like alignment tool (TBLAT) leverages a gold standard corpus for typographical errors to implement a sequence alignment-inspired method for efficient entity linkage. We present a comprehensive experimental comparison of TBLAT with five widely used tools. Experimental results show an increase of 10% in recall on scientific publications and 20% increase in recall on EHR records (when compared against the next best method), hence supporting a significant enhancement of the entity linking task. The method can be used stand-alone or as a complement to existing approaches. AVAILABILITY AND IMPLEMENTATION: Fenominal is a Java library that implements TBLAT for named CR of Human Phenotype Ontology terms and is available at https://github.com/monarch-initiative/fenominal under the GNU General Public License v3.0.
Assuntos
Algoritmos , Idioma , Humanos , Alinhamento de Sequência , Registros Eletrônicos de Saúde , PublicaçõesRESUMO
AIM: To facilitate multisite studies and international clinical research, this study aimed to identify consensus-based, standardized common data elements (CDEs) for arthrogryposis multiplex congenita (AMC). METHOD: A mixed-methods study comprising of several focus group discussions and three rounds of modified Delphi surveys to achieve consensus using two tiered-rating scales were conducted. RESULTS: Overall, 45 clinical experts and adults with lived experience (including 12 members of an AMC consortium) participated in this study from 11 countries in North America, Europe, and Australia. The CDEs include 321 data elements and 19 standardized measures across various domains from fetal development to adulthood. Data elements pertaining to AMC phenotypic traits were mapped according to the Human Phenotype Ontology. A universal governance structure, local operating protocols, and sustainability plans were identified as the main facilitators, whereas limited capacity for data sharing and the need for a federated informatics infrastructure were the main barriers. INTERPRETATION: Collection of systematic data on AMC using CDEs will allow investigations on etiological pathways, describe epidemiological profile, and establish genotype-phenotype correlations in a standardized manner. The proposed CDEs will facilitate international multidisciplinary collaborations by improving large-scale studies and opportunities for data sharing, knowledge translation, and dissemination.
RESUMO
OBJECTIF: Afin de faciliter les études multisites et la recherche clinique d'envergure internationale, cette étude a pour but d'identifier des éléments de données communs (EDCs) normalisés et fondés sur un consensus pour l'arthrogrypose multiple congénitale (AMC). MÉTHODE: Une étude à méthodes mixtes comprenant plusieurs groupes de discussion et trois séries d'enquêtes Delphi modifiées pour parvenir à un consensus ont été menées. RÉSULTATS: Dans l'ensemble, 45 experts cliniques ainsi qu'adultes ayant une expérience vécue (dont 12 membres d'un consortium d'AMC) ont participé à cette étude à travers 11 pays en Amérique du Nord, Europe et Australie. Les EDCs comprennent 321 éléments de données et 19 mesures standardisées dans divers domaines, du développement du fÅtus à l'âge adulte. Les éléments de données relatifs aux traits phénotypiques de l'AMC ont été cartographiés conformément à l'ontologie du phénotype humain (HPO). Une structure de gouvernance universelle, des protocoles de fonctionnement et des plans de développement durable ont été identifiés comme les principaux facilitateurs considérant que la capacité limitée de partage des données et la nécessité d'une infrastructure informatique fédérée étaient les principaux obstacles. INTERPRÉTATION: Une collecte de données systématiques sur l'AMC à l'aide d'EDCs permettra d'étudier sur les voies étiologiques, décrire le profil épidémiologique, et établir des corrélations génotypephénotype de manière standardisée. Les EDCs proposés faciliteront les collaborations internationales multidisciplinaires en améliorant à grande échelle les études multicentriques, les possibilités de partage des données, ainsi que le transfert et la diffusion des connaissances.
RESUMO
OBJETIVO: Para facilitar los estudios multicéntricos y la investigación clínica internacional, este estudio pretende identificar de forma consensuada los elementos de datos estandarizados para la artrogriposis múltiple congénita (AMC). MÉTODO: Estudio de métodos mixtos de grupos de discusión y tres rondas de encuestas Delphi modificadas para llegar a un consenso utilizando dos escalas de clasificación por niveles. RESULTADOS: En total, 45 expertos clínicos y adultos con experiencia vivida (incluidos 12 miembros de un consorcio de AMC) participaron en este estudio procedentes de 11 países: Norteamérica, Europa y Australia. Los CDEs incluyen 321 elementos de datos y 19 medidas estandarizadas en varios dominios desde el desarrollo fetal hasta la edad adulta. Los elementos de datos relativos a los rasgos fenotípicos del CDEs se mapearon de acuerdo con la Ontología de Fenotipos Humanos. Se identificaron como principales facilitadores la estructura de gobernanza universal, protocolos operados de forma local y los planes de sostenibilidad, mientras que los principales obstáculos observados son la capacidad limitada para compartir datos y la necesidad de una infraestructura informática federada. INTERPRETACIÓN: La recopilación de datos sistemáticos sobre la AMC mediante CDEs permitirá investigar las vías etiológicas, describir el perfil epidemiológico y establecer correlaciones genotipofenotipo de forma estandarizada. Los CDEs propuestos facilitarán las colaboraciones multidisciplinares internacionales mejorando los estudios a gran escala y las oportunidades para compartir datos, translación de conocimiento y difusión.
RESUMO
Shriners Children's (SHC) is a hospital system whose mission is to advance the treatment and research of pediatric diseases. SHC success has generated a wealth of clinical data. Unfortunately, barriers to healthcare data access often limit data-driven clinical research. We decreased this burden by allowing access to clinical data via the standardized data access standard called FHIR (Fast Healthcare Interoperability Resources). Specifically, we converted existing data in the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) standard into FHIR data elements using a technology called OMOP-on-FHIR. In addition, we developed two applications leveraging the FHIR data elements to facilitate patient cohort curation to advance research into pediatric musculoskeletal diseases. Our work enables clinicians and clinical researchers to use hundreds of currently available open-sourced FHIR applications. Our successful implementation of OMOP-on-FHIR within a large hospital system will accelerate advancements in pediatric disease treatment and research.
Assuntos
Informática Médica , Doenças Musculoesqueléticas , Criança , Instalações de Saúde , Hospitais , Humanos , TecnologiaRESUMO
PURPOSE: Pharmacologic doses of corticosteroid (CS) have been shown to ameliorate the progression of Duchenne muscular dystrophy (DMD) preserving strength, pulmonary function and ambulation as well as reducing the incidence of scoliosis. However, there are serious side effects of CS, which may impact dose tolerance. The purpose of this study was to compare the magnitude of positive CS effects on patients in our clinic to those reported in the literature. METHODS: We retrospectively reviewed medical records and radiographs of 142 DMD patients who were seen between 1st January 1991 and 31st December 2017. RESULTS: In total, 101 boys met study inclusion criteria. Of these 32 were steroid naïve, 37 took the recommended dose (standard of care, SOC) of Prednisone or Deflazacort, and 32 took a lower dose (LD). Following initiation of CS, both treatment groups showed an increase in weight velocity and decrease in linear growth velocity. Although there was a trend to later loss of ambulation (LOA) in the SOC group relative to the naïve group by one year, this was not significant, however, a small subgroup of boys on Deflazacort showed a 3.4 year later LOA than the naïve group. The incidence of scoliosis was reduced from 69% in the naïve, to 41% in the LD and 47% in the SOC group. CONCLUSIONS: Although there was a reduction in the incidence of scoliosis, it was not as robust as seen elsewhere. Many published studies have inadequate data on scoliosis probably due to the lack of inclusion of orthopaedists in the study group. LEVEL OF EVIDENCE: IV.