RESUMEN
Understanding population health disparities is an essential component of equitable precision health efforts. Epidemiology research often relies on definitions of race and ethnicity, but these population labels may not adequately capture disease burdens and environmental factors impacting specific sub-populations. Here, we propose a framework for repurposing data from electronic health records (EHRs) in concert with genomic data to explore the demographic ties that can impact disease burdens. Using data from a diverse biobank in New York City, we identified 17 communities sharing recent genetic ancestry. We observed 1,177 health outcomes that were statistically associated with a specific group and demonstrated significant differences in the segregation of genetic variants contributing to Mendelian diseases. We also demonstrated that fine-scale population structure can impact the prediction of complex disease risk within groups. This work reinforces the utility of linking genomic data to EHRs and provides a framework toward fine-scale monitoring of population health.
Asunto(s)
Etnicidad/genética , Salud Poblacional , Bases de Datos Genéticas , Registros Electrónicos de Salud , Genómica , Humanos , AutoinformeRESUMEN
Precision medicine initiatives across the globe have led to a revolution of repositories linking large-scale genomic data with electronic health records, enabling genomic analyses across the entire phenome. Many of these initiatives focus solely on research insights, leading to limited direct benefit to patients. We describe the biobank at the Colorado Center for Personalized Medicine (CCPM Biobank) that was jointly developed by the University of Colorado Anschutz Medical Campus and UCHealth to serve as a unique, dual-purpose research and clinical resource accelerating personalized medicine. This living resource currently has more than 200,000 participants with ongoing recruitment. We highlight the clinical, laboratory, regulatory, and HIPAA-compliant informatics infrastructure along with our stakeholder engagement, consent, recontact, and participant engagement strategies. We characterize aspects of genetic and geographic diversity unique to the Rocky Mountain region, the primary catchment area for CCPM Biobank participants. We leverage linked health and demographic information of the CCPM Biobank participant population to demonstrate the utility of the CCPM Biobank to replicate complex trait associations in the first 33,674 genotyped individuals across multiple disease domains. Finally, we describe our current efforts toward return of clinical genetic test results, including high-impact pathogenic variants and pharmacogenetic information, and our broader goals as the CCPM Biobank continues to grow. Bringing clinical and research interests together fosters unique clinical and translational questions that can be addressed from the large EHR-linked CCPM Biobank resource within a HIPAA- and CLIA-certified environment.
Asunto(s)
Aprendizaje del Sistema de Salud , Medicina de Precisión , Humanos , Bancos de Muestras Biológicas , Colorado , GenómicaRESUMEN
A long-standing recognition that information from human genetics studies has the potential to accelerate drug discovery has led to decades of research on how to leverage genetic and phenotypic information for drug discovery. Established simple and advanced statistical methods that allow the simultaneous analysis of genotype and clinical phenotype data by genome- and phenome-wide analyses, colocalization analyses with quantitative trait loci data from transcriptomics and proteomics data sets from different tissues, and Mendelian randomization are essential tools for drug development in the postgenomic era. Numerous studies have demonstrated how genomic data provide opportunities for the identification of new drug targets, the repurposing of drugs, and drug safety analyses. With an increase in the number of biobanks that enable linking in-depth omics data with rich repositories of phenotypic traits via electronic health records, more powerful ways for the evaluation and validation of drug targets will continue to expand across different disciplines of clinical research.
Asunto(s)
Registros Electrónicos de Salud , Estudio de Asociación del Genoma Completo , Humanos , Estudio de Asociación del Genoma Completo/métodos , Genómica/métodos , Fenotipo , Descubrimiento de DrogasRESUMEN
Co-occurrence of diseases decreases patient quality of life, complicates treatment choices, and increases mortality. Analyses of electronic health records present a complex scenario of comorbidity relationships that vary by age, sex, and cohort under study. The study of similarities between diseases using 'omics data, such as genes altered in diseases, gene expression, proteome, and microbiome, are fundamental to uncovering the origin of, and potential treatment for, comorbidities. Recent studies have produced a first generation of genetic interpretations for as much as 46% of the comorbidities described in large cohorts. Integrating different sources of molecular information and using artificial intelligence (AI) methods are promising approaches for the study of comorbidities. They may help to improve the treatment of comorbidities, including the potential repositioning of drugs.
Asunto(s)
Inteligencia Artificial , Calidad de Vida , Humanos , ComorbilidadRESUMEN
Population-scale biobanks linked to electronic health record data provide vast opportunities to extend our knowledge of human genetics and discover new phenotype-genotype associations. Given their dense phenotype data, biobanks can also facilitate replication studies on a phenome-wide scale. Here, we introduce the phenotype-genotype reference map (PGRM), a set of 5,879 genetic associations from 523 GWAS publications that can be used for high-throughput replication experiments. PGRM phenotypes are standardized as phecodes, ensuring interoperability between biobanks. We applied the PGRM to five ancestry-specific cohorts from four independent biobanks and found evidence of robust replications across a wide array of phenotypes. We show how the PGRM can be used to detect data corruption and to empirically assess parameters for phenome-wide studies. Finally, we use the PGRM to explore factors associated with replicability of GWAS results.
Asunto(s)
Bancos de Muestras Biológicas , Ciencia de los Datos , Humanos , Fenómica , Fenotipo , GenotipoRESUMEN
As large-scale genomic screening becomes increasingly prevalent, understanding the influence of actionable results on healthcare utilization is key to estimating the potential long-term clinical impact. The eMERGE network sequenced individuals for actionable genes in multiple genetic conditions and returned results to individuals, providers, and the electronic health record. Differences in recommended health services (laboratory, imaging, and procedural testing) delivered within 12 months of return were compared among individuals with pathogenic or likely pathogenic (P/LP) findings to matched individuals with negative findings before and after return of results. Of 16,218 adults, 477 unselected individuals were found to have a monogenic risk for arrhythmia (n = 95), breast cancer (n = 96), cardiomyopathy (n = 95), colorectal cancer (n = 105), or familial hypercholesterolemia (n = 86). Individuals with P/LP results more frequently received services after return (43.8%) compared to before return (25.6%) of results and compared to individuals with negative findings (24.9%; p < 0.0001). The annual cost of qualifying healthcare services increased from an average of $162 before return to $343 after return of results among the P/LP group (p < 0.0001); differences in the negative group were non-significant. The mean difference-in-differences was $149 (p < 0.0001), which describes the increased cost within the P/LP group corrected for cost changes in the negative group. When stratified by individual conditions, significant cost differences were observed for arrhythmia, breast cancer, and cardiomyopathy. In conclusion, less than half of individuals received billed health services after monogenic return, which modestly increased healthcare costs for payors in the year following return.
Asunto(s)
Neoplasias de la Mama , Cardiomiopatías , Adulto , Humanos , Femenino , Estudios Prospectivos , Aceptación de la Atención de Salud , Arritmias Cardíacas , Neoplasias de la Mama/genética , Cardiomiopatías/genéticaRESUMEN
A major focus of academia, industry, and global governmental agencies is to develop and apply artificial intelligence and other advanced analytical tools to transform health care delivery. The American Heart Association supports the creation of tools and services that would further the science and practice of precision medicine by enabling more precise approaches to cardiovascular and stroke research, prevention, and care of individuals and populations. Nevertheless, several challenges exist, and few artificial intelligence tools have been shown to improve cardiovascular and stroke care sufficiently to be widely adopted. This scientific statement outlines the current state of the art on the use of artificial intelligence algorithms and data science in the diagnosis, classification, and treatment of cardiovascular disease. It also sets out to advance this mission, focusing on how digital tools and, in particular, artificial intelligence may provide clinical and mechanistic insights, address bias in clinical studies, and facilitate education and implementation science to improve cardiovascular and stroke outcomes. Last, a key objective of this scientific statement is to further the field by identifying best practices, gaps, and challenges for interested stakeholders.
Asunto(s)
Enfermedades Cardiovasculares , Cardiopatías , Accidente Cerebrovascular , Estados Unidos , Humanos , Inteligencia Artificial , American Heart Association , Enfermedades Cardiovasculares/terapia , Enfermedades Cardiovasculares/prevención & control , Accidente Cerebrovascular/diagnóstico , Accidente Cerebrovascular/prevención & controlRESUMEN
Diagnosis for rare genetic diseases often relies on phenotype-driven methods, which hinge on the accuracy and completeness of the rare disease phenotypes in the underlying annotation knowledgebase. Existing knowledgebases are often manually curated with additional annotations found in published case reports. Despite their potential, real-world data such as electronic health records (EHRs) have not been fully exploited to derive rare disease annotations. Here, we present open annotation for rare diseases (OARD), a real-world-data-derived resource with annotation for rare-disease-related phenotypes. This resource is derived from the EHRs of two academic health institutions containing more than 10 million individuals spanning wide age ranges and different disease subgroups. By leveraging ontology mapping and advanced natural-language-processing (NLP) methods, OARD automatically and efficiently extracts concepts for both rare diseases and their phenotypic traits from billing codes and lab tests as well as over 100 million clinical narratives. The rare disease prevalence derived by OARD is highly correlated with those annotated in the original rare disease knowledgebase. By performing association analysis, we identified more than 1 million novel disease-phenotype association pairs that were previously missed by human annotation, and >60% were confirmed true associations via manual review of a list of sampled pairs. Compared to the manual curated annotation, OARD is 100% data driven and its pipeline can be shared across different institutions. By supporting privacy-preserving sharing of aggregated summary statistics, such as term frequencies and disease-phenotype associations, it fills an important gap to facilitate data-driven research in the rare disease community.
Asunto(s)
Procesamiento de Lenguaje Natural , Enfermedades Raras , Registros Electrónicos de Salud , Humanos , Fenotipo , Enfermedades Raras/genéticaRESUMEN
Complex traits are influenced by genetic risk factors, lifestyle, and environmental variables, so-called exposures. Some exposures, e.g., smoking or lipid levels, have common genetic modifiers identified in genome-wide association studies. Because measurements are often unfeasible, exposure polygenic risk scores (ExPRSs) offer an alternative to study the influence of exposures on various phenotypes. Here, we collected publicly available summary statistics for 28 exposures and applied four common PRS methods to generate ExPRSs in two large biobanks: the Michigan Genomics Initiative and the UK Biobank. We established ExPRSs for 27 exposures and demonstrated their applicability in phenome-wide association studies and as predictors for common chronic conditions. Especially the addition of multiple ExPRSs showed, for several chronic conditions, an improvement compared to prediction models that only included traditional, disease-focused PRSs. To facilitate follow-up studies, we share all ExPRS constructs and generated results via an online repository called ExPRSweb.
Asunto(s)
Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Lípidos , Herencia Multifactorial/genética , Factores de RiesgoRESUMEN
PURPOSE: Coding mutations in the Transthyretin (TTR) gene cause a hereditary form of amyloidosis characterized by a complex genotype-phenotype correlation with limited information regarding differences among worldwide populations. METHODS: We compared 676 diverse individuals carrying TTR amyloidogenic mutations (rs138065384, Phe44Leu; rs730881165, Ala81Thr; rs121918074, His90Asn; rs76992529, Val122Ile) to 12,430 non-carriers matched by age, sex, and genetically-inferred ancestry to assess their clinical presentations across 1,693 outcomes derived from electronic health records in UK biobank. RESULTS: In individuals of African descent (AFR), Val122Ile mutation was linked to multiple outcomes related to the circulatory system (fold-enrichment = 2.96, p = 0.002) with the strongest associations being cardiac congenital anomalies (phecode 747.1, p = 0.003), endocarditis (phecode 420.3, p = 0.006), and cardiomyopathy (phecode 425, p = 0.007). In individuals of Central-South Asian descent (CSA), His90Asn mutation was associated with dermatologic outcomes (fold-enrichment = 28, p = 0.001). The same TTR mutation was linked to neoplasms in European-descent individuals (EUR, fold-enrichment = 3.09, p = 0.003). In EUR, Ala81Thr showed multiple associations with respiratory outcomes related (fold-enrichment = 3.61, p = 0.002), but the strongest association was with atrioventricular block (phecode 426.2, p = 2.81 × 10- 4). Additionally, the same mutation in East Asians (EAS) showed associations with endocrine-metabolic traits (fold-enrichment = 4.47, p = 0.003). In the cross-ancestry meta-analysis, Val122Ile mutation was associated with peripheral nerve disorders (phecode 351, p = 0.004) in addition to cardiac congenital anomalies (fold-enrichment = 6.94, p = 0.003). CONCLUSIONS: Overall, these findings highlight that TTR amyloidogenic mutations present ancestry-specific and ancestry-convergent associations related to a range of health domains. This supports the need to increase awareness regarding the range of outcomes associated with TTR mutations across worldwide populations to reduce misdiagnosis and delayed diagnosis of TTR-related amyloidosis.
Asunto(s)
Amiloidosis , Prealbúmina , Humanos , Prealbúmina/genética , Mutación , Amiloidosis/diagnóstico , Amiloidosis/genética , Fenotipo , Genética de PoblaciónRESUMEN
Rationale: Cardiovascular events after chronic obstructive pulmonary disease (COPD) exacerbations are recognized. Studies to date have been post hoc analyses of trials, did not differentiate exacerbation severity, included death in the cardiovascular outcome, or had insufficient power to explore individual outcomes temporally.Objectives: We explore temporal relationships between moderate and severe exacerbations and incident, nonfatal hospitalized cardiovascular events in a primary care-derived COPD cohort.Methods: We included people with COPD in England from 2014 to 2020, from the Clinical Practice Research Datalink Aurum primary care database. The index date was the date of first COPD exacerbation or, for those without exacerbations, date upon eligibility. We determined composite and individual cardiovascular events (acute coronary syndrome, arrhythmia, heart failure, ischemic stroke, and pulmonary hypertension) from linked hospital data. Adjusted Cox regression models were used to estimate average and time-stratified adjusted hazard ratios (aHRs).Measurements and Main Results: Among 213,466 patients, 146,448 (68.6%) had any exacerbation; 119,124 (55.8%) had moderate exacerbations, and 27,324 (12.8%) had severe exacerbations. A total of 40,773 cardiovascular events were recorded. There was an immediate period of cardiovascular relative rate after any exacerbation (1-14 d; aHR, 3.19 [95% confidence interval (CI), 2.71-3.76]), followed by progressively declining yet maintained effects, elevated after one year (aHR, 1.84 [95% CI, 1.78-1.91]). Hazard ratios were highest 1-14 days after severe exacerbations (aHR, 14.5 [95% CI, 12.2-17.3]) but highest 14-30 days after moderate exacerbations (aHR, 1.94 [95% CI, 1.63-2.31]). Cardiovascular outcomes with the greatest two-week effects after a severe exacerbation were arrhythmia (aHR, 12.7 [95% CI, 10.3-15.7]) and heart failure (aHR, 8.31 [95% CI, 6.79-10.2]).Conclusions: Cardiovascular events after moderate COPD exacerbations occur slightly later than after severe exacerbations; heightened relative rates remain beyond one year irrespective of severity. The period immediately after an exacerbation presents a critical opportunity for clinical intervention and treatment optimization to prevent future cardiovascular events.
Asunto(s)
Enfermedades Cardiovasculares , Insuficiencia Cardíaca , Enfermedad Pulmonar Obstructiva Crónica , Humanos , Progresión de la Enfermedad , Enfermedad Pulmonar Obstructiva Crónica/complicaciones , Enfermedad Pulmonar Obstructiva Crónica/epidemiología , Enfermedad Pulmonar Obstructiva Crónica/tratamiento farmacológico , Arritmias Cardíacas , Insuficiencia Cardíaca/epidemiología , Enfermedades Cardiovasculares/epidemiología , Enfermedades Cardiovasculares/etiologíaRESUMEN
Here, we summarize the proceedings of the inaugural Artificial Intelligence in Primary Immune Deficiencies conference, during which experts and advocates gathered to advance research into the applications of artificial intelligence (AI), machine learning, and other computational tools in the diagnosis and management of inborn errors of immunity (IEIs). The conference focused on the key themes of expediting IEI diagnoses, challenges in data collection, roles of natural language processing and large language models in interpreting electronic health records, and ethical considerations in implementation. Innovative AI-based tools trained on electronic health records and claims databases have discovered new patterns of warning signs for IEIs, facilitating faster diagnoses and enhancing patient outcomes. Challenges in training AIs persist on account of data limitations, especially in cases of rare diseases, overlapping phenotypes, and biases inherent in current data sets. Furthermore, experts highlighted the significance of ethical considerations, data protection, and the necessity for open science principles. The conference delved into regulatory frameworks, equity in access, and the imperative for collaborative efforts to overcome these obstacles and harness the transformative potential of AI. Concerted efforts to successfully integrate AI into daily clinical immunology practice are still needed.
Asunto(s)
Inteligencia Artificial , Enfermedades de Inmunodeficiencia Primaria , Humanos , Aprendizaje Automático , Procesamiento de Lenguaje Natural , Recolección de DatosRESUMEN
BACKGROUND: General pediatric providers are the front line for early peanut introduction discussions, but many providers believe that they are ill-equipped to handle such discussions, as the guidelines have changed quickly. OBJECTIVE: We hypothesized that a clinical decision support (CDS) tool could improve discussions of peanut introduction. METHODS: CDS tools were designed by stakeholders, improved through usability testing, and integrated into the current note templates. On the basis of queries of electronic health records, we did a preperformance versus postperformance evaluation of conversations regarding peanut introduction, barriers to peanut introduction, and percentage of 12-month well-child checkups (WCCs) that resulted in successful introduction of peanut. Providers completed surveys before and after intervention to assess their awareness of early peanut introduction and comfort using the CDS tools. RESULTS: Providers' awareness of early peanut introduction guidelines increased from 17.8% to 66.7% after the CDS tool was implemented; 79.1% of the providers were comfortable using the tool. The CDS tool improved peanut introduction conversations at the 4-month WCC from 2.4% to 81.2%, at the 6-month WCC from 3.0% to 84.2%, and at the 12-month WCC from 2.7% to 82.9%. In all, 56.6% of families had a plan to introduce peanut at the 4-month WCC. Of those who did not have a plan, the most common barrier was the family's unawareness of the benefits of early peanut introduction. At the 12-month WCC, 62.8% of families had introduced peanut without concerns. CONCLUSION: A point-of-care CDS tool encouraged more discussions of early peanut introduction between general pediatric providers and all patients. CDS tools should be considered in quality improvement projects as an implementation method for the most up-to-date guidelines.
Asunto(s)
Arachis , Sistemas de Apoyo a Decisiones Clínicas , Hipersensibilidad al Cacahuete , Humanos , Hipersensibilidad al Cacahuete/diagnóstico , Arachis/inmunología , Lactante , Femenino , Masculino , Guías de Práctica Clínica como AsuntoRESUMEN
Within a multistate clinical cohort, SARS-CoV-2 antiviral prescribing patterns were evaluated from April 2022-June 2023 among nonhospitalized patients with SARS-CoV-2 with risk factors for severe COVID-19. Among 3247 adults, only 31.9% were prescribed an antiviral agent (87.6% nirmatrelvir/ritonavir, 11.9% molnupiravir, 0.5% remdesivir), highlighting the need to identify and address treatment barriers.
Asunto(s)
Antivirales , Tratamiento Farmacológico de COVID-19 , SARS-CoV-2 , Humanos , Antivirales/uso terapéutico , Masculino , Persona de Mediana Edad , Femenino , Adulto , Anciano , Factores de Riesgo , Ritonavir/uso terapéutico , COVID-19/epidemiología , Adenosina Monofosfato/análogos & derivados , Adenosina Monofosfato/uso terapéutico , Alanina/uso terapéutico , Alanina/análogos & derivados , Pautas de la Práctica en Medicina/estadística & datos numéricos , Citidina/análogos & derivados , HidroxilaminasRESUMEN
BACKGROUND: Limited evidence suggests elevated risks of cardiovascular disease (CVD) among people diagnosed with tuberculosis (TB) disease, though studies have not adjusted for pre-existing CVD risk. We carried out a cohort study using two separate datasets, estimating CVD incidence in people with TB versus those without. METHODS: Using data from the United States (Veterans Health Administration) and the United Kingdom (Clinical Practice Research Datalink) for 2000-2020 we matched adults with incident TB disease and no CVD history 2-years before TB diagnosis (US n=2,121; UK n=15,820) with up to 10 peopleâ¯without TB on the basis of age, sex, race/ethnicity and healthcare practice. Participants were followed beginning 2-years before TB diagnosis and for 2-years subsequently. The acute period was defined as 3-months before/after TB diagnosis. TB, CVD and covariates were identified from electronic routinely collected data (primary and secondary care; mortality). Poisson models estimated incident rate ratios (IRR) for CVD events in people with TB compared to those without. RESULTS: CVD incidence was consistently higher in people with TB, including during the baseline period (pre-TB) and particularly in the acute period: IRRs were US 3.5 (95% Confidence Interval 2.7-4.4), UK 2.7 (2.2-3.3). Rate Ratios remained high after adjusting for differences in pre-existing CVD risk: US 3.2 (2.2-4.4), UK 1.6 (1.2-2.1). CONCLUSION: Increased CVD incidence was observed in people with TB versus those without, especially within months of TB diagnosis, persistent after adjustment for differences in pre-existing risk. Enhancing CVD screening and risk management may improve long-term outcomes in people with TB.
RESUMEN
Viral respiratory illness surveillance has traditionally focused on single pathogens (e.g., influenza) and required fever to identify influenza-like illness (ILI). We developed an automated system applying both laboratory test and syndrome criteria to electronic health records from 3 practice groups in Massachusetts, USA, to monitor trends in respiratory viral-like illness (RAVIOLI) across multiple pathogens. We identified RAVIOLI syndrome using diagnosis codes associated with respiratory viral testing or positive respiratory viral assays or fever. After retrospectively applying RAVIOLI criteria to electronic health records, we observed annual winter peaks during 2015-2019, predominantly caused by influenza, followed by cyclic peaks corresponding to SARS-CoV-2 surges during 2020-2024, spikes in RSV in mid-2021 and late 2022, and recrudescent influenza in late 2022 and 2023. RAVIOLI rates were higher and fluctuations more pronounced compared with traditional ILI surveillance. RAVIOLI broadens the scope, granularity, sensitivity, and specificity of respiratory viral illness surveillance compared with traditional ILI surveillance.
Asunto(s)
Algoritmos , Registros Electrónicos de Salud , Infecciones del Sistema Respiratorio , Humanos , Infecciones del Sistema Respiratorio/virología , Infecciones del Sistema Respiratorio/epidemiología , Infecciones del Sistema Respiratorio/diagnóstico , Estudios Retrospectivos , Gripe Humana/epidemiología , Gripe Humana/diagnóstico , Gripe Humana/virología , COVID-19/epidemiología , COVID-19/diagnóstico , Vigilancia de la Población/métodos , Massachusetts/epidemiología , Adulto , Persona de Mediana Edad , SARS-CoV-2 , Masculino , Adolescente , Niño , Anciano , Femenino , Estaciones del Año , Virosis/epidemiología , Virosis/diagnóstico , Virosis/virología , Preescolar , Adulto JovenRESUMEN
Lyme disease surveillance based on provider and laboratory reports underestimates incidence. We developed an algorithm for automating surveillance using electronic health record data. We identified potential Lyme disease markers in electronic health record data (laboratory tests, diagnosis codes, prescriptions) from January 2017-December 2018 in 2 large practice groups in Massachusetts, USA. We calculated their sensitivities and positive predictive values (PPV), alone and in combination, relative to medical record review. Sensitivities ranged from 57% (95% CI 47%-69%) for immunoassays to 87% (95% CI 70%-100%) for diagnosis codes. PPVs ranged from 53% (95% CI 43%-61%) for diagnosis codes to 58% (95% CI 50%-66%) for immunoassays. The combination of a diagnosis code and antibiotics within 14 days or a positive Western blot had a sensitivity of 100% (95% CI 86%-100%) and PPV of 82% (95% CI 75%-89%). This algorithm could make Lyme disease surveillance more efficient and consistent.
Asunto(s)
Registros Electrónicos de Salud , Enfermedad de Lyme , Humanos , Enfermedad de Lyme/epidemiología , Massachusetts/epidemiología , Vigilancia de la Población , Algoritmos , Historia del Siglo XXIRESUMEN
Confinement facilities are high-risk settings for the spread of infectious disease, necessitating timely surveillance to inform public health action. To identify jail-associated COVID-19 cases from electronic laboratory reports maintained in the Minnesota Electronic Disease Surveillance System (MEDSS), Minnesota, USA, the Minnesota Department of Health developed a surveillance system that used keyword and address matching (KAM). The KAM system used a SAS program (SAS Institute Inc., https://www.sas.com) and an automated program within MEDSS to identify confinement keywords and addresses. To evaluate KAM, we matched jail booking data from the Minnesota Statewide Supervision System by full name and birthdate to the MEDSS records of adults with COVID-19 for 2022. The KAM system identified 2,212 cases in persons detained in jail; sensitivity was 92.40% and specificity was 99.95%. The success of KAM demonstrates its potential to be applied to other diseases and congregate-living settings for real-time surveillance without added reporting burden.
Asunto(s)
COVID-19 , Adulto , Humanos , COVID-19/epidemiología , Cárceles Locales , Minnesota/epidemiología , Prueba de COVID-19 , Salud PúblicaRESUMEN
Hypertension is a common "silent killer" in adult medicine, but epidemiologic estimates of elevated blood pressure in children and adolescents are challenged by under-diagnosis and resultant low utilization of relevant administrative or billing codes. In the article by Horgan et al (Am J Epidemiol 2024), children and adolescents with hypertension and elevated blood pressure were identified using direct assessment of blood pressure measurements available in the electronic health record from both inpatient and outpatient visits ("clinical cohort") in comparison to diagnosis codes ("claims-based cohort"). The study population included 3.75 million pediatric healthcare visits available in the US Food and Drug Administration's Sentinel System. While the study applied a relatively novel methodology to interrogate available clinical data within the EHR to better understand the prevalence of pediatric hypertension and raised concern for a higher occurrence of hypertension among children and adolescents than previously realized using claims codes, the utility of the prevalence estimates may be limited by the potential for misclassification bias inherent in EHR data. However, these data raise important concerns about relaying solely on ICD-9-CM/ICD-10-CM codes to quantify the epidemiology of pediatric hypertension and highlight opportunities to address elevated blood pressure in children that could improve long-term cardiovascular health.
RESUMEN
Electronic health record (EHR) data are seen as an important source for Pharmacoepidemiology studies. In the US healthcare system, EHR systems often only identify fragments of patients' health information across the care continuum, including primary care, specialist care, hospitalizations, and pharmacy dispensing. This leads to unobservable information in longitudinal evaluations of medication effects causing unmeasured confounding, misclassification, and truncated follow-up times. A remedy is to link EHR data with longitudinal claims data which record all encounters during a defined enrollment period across all care settings. We evaluate EHR and claims data sources in three aspects relevant to etiologic studies of medical products: data continuity, data granularity, and data chronology. Reflecting on the strengths and limitations of EHR and insurance claims data, it becomes obvious that they complement each other. The combination of both will improve the validity of etiologic studies and expand the range of questions that can be answered. As the research community transitions towards a future state with access to large-scale combined EHR+claims data, we outline analytic templates to improve the validity and broaden the scope of pharmacoepidemiology studies in the current environment where EHR data are available only for a subset of patients with claims data.