ABSTRACT
BACKGROUND: Sarcopenia, cachexia and frailty have overlapping features and clinical consequences, but often go unrecognized. The objective was to detect patients described by clinicians as having sarcopenia, cachexia or frailty within electronic health records (EHR) and compare clinical variables between cases and matched controls. METHODS: We conducted a case-control study using retrospective data from the Indiana Network for Patient Care multi-health system database from 2016 to 2017. The computable phenotype combined ICD codes for sarcopenia, cachexia and frailty, with clinical note text terms for sarcopenia, cachexia and frailty detected using natural language processing. Cases with these codes or text terms were matched to controls without these codes or text terms matched on birth year, sex and race. Two physicians reviewed EHR for all cases and a subset of controls. Comorbidity codes, laboratory values, and other coded clinical variables were compared between groups using Wilcoxon matched-pair sign-rank test for continuous variables and conditional logistic regression for binary variables. RESULTS: Cohorts of 9594 cases and 9594 matched controls were generated. Cases were 59% female, 69% white, and a median (1st, 3rd quartiles) age 74.9 (62.2, 84.8) years. Most cases were detected by text terms without ICD codes n = 8285 (86.4%). All cases detected by ICD codes (total n = 1309) also had supportive text terms. Overall 1496 (15.6%) had concurrent terms or codes for two or more of the three conditions (sarcopenia, cachexia or frailty). Of text term occurrence, 97% were used positively for sarcopenia, 90% for cachexia, and 95% for frailty. The remaining occurrences were negative uses of the terms or applied to someone other than the patient. Cases had lower body mass index, albumin and prealbumin, and significantly higher odds ratios for diabetes, hypertension, cardiovascular and peripheral vascular diseases, chronic kidney disease, liver disease, malignancy, osteoporosis and fractures (all p < 0.05). Cases were more likely to be prescribed appetite stimulants and caloric supplements. CONCLUSIONS: Patients detected with a computable phenotype for sarcopenia, cachexia and frailty differed from controls in several important clinical variables. Potential uses include detection among clinical cohorts for targeting recruitment for research and interventions.
Subject(s)
Frailty , Sarcopenia , Aged , Cachexia/diagnosis , Cachexia/epidemiology , Case-Control Studies , Electronic Health Records , Female , Frailty/diagnosis , Frailty/epidemiology , Humans , Male , Retrospective Studies , Sarcopenia/diagnosis , Sarcopenia/epidemiologyABSTRACT
OBJECTIVES: Accurate record linkage (RL) enables consolidation and de-duplication of data from disparate datasets, resulting in more comprehensive and complete patient data. However, conducting RL with low quality or unfit data can waste institutional resources on poor linkage results. We aim to evaluate data linkability to enhance the effectiveness of record linkage. MATERIALS AND METHODS: We describe a systematic approach using data fitness ("linkability") measures, defined as metrics that characterize the availability, discriminatory power, and distribution of potential variables for RL. We used the isolation forest algorithm to detect abnormal linkability values from 188 sites in Indiana and Colorado, and manually reviewed the data to understand the cause of anomalies. RESULT: We calculated 10 linkability metrics for 11 potential linkage variables (LVs) across 188 sites for a total of 20 680 linkability metrics. Potential LVs such as first name, last name, date of birth, and sex have low missing data rates, while Social Security Number vary widely in completeness among all sites. We investigated anomalous linkability values to identify the cause of many records having identical values in certain LVs, issues with placeholder values disguising data missingness, and orphan records. DISCUSSION: The fitness of a variable for RL is determined by its availability and its discriminatory power to uniquely identify individuals. These results highlight the need for awareness of placeholder values, which inform the selection of variables and methods to optimize RL performance. CONCLUSION: Evaluating linkability measures using the isolation forest algorithm to highlight anomalous findings can help identify fitness-for-use issues that must be addressed before initiating the RL process to ensure high-quality linkage outcomes.
Subject(s)
Algorithms , Medical Record Linkage , Humans , Medical Record Linkage/methods , Colorado , Electronic Health Records , Indiana , Data AccuracyABSTRACT
Established classifications exist to confirm Sjögren's Disease (SD) (previously referred as Sjögren's Syndrome) and recruit patients for research. However, no established classification exists for diagnosis in clinical settings causing delayed diagnosis. SD patients experience a huge dental disease burden impairing their quality of life. This study established criteria to characterize Indiana University School of Dentistry (IUSD) patients' SD based on symptoms and signs in the electronic health record (EHR) data available through the state-wide Indiana health information exchange (IHIE). Association between SD diagnosis, and comorbidities including other autoimmune conditions, and documentation of SD diagnosis in electronic dental record (EDR) were also determined. The IUSD patients' EDR were linked with their EHR data in the IHIE and queried for SD diagnostic ICD9/10 codes. The resulting cohorts' EHR clinical findings were characterized and classified using diagnostic criteria based on clinical experts' recommendations. Descriptive statistics were performed, and Chi-square tests determined the association between the different SD presentations and comorbidities including other autoimmune conditions. Eighty-three percent of IUSD patients had an EHR of which 377 patients had a SD diagnosis. They were characterized as positive (24%), uncertain (20%) and negative (56%) based on EHR clinical findings. Dry eyes and mouth were reported for 51% and positive Anti-Ro/SSA antibodies and anti-nuclear antibody (ANA) for 17% of this study cohort. One comorbidity was present in 98% and other autoimmune condition/s were present in 53% respectively. Significant differences were observed between the three SD clinical characteristics/classifications and certain medical and autoimmune conditions (p<0.05). Sixty-nine percent of patients' EDR did not mention SD, highlighting the huge gap in reporting SD during dental care. This study of SD patients diagnosed in community practices characterized three different SD clinical presentations, which can be used to generate SD study cohorts for longitudinal studies using EHR data. The results emphasize the heterogenous SD clinical presentations and the need for further research to diagnose SD early in community practice settings where most people seek care.
Subject(s)
Dry Eye Syndromes , Health Information Exchange , Sjogren's Syndrome , Humans , Sjogren's Syndrome/diagnosis , Sjogren's Syndrome/epidemiology , Quality of Life , Antibodies, AntinuclearABSTRACT
BACKGROUND: Early studies on COVID-19 identified unequal patterns in hospitalization and mortality in urban environments for racial and ethnic minorities. These studies were primarily single center observational studies conducted within the first few weeks or months of the pandemic. We sought to examine trends in COVID-19 morbidity, hospitalization, and mortality over time for minority and rural populations, especially during the U.S. fall surge. METHODS: Data were extracted from a statewide cohort of all adult residents in Indiana tested for SARS-CoV-2 infection between March 1 and December 31, 2020, linked to electronic health records. Primary measures were per capita rates of infection, hospitalization, and death. Age adjusted rates were calculated for multiple time periods corresponding to public health mitigation efforts. Comparisons across time within groups were compared using ANOVA. RESULTS: Morbidity and mortality increased over time with notable differences among sub-populations. Initially, hospitalization rates among racial minorities were 3-4 times higher than whites, and mortality rates among urban residents were twice those of rural residents. By fall 2020, hospitalization and mortality rates in rural areas surpassed those of urban areas, and gaps between black/brown and white populations narrowed. Changes across time among demographic groups was significant for morbidity and hospitalization. Cumulative morbidity and mortality were highest among minority groups and in rural communities. CONCLUSIONS: The synchronicity of disparities in COVID-19 by race and geography suggests that health officials should explicitly measure disparities and adjust mitigation as well as vaccination strategies to protect those sub-populations with greater disease burden.