RESUMO
INTRODUCTION: Hidradenitis suppurativa (HS) is a prevalent and persistent inflammatory skin disorder, lacking a known cure or effective biomarkers for early diagnosis at present. The genetic determinants of HS have not been fully documented, but it is believed to result from a combination of genetic and environmental factors. METHODS: To identify relevant HS gene variants in sporadic HS patients, this study utilized longitudinal electronic health records (EHRs) and whole-exome sequencing. DNA exome sequencing data from 92,455 participant samples in the MyCode biobank, linked to Geisinger's EHR, were analyzed. This cohort included 1,092 HS cases and 91,363 healthy controls. The MyCode EHR has a median longitudinal follow-up of 15 years per participant, with an average of 87 clinical encounters, 687 laboratory tests, and 7 procedures. RESULTS: There were 1,092 (901 females and 191 males) participants aged 14-89 years (median 47 years) with HS (L73.2), indicating a 1.18% prevalence and accounting for a 4.7:1 female-to-male ratio among the individuals presenting for clinical care. γ-secretase complex, syndromic, and autoinflammatory gene variants were assessed. Potential pathogenic variants were identified among 66 individuals in the HS genes studied. Molecularly, the estimated HS variant prevalence was 1:1,400 in the cohort, 12.3% of variant carriers had HS diagnosis in EHR. CONCLUSIONS: Using longitudinal EHR data, genomic screening identified HS-associated gene variants in a defined group of sporadic HS patients to augment the clinical diagnosis, particularly in cases of ambiguity. Based on this study, the field of skin disorders can benefit from a personalized approach to HS diagnosis using large-scale sequencing.
RESUMO
SIGNIFICANCE STATEMENT: Pathogenic structural genetic variants, also known as genomic disorders, have been associated with pediatric CKD. This study extends those results across the lifespan, with genomic disorders enriched in both pediatric and adult patients compared with controls. In the Chronic Renal Insufficiency Cohort study, genomic disorders were also associated with lower serum Mg, lower educational performance, and a higher risk of death. A phenome-wide association study confirmed the link between kidney disease and genomic disorders in an unbiased way. Systematic detection of genomic disorders can provide a molecular diagnosis and refine prediction of risk and prognosis. BACKGROUND: Genomic disorders (GDs) are associated with many comorbid outcomes, including CKD. Identification of GDs has diagnostic utility. METHODS: We examined the prevalence of GDs among participants in the Chronic Kidney Disease in Children (CKiD) cohort II ( n =248), Chronic Renal Insufficiency Cohort (CRIC) study ( n =3375), Columbia University CKD Biobank (CU-CKD; n =1986), and the Family Investigation of Nephropathy and Diabetes (FIND; n =1318) compared with 30,746 controls. We also performed a phenome-wide association analysis (PheWAS) of GDs in the electronic MEdical Records and GEnomics (eMERGE; n =11,146) cohort. RESULTS: We found nine out of 248 (3.6%) CKiD II participants carried a GD, replicating prior findings in pediatric CKD. We also identified GDs in 72 out of 6679 (1.1%) adult patients with CKD in the CRIC, CU-CKD, and FIND cohorts, compared with 199 out of 30,746 (0.65%) GDs in controls (OR, 1.7; 95% CI, 1.3 to 2.2). Among adults with CKD, we found recurrent GDs at the 1q21.1, 16p11.2, 17q12, and 22q11.2 loci. The 17q12 GD (diagnostic of renal cyst and diabetes syndrome) was most frequent, present in 1:252 patients with CKD and diabetes. In the PheWAS, dialysis and neuropsychiatric phenotypes were the top associations with GDs. In CRIC participants, GDs were associated with lower serum magnesium, lower educational achievement, and higher mortality risk. CONCLUSION: Undiagnosed GDs are detected both in children and adults with CKD. Identification of GDs in these patients can enable a precise genetic diagnosis, inform prognosis, and help stratify risk in clinical studies. GDs could also provide a molecular explanation for nephropathy and comorbidities, such as poorer neurocognition for a subset of patients.
Assuntos
Longevidade , Insuficiência Renal Crônica , Humanos , Estudos de Coortes , Estudos Prospectivos , Insuficiência Renal Crônica/epidemiologia , Insuficiência Renal Crônica/genética , Insuficiência Renal Crônica/complicações , Genômica , Progressão da Doença , Fatores de RiscoRESUMO
The calcium-sensing receptor (CaSR) regulates serum calcium concentrations. CASR loss- or gain-of-function mutations cause familial hypocalciuric hypercalcemia type 1 (FHH1) or autosomal-dominant hypocalcemia type 1 (ADH1), respectively, but the population prevalence of FHH1 or ADH1 is unknown. Rare CASR variants were identified in whole-exome sequences from 51,289 de-identified individuals in the DiscovEHR cohort derived from a single US healthcare system. We integrated bioinformatics pathogenicity triage, mean serum Ca concentrations, and mode of inheritance to identify potential FHH1 or ADH1 variants, and we used a Sequence Kernel Association Test (SKAT) to identify rare variant-associated diseases. We identified predicted heterozygous loss-of-function CASR variants (6 different nonsense/frameshift variants and 12 different missense variants) in 38 unrelated individuals, 21 of whom were hypercalcemic. Missense CASR variants were identified in two unrelated hypocalcemic individuals. Functional studies showed that all hypercalcemia-associated missense variants impaired heterologous expression, plasma membrane targeting, and/or signaling, whereas hypocalcemia-associated missense variants increased expression, plasma membrane targeting, and/or signaling. Thus, 38 individuals with a genetic diagnosis of FHH1 and two individuals with a genetic diagnosis of ADH1 were identified in the 51,289 cohort, giving a prevalence in this population of 74.1 per 100,000 for FHH1 and 3.9 per 100,000 for ADH1. SKAT combining all nonsense, frameshift, and missense loss-of-function variants revealed associations with cardiovascular, neurological, and other diseases. In conclusion, FHH1 is a common cause of hypercalcemia, with prevalence similar to that of primary hyperparathyroidism, and is associated with altered disease risks, whereas ADH1 is a major cause of non-surgical hypoparathyroidism.
Assuntos
Atenção à Saúde/estatística & dados numéricos , Hipercalcemia/congênito , Adulto , Idoso , Idoso de 80 Anos ou mais , Cálcio/sangue , Estudos de Coortes , Feminino , Genes Dominantes/genética , Heterozigoto , Humanos , Hipercalcemia/genética , Masculino , Pessoa de Meia-Idade , Mutação , Fenótipo , Prevalência , Receptores de Detecção de Cálcio/genética , Estados UnidosRESUMO
The pace of deorphanization of G protein-coupled receptors (GPCRs) has slowed, and new approaches are required. Small molecule targeting of orphan GPCRs can potentially be of clinical benefit even if the endogenous receptor ligand has not been identified. Many GPCRs lack common variants that lead to reproducible genome-wide disease associations, and rare-variant approaches have emerged as a viable alternative to identify disease associations for such genes. Therefore, our goal was to prioritize orphan GPCRs by determining their associations with human diseases in a large clinical population. We used sequence kernel association tests to assess the disease associations of 85 orphan or understudied GPCRs in an unselected cohort of 51,289 individuals. Using rare loss-of-function variants, missense variants predicted to be pathogenic or likely pathogenic, and a subset of rare synonymous variants that cause large changes in local codon bias as independent data sets, we found strong, phenome-wide disease associations shared by two or more variant categories for 39% of the GPCRs. To validate the bioinformatics and sequence kernel association test analyses, we functionally characterized rare missense and synonymous variants of GPR39, a family A GPCR, revealing altered expression or Zn2+-mediated signaling for members of both variant classes. These results support the utility of rare variant analyses for identifying disease associations for GPCRs that lack impactful common variants. We highlight the importance of rare synonymous variants in human physiology and argue for their routine inclusion in any comprehensive analysis of genomic variants as potential causes of disease.
Assuntos
Receptores Acoplados a Proteínas G/genética , Transdução de Sinais/genética , Mutação Silenciosa , Estudo de Associação Genômica Ampla , HumanosRESUMO
Importance: Electronic health records are a potentially valuable source of information for identifying patients with opioid use disorder (OUD). Objective: To evaluate whether proxy measures from electronic health record data can be used reliably to identify patients with probable OUD based on Diagnostic and Statistical Manual of Mental Disorders (Fifth Edition) (DSM-5) criteria. Design, Setting, and Participants: This retrospective cross-sectional study analyzed individuals within the Geisinger health system who were prescribed opioids between December 31, 2000, and May 31, 2017, using a mixed-methods approach. The cohort was identified from 16â¯253 patients enrolled in a contract-based, Geisinger-specific medication monitoring program (GMMP) for opioid use, including patients who maintained or violated contract terms, as well as a demographically matched control group of 16â¯253 patients who were prescribed opioids but not enrolled in the GMMP. Substance use diagnoses and psychiatric comorbidities were assessed using automated electronic health record summaries. A manual medical record review procedure using DSM-5 criteria for OUD was completed for a subset of patients. The analysis was conducted beginning from June 5, 2017, until May 29, 2020. Main Outcomes and Measures: The primary outcome was the prevalence of OUD as defined by proxy measures for DSM-5 criteria for OUD as well as the prevalence of comorbidities among patients prescribed opioids within an integrated health system. Results: Among the 16â¯253 patients enrolled in the GMMP (9309 women [57%]; mean [SD] age, 52 [14] years), OUD diagnoses as defined by diagnostic codes were present at a much lower rate than expected (291 [2%]), indicating the necessity for alternative diagnostic strategies. The DSM-5 criteria for OUD can be assessed using manual medical record review; a manual review of 200 patients in the GMMP and 200 control patients identifed a larger percentage of patients with probable moderate to severe OUD (GMMP, 145 of 200 [73%]; and control, 27 of 200 [14%]) compared with the prevalence of OUD assessed using diagnostic codes. Conclusions and Relevance: These results suggest that patients with OUD may be identified using information available in the electronic health record, even when diagnostic codes do not reflect this diagnosis. Furthermore, the study demonstrates the utility of coding for DSM-5 criteria from medical records to generate a quantitative DSM-5 score that is associated with OUD severity.
Assuntos
Documentação/estatística & dados numéricos , Registros Eletrônicos de Saúde/estatística & dados numéricos , Transtornos Relacionados ao Uso de Opioides/diagnóstico , Adulto , Idoso , Estudos Transversais , Documentação/métodos , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Transtornos Relacionados ao Uso de Opioides/fisiopatologia , Prevalência , Estudos RetrospectivosRESUMO
BACKGROUND: Endometrial cancer (EMCA) is the fifth most common cancer among women in the world. Identification of potentially pathogenic germline variants from individuals with EMCA will help characterize genetic features that underlie the disease and potentially predispose individuals to its pathogenesis. METHODS: The Geisinger Health System's (GHS) DiscovEHR cohort includes exome sequencing on over 50,000 consenting patients, 297 of whom have evidence of an EMCA diagnosis in their electronic health record. Here, rare variants were annotated as potentially pathogenic. RESULTS: Eight genes were identified as having increased burden in the EMCA cohort relative to the non-cancer control cohort. None of the eight genes had an increased burden in the other hormone related cancer cohort from GHS, suggesting they can help characterize the underlying genetic variation that gives rise to EMCA. Comparing GHS to the cancer genome atlas (TCGA) EMCA germline data illustrated 34 genes with potentially pathogenic variation and eight unique potentially pathogenic variants that were present in both studies. Thus, similar germline variation among genes can be observed in unique EMCA cohorts and could help prioritize genes to investigate for future work. CONCLUSION: In summary, this systematic characterization of potentially pathogenic germline variants describes the genetic underpinnings of EMCA through the use of data from a single hospital system.
Assuntos
Registros Eletrônicos de Saúde , Neoplasias do Endométrio/genética , Mutação em Linhagem Germinativa , Adulto , Idoso , Idoso de 80 Anos ou mais , Estudos de Coortes , Neoplasias do Endométrio/patologia , Feminino , Humanos , Pessoa de Meia-Idade , Sequenciamento do ExomaRESUMO
Following publication of the original article [1], the authors reported that Fig. 1 was not correctly processed during the production process. The correct Fig. 1 is given below.
RESUMO
A wide range of patient health data is recorded in Electronic Health Records (EHR). This data includes diagnosis, surgical procedures, clinical laboratory measurements, and medication information. Together this information reflects the patient's medical history. Many studies have efficiently used this data from the EHR to find associations that are clinically relevant, either by utilizing International Classification of Diseases, version 9 (ICD-9) codes or laboratory measurements, or by designing phenotype algorithms to extract case and control status with accuracy from the EHR. Here we developed a strategy to utilize longitudinal quantitative trait data from the EHR at Geisinger Health System focusing on outpatient metabolic and complete blood panel data as a starting point. Comprehensive Metabolic Panel (CMP) as well as Complete Blood Counts (CBC) are parts of routine care and provide a comprehensive picture from high level screening of patients' overall health and disease. We randomly split our data into two datasets to allow for discovery and replication. We first conducted a genome-wide association study (GWAS) with median values of 25 different clinical laboratory measurements to identify variants from Human Omni Express Exome beadchip data that are associated with these measurements. We identified 687 variants that associated and replicated with the tested clinical measurements at p<5×10-08. Since longitudinal data from the EHR provides a record of a patient's medical history, we utilized this information to further investigate the ICD-9 codes that might be associated with differences in variability of the measurements in the longitudinal dataset. We identified low and high variance patients by looking at changes within their individual longitudinal EHR laboratory results for each of the 25 clinical lab values (thus creating 50 groups - a high variance and a low variance for each lab variable). We then performed a PheWAS analysis with ICD-9 diagnosis codes, separately in the high variance group and the low variance group for each lab variable. We found 717 PheWAS associations that replicated at a p-value less than 0.001. Next, we evaluated the results of this study by comparing the association results between the high and low variance groups. For example, we found 39 SNPs (in multiple genes) associated with ICD-9 250.01 (Type-I diabetes) in patients with high variance of plasma glucose levels, but not in patients with low variance in plasma glucose levels. Another example is the association of 4 SNPs in UMOD with chronic kidney disease in patients with high variance for aspartate aminotransferase (discovery p-value: 8.71×10-09 and replication p-value: 2.03×10-06). In general, we see a pattern of many more statistically significant associations from patients with high variance in the quantitative lab variables, in comparison with the low variance group across all of the 25 laboratory measurements. This study is one of the first of its kind to utilize quantitative trait variance from longitudinal laboratory data to find associations among genetic variants and clinical phenotypes obtained from an EHR, integrating laboratory values and diagnosis codes to understand the genetic complexities of common diseases.