RESUMEN
Understanding population health disparities is an essential component of equitable precision health efforts. Epidemiology research often relies on definitions of race and ethnicity, but these population labels may not adequately capture disease burdens and environmental factors impacting specific sub-populations. Here, we propose a framework for repurposing data from electronic health records (EHRs) in concert with genomic data to explore the demographic ties that can impact disease burdens. Using data from a diverse biobank in New York City, we identified 17 communities sharing recent genetic ancestry. We observed 1,177 health outcomes that were statistically associated with a specific group and demonstrated significant differences in the segregation of genetic variants contributing to Mendelian diseases. We also demonstrated that fine-scale population structure can impact the prediction of complex disease risk within groups. This work reinforces the utility of linking genomic data to EHRs and provides a framework toward fine-scale monitoring of population health.
Asunto(s)
Etnicidad/genética , Salud Poblacional , Bases de Datos Genéticas , Registros Electrónicos de Salud , Genómica , Humanos , AutoinformeRESUMEN
Personalized medicine has largely been enabled by the integration of genomic and other data with electronic health records (EHRs) in the United States and elsewhere. Increased EHR adoption across various clinical settings and the establishment of EHR-linked population-based biobanks provide unprecedented opportunities for the types of translational and implementation research that drive personalized medicine. We review advances in the digitization of health information and the proliferation of genomic research in health systems and provide insights into emerging paths for the widespread implementation of personalized medicine.
Asunto(s)
Registros Electrónicos de Salud/tendencias , Medicina de Precisión/métodos , Medicina de Precisión/tendencias , Pruebas Genéticas , Genoma Humano/genética , Genómica/métodos , Genómica/tendencias , Humanos , Estados UnidosRESUMEN
Heritability is essential for understanding the biological causes of disease but requires laborious patient recruitment and phenotype ascertainment. Electronic health records (EHRs) passively capture a wide range of clinically relevant data and provide a resource for studying the heritability of traits that are not typically accessible. EHRs contain next-of-kin information collected via patient emergency contact forms, but until now, these data have gone unused in research. We mined emergency contact data at three academic medical centers and identified 7.4 million familial relationships while maintaining patient privacy. Identified relationships were consistent with genetically derived relatedness. We used EHR data to compute heritability estimates for 500 disease phenotypes. Overall, estimates were consistent with the literature and between sites. Inconsistencies were indicative of limitations and opportunities unique to EHR research. These analyses provide a validation of the use of EHRs for genetics and disease research.
Asunto(s)
Registros Electrónicos de Salud , Enfermedades Genéticas Congénitas/genética , Algoritmos , Bases de Datos Factuales , Relaciones Familiares , Enfermedades Genéticas Congénitas/patología , Genotipo , Humanos , Linaje , Fenotipo , Carácter Cuantitativo HeredableRESUMEN
Polygenic risk scores (PRSs) summarize the genetic predisposition of a complex human trait or disease and may become a valuable tool for advancing precision medicine. However, PRSs that are developed in populations of predominantly European genetic ancestries can increase health disparities due to poor predictive performance in individuals of diverse and complex genetic ancestries. We describe genetic and modifiable risk factors that limit the transferability of PRSs across populations and review the strengths and weaknesses of existing PRS construction methods for diverse ancestries. Developing PRSs that benefit global populations in research and clinical settings provides an opportunity for innovation and is essential for health equity.
Asunto(s)
Predisposición Genética a la Enfermedad , Humanos , Factores de Riesgo , Herencia Multifactorial , Medicina de Precisión , Estudio de Asociación del Genoma CompletoRESUMEN
Mutations in a diverse set of driver genes increase the fitness of haematopoietic stem cells (HSCs), leading to clonal haematopoiesis1. These lesions are precursors for blood cancers2-6, but the basis of their fitness advantage remains largely unknown, partly owing to a paucity of large cohorts in which the clonal expansion rate has been assessed by longitudinal sampling. Here, to circumvent this limitation, we developed a method to infer the expansion rate from data from a single time point. We applied this method to 5,071 people with clonal haematopoiesis. A genome-wide association study revealed that a common inherited polymorphism in the TCL1A promoter was associated with a slower expansion rate in clonal haematopoiesis overall, but the effect varied by driver gene. Those carrying this protective allele exhibited markedly reduced growth rates or prevalence of clones with driver mutations in TET2, ASXL1, SF3B1 and SRSF2, but this effect was not seen in clones with driver mutations in DNMT3A. TCL1A was not expressed in normal or DNMT3A-mutated HSCs, but the introduction of mutations in TET2 or ASXL1 led to the expression of TCL1A protein and the expansion of HSCs in vitro. The protective allele restricted TCL1A expression and expansion of mutant HSCs, as did experimental knockdown of TCL1A expression. Forced expression of TCL1A promoted the expansion of human HSCs in vitro and mouse HSCs in vivo. Our results indicate that the fitness advantage of several commonly mutated driver genes in clonal haematopoiesis may be mediated by TCL1A activation.
Asunto(s)
Hematopoyesis Clonal , Células Madre Hematopoyéticas , Animales , Humanos , Ratones , Alelos , Hematopoyesis Clonal/genética , Estudio de Asociación del Genoma Completo , Hematopoyesis/genética , Células Madre Hematopoyéticas/citología , Células Madre Hematopoyéticas/metabolismo , Mutación , Regiones Promotoras GenéticasRESUMEN
The human reference genome is the most widely used resource in human genetics and is due for a major update. Its current structure is a linear composite of merged haplotypes from more than 20 people, with a single individual comprising most of the sequence. It contains biases and errors within a framework that does not represent global human genomic variation. A high-quality reference with global representation of common variants, including single-nucleotide variants, structural variants and functional elements, is needed. The Human Pangenome Reference Consortium aims to create a more sophisticated and complete human reference genome with a graph-based, telomere-to-telomere representation of global genomic diversity. Here we leverage innovations in technology, study design and global partnerships with the goal of constructing the highest-possible quality human pangenome reference. Our goal is to improve data representation and streamline analyses to enable routine assembly of complete diploid genomes. With attention to ethical frameworks, the human pangenome reference will contain a more accurate and diverse representation of global genomic variation, improve gene-disease association studies across populations, expand the scope of genomics research to the most repetitive and polymorphic regions of the genome, and serve as the ultimate genetic resource for future biomedical research and precision medicine.
Asunto(s)
Genoma Humano , Genómica , Genoma Humano/genética , Haplotipos/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Análisis de Secuencia de ADNRESUMEN
The Human Genome Project was an enormous accomplishment, providing a foundation for countless explorations into the genetics and genomics of the human species. Yet for many years, the human genome reference sequence remained incomplete and lacked representation of human genetic diversity. Recently, two major advances have emerged to address these shortcomings: complete gap-free human genome sequences, such as the one developed by the Telomere-to-Telomere Consortium, and high-quality pangenomes, such as the one developed by the Human Pangenome Reference Consortium. Facilitated by advances in long-read DNA sequencing and genome assembly algorithms, complete human genome sequences resolve regions that have been historically difficult to sequence, including centromeres, telomeres, and segmental duplications. In parallel, pangenomes capture the extensive genetic diversity across populations worldwide. Together, these advances usher in a new era of genomics research, enhancing the accuracy of genomic analysis, paving the path for precision medicine, and contributing to deeper insights into human biology.
Asunto(s)
Genoma Humano , Proyecto Genoma Humano , Humanos , Variación Genética , Genómica/métodos , Análisis de Secuencia de ADN/métodos , Telómero/genéticaRESUMEN
The differential performance of polygenic risk scores (PRSs) by group is one of the major ethical barriers to their clinical use. It is also one of the main practical challenges for any implementation effort. The social repercussions of how people are grouped in PRS research must be considered in communications with research participants, including return of results. Here, we outline the decisions faced and choices made by a large multi-site clinical implementation study returning PRSs to diverse participants in handling this issue of differential performance. Our approach to managing the complexities associated with the differential performance of PRSs serves as a case study that can help future implementers of PRSs to plot an anticipatory course in response to this issue.
Asunto(s)
Predisposición Genética a la Enfermedad , Herencia Multifactorial , Humanos , Herencia Multifactorial/genética , Factores de Riesgo , Estudio de Asociación del Genoma Completo , Medición de Riesgo , Pruebas Genéticas/métodos , Puntuación de Riesgo GenéticoRESUMEN
The heritability explained by local ancestry markers in an admixed population (hγ2) provides crucial insight into the genetic architecture of a complex disease or trait. Estimation of hγ2 can be susceptible to biases due to population structure in ancestral populations. Here, we present heritability estimation from admixture mapping summary statistics (HAMSTA), an approach that uses summary statistics from admixture mapping to infer heritability explained by local ancestry while adjusting for biases due to ancestral stratification. Through extensive simulations, we demonstrate that HAMSTA hγ2 estimates are approximately unbiased and are robust to ancestral stratification compared to existing approaches. In the presence of ancestral stratification, we show a HAMSTA-derived sampling scheme provides a calibrated family-wise error rate (FWER) of â¼5% for admixture mapping, unlike existing FWER estimation approaches. We apply HAMSTA to 20 quantitative phenotypes of up to 15,988 self-reported African American individuals in the Population Architecture using Genomics and Epidemiology (PAGE) study. We observe hËγ2 in the 20 phenotypes range from 0.0025 to 0.033 (mean hËγ2 = 0.012 ± 9.2 × 10-4), which translates to hË2 ranging from 0.062 to 0.85 (mean hË2 = 0.30 ± 0.023). Across these phenotypes we find little evidence of inflation due to ancestral population stratification in current admixture mapping studies (mean inflation factor of 0.99 ± 0.001). Overall, HAMSTA provides a fast and powerful approach to estimate genome-wide heritability and evaluate biases in test statistics of admixture mapping studies.
Asunto(s)
Negro o Afroamericano , Genética de Población , Humanos , Mapeo Cromosómico , Fenotipo , Polimorfismo de Nucleótido Simple/genéticaRESUMEN
Digital solutions are needed to support rapid increases in the application of genetic/genomic tests (GTs) in diverse clinical settings and patient populations. We developed GUÍA, a bilingual digital application that facilitates disclosure of GT results. The NYCKidSeq randomized controlled trial enrolled diverse children with neurologic, cardiac, and immunologic conditions who underwent GTs. The trial evaluated GUÍA's impact on understanding the GT results by randomizing families to results disclosure genetic counseling with GUÍA (intervention) or standard of care (SOC). Parents/legal guardians (participants) completed surveys at baseline, post-results disclosure, and 6 months later. Survey measures assessed the primary study outcomes of participants' perceived understanding of and confidence in explaining their child's GT results and the secondary outcome of objective understanding. The analysis included 551 diverse participants, 270 in the GUÍA arm and 281 in SOC. Participants in the GUÍA arm had significantly higher perceived understanding post-results (OR = 2.8, CI[1.004, 7.617], p = 0.049) and maintained higher objective understanding over time (OR = 1.1, CI[1.004, 1.127], p = 0.038) compared to SOC. There was no impact on perceived confidence. Hispanic/Latino(a) individuals in the GUÍA arm maintained higher perceived understanding (OR = 3.9, CI[1.603, 9.254], p = 0.003), confidence (OR = 2.7, CI[1.021, 7.277], p = 0.046), and objective understanding (OR = 1.1, CI[1.009, 1.212], p = 0.032) compared to SOC. This trial demonstrates that GUÍA positively impacts understanding of GT results in diverse parents of children with suspected genetic conditions and builds a case for utilizing GUÍA to deliver complex results. Continued development and evaluation of digital applications in diverse populations are critical for equitably scaling GT offerings in specialty clinics.
Asunto(s)
Revelación , Asesoramiento Genético , Niño , Humanos , Pruebas Genéticas , Padres , GenómicaRESUMEN
A primary goal of human genetics is to identify DNA sequence variants that influence biomedical traits, particularly those related to the onset and progression of human disease. Over the past 25 years, progress in realizing this objective has been transformed by advances in technology, foundational genomic resources and analytical tools, and by access to vast amounts of genotype and phenotype data. Genetic discoveries have substantially improved our understanding of the mechanisms responsible for many rare and common diseases and driven development of novel preventative and therapeutic strategies. Medical innovation will increasingly focus on delivering care tailored to individual patterns of genetic predisposition.
Asunto(s)
Variación Genética , Animales , Pruebas Genéticas , Genómica , Genotipo , Humanos , Fenotipo , Enfermedades Raras/genéticaRESUMEN
On average, Peruvian individuals are among the shortest in the world1. Here we show that Native American ancestry is associated with reduced height in an ethnically diverse group of Peruvian individuals, and identify a population-specific, missense variant in the FBN1 gene (E1297G) that is significantly associated with lower height. Each copy of the minor allele (frequency of 4.7%) reduces height by 2.2 cm (4.4 cm in homozygous individuals). To our knowledge, this is the largest effect size known for a common height-associated variant. FBN1 encodes the extracellular matrix protein fibrillin 1, which is a major structural component of microfibrils. We observed less densely packed fibrillin-1-rich microfibrils with irregular edges in the skin of individuals who were homozygous for G1297 compared with individuals who were homozygous for E1297. Moreover, we show that the E1297G locus is under positive selection in non-African populations, and that the E1297 variant shows subtle evidence of positive selection specifically within the Peruvian population. This variant is also significantly more frequent in coastal Peruvian populations than in populations from the Andes or the Amazon, which suggests that short stature might be the result of adaptation to factors that are associated with the coastal environment in Peru.
Asunto(s)
Estatura/genética , Fibrilina-1/genética , Mutación Missense , Selección Genética , Femenino , Frecuencia de los Genes , Estudio de Asociación del Genoma Completo , Herencia , Humanos , Indígenas Sudamericanos/genética , Masculino , Microfibrillas/química , Microfibrillas/genética , PerúRESUMEN
Understanding the genetic basis of human diseases and traits is dependent on the identification and accurate genotyping of genetic variants. Deep whole-genome sequencing (WGS), the gold standard technology for SNP and indel identification and genotyping, remains very expensive for most large studies. Here, we quantify the extent to which array genotyping followed by genotype imputation can approximate WGS in studies of individuals of African, Hispanic/Latino, and European ancestry in the US and of Finnish ancestry in Finland (a population isolate). For each study, we performed genotype imputation by using the genetic variants present on the Illumina Core, OmniExpress, MEGA, and Omni 2.5M arrays with the 1000G, HRC, and TOPMed imputation reference panels. Using the Omni 2.5M array and the TOPMed panel, ≥90% of bi-allelic single-nucleotide variants (SNVs) are well imputed (r2 > 0.8) down to minor-allele frequencies (MAFs) of 0.14% in African, 0.11% in Hispanic/Latino, 0.35% in European, and 0.85% in Finnish ancestries. There was little difference in TOPMed-based imputation quality among the arrays with >700k variants. Individual-level imputation quality varied widely between and within the three US studies. Imputation quality also varied across genomic regions, producing regions where even common (MAF > 5%) variants were consistently not well imputed across ancestries. The extent to which array genotyping and imputation can approximate WGS therefore depends on reference panel, genotype array, sample ancestry, and genomic location. Imputation quality by variant or genomic region can be queried with our new tool, RsqBrowser, now deployed on the Michigan Imputation Server.
Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento , Polimorfismo de Nucleótido Simple , Frecuencia de los Genes/genética , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Polimorfismo de Nucleótido Simple/genética , Secuenciación Completa del GenomaRESUMEN
Current publicly available tools that allow rapid exploration of linkage disequilibrium (LD) between markers (e.g., HaploReg and LDlink) are based on whole-genome sequence (WGS) data from 2,504 individuals in the 1000 Genomes Project. Here, we present TOP-LD, an online tool to explore LD inferred with high-coverage (â¼30×) WGS data from 15,578 individuals in the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. TOP-LD provides a significant upgrade compared to current LD tools, as the TOPMed WGS data provide a more comprehensive representation of genetic variation than the 1000 Genomes data, particularly for rare variants and in the specific populations that we analyzed. For example, TOP-LD encompasses LD information for 150.3, 62.2, and 36.7 million variants for European, African, and East Asian ancestral samples, respectively, offering 2.6- to 9.1-fold increase in variant coverage compared to HaploReg 4.0 or LDlink. In addition, TOP-LD includes tens of thousands of structural variants (SVs). We demonstrate the value of TOP-LD in fine-mapping at the GGT1 locus associated with gamma glutamyltransferase in the African ancestry participants in UK Biobank. Beyond fine-mapping, TOP-LD can facilitate a wide range of applications that are based on summary statistics and estimates of LD. TOP-LD is freely available online.
Asunto(s)
Estudio de Asociación del Genoma Completo , Medicina de Precisión , Pueblo Asiatico , Humanos , Desequilibrio de Ligamiento/genética , Polimorfismo de Nucleótido Simple/genética , Secuenciación Completa del GenomaRESUMEN
One mechanism by which genetic factors influence complex traits and diseases is altering gene expression. Direct measurement of gene expression in relevant tissues is rarely tenable; however, genetically regulated gene expression (GReX) can be estimated using prediction models derived from large multi-omic datasets. These approaches have led to the discovery of many gene-trait associations, but whether models derived from predominantly European ancestry (EA) reference panels can map novel associations in ancestrally diverse populations remains unclear. We applied PrediXcan to impute GReX in 51,520 ancestrally diverse Population Architecture using Genomics and Epidemiology (PAGE) participants (35% African American, 45% Hispanic/Latino, 10% Asian, and 7% Hawaiian) across 25 key cardiometabolic traits and relevant tissues to identify 102 novel associations. We then compared associations in PAGE to those in a random subset of 50,000 White British participants from UK Biobank (UKBB50k) for height and body mass index (BMI). We identified 517 associations across 47 tissues in PAGE but not UKBB50k, demonstrating the importance of diverse samples in identifying trait-associated GReX. We observed that variants used in PrediXcan models were either more or less differentiated across continental-level populations than matched-control variants depending on the specific population reflecting sampling bias. Additionally, variants from identified genes specific to either PAGE or UKBB50k analyses were more ancestrally differentiated than those in genes detected in both analyses, underlining the value of population-specific discoveries. This suggests that while EA-derived transcriptome imputation models can identify new associations in non-EA populations, models derived from closely matched reference panels may yield further insights. Our findings call for more diversity in reference datasets of tissue-specific gene expression.
Asunto(s)
Enfermedades Cardiovasculares , Estudio de Asociación del Genoma Completo , Predisposición Genética a la Enfermedad , Humanos , Estilo de Vida , Polimorfismo de Nucleótido Simple , TranscriptomaRESUMEN
The integration of genomic data into health systems offers opportunities to identify genomic factors underlying the continuum of rare and common disease. We applied a population-scale haplotype association approach based on identity-by-descent (IBD) in a large multi-ethnic biobank to a spectrum of disease outcomes derived from electronic health records (EHRs) and uncovered a risk locus for liver disease. We used genome sequencing and in silico approaches to fine-map the signal to a non-coding variant (c.2784-12T>C) in the gene ABCB4. In vitro analysis confirmed the variant disrupted splicing of the ABCB4 pre-mRNA. Four of five homozygotes had evidence of advanced liver disease, and there was a significant association with liver disease among heterozygotes, suggesting the variant is linked to increased risk of liver disease in an allele dose-dependent manner. Population-level screening revealed the variant to be at a carrier rate of 1.95% in Puerto Rican individuals, likely as the result of a Puerto Rican founder effect. This work demonstrates that integrating EHR and genomic data at a population scale can facilitate strategies for understanding the continuum of genomic risk for common diseases, particularly in populations underrepresented in genomic medicine.
Asunto(s)
Atención a la Salud/organización & administración , Predisposición Genética a la Enfermedad , Hepatopatías/genética , Subfamilia B de Transportador de Casetes de Unión a ATP/genética , Registros Electrónicos de Salud , Haplotipos , Heterocigoto , Hispánicos o Latinos/genética , Homocigoto , Humanos , Puerto RicoRESUMEN
PURPOSE: To better understand the effects of returning diagnostic sequencing results on clinical actions and economic outcomes for pediatric patients with suspected genetic disorders. METHODS: Longitudinal physician claims data after diagnostic sequencing were obtained for patients aged 0 to 21 years with neurologic, cardiac, and immunologic disorders with suspected genetic etiology. We assessed specialist consultation rates prompted by primary diagnostic results, as well as marginal effects on overall 18-month physician services and costs. RESULTS: We included data on 857 patients (median age: 9.6 years) with a median follow-up of 17.3 months after disclosure of diagnostic sequencing results. The likelihood of having ≥1 recommendation for specialist consultation in 155 patients with positive findings was high (72%) vs 23% in 443 patients with uncertain findings and 21% in 259 patients with negative findings (P < .001). Follow-through consultation occurred in 30%. Increases in 18-month physician services and costs following a positive finding diminished after multivariable adjustment. Also, no significant differences between those with uncertain and negative findings were demonstrated. CONCLUSION: Our study did not provide evidence for significant increases in downstream physician services and costs after returning positive or uncertain diagnostic sequencing findings. More large-scale longitudinal studies are needed to confirm these findings.
Asunto(s)
Revelación , Médicos , Humanos , Niño , Costos y Análisis de CostoRESUMEN
PURPOSE: To examine associations between Pediatric Quality of Life Inventory (PedsQL) 4.0 Generic Core Scales and PedsQL Infant Scales with formal health care resource utilization (HCRU) and informal caregiver burden. METHODS: We studied a pediatric cohort of 837 patients (median age: 8.4 years) with suspected genetic disorders enrolled January 2019 through July 2021 in the NYCKidSeq program for diagnostic sequencing. Using linked ~ nine-month longitudinal survey and physician claims data collected through May 2022, we modeled the association between baseline PedsQL scores and post-baseline HCRU (median follow-up: 21.1 months) and informal care. We also assessed the longitudinal change in PedsQL scores with physician services using linear mixed-effects models. RESULTS: Lower PedsQL total and physical health scores were independently associated with increases in 18-month physician services, encounters, and weekly informal care. Comparing low vs. median total scores, increases were 10.6 services (95% CI: 1.0-24.6), 3.3 encounters (95% CI: 0.5-6.8), and $668 (95% CI: $350-965), respectively. For the psychosocial domain, higher scores were associated with decreased informal care. Based on adjusted linear mixed-effects modeling, every additional ten physician services was associated with diminished improvement in longitudinal PedsQL total score trajectories by 1.1 point (95% confidence interval: 0.6-1.6) on average. Similar trends were observed in the physical and psychosocial domains. CONCLUSION: PedsQL scores were independently associated with higher utilization of physician services and informal care. Moreover, longitudinal trajectories of PedsQL scores became less favorable with increased physician services. Adding PedsQL survey instruments to conventional measures for improved risk stratification should be evaluated in further research.
The Pediatric Quality of Life Inventory (PedsQL) is widely used to measure health-related quality of life in pediatric patients; however, few studies have examined whether the PedsQL is indicative of longitudinal outcomes of morbidity and health care needs. This study captures associations between PedsQL scores with utilization of physician and informal care in children with suspected genetic disorders. We demonstrate that lower PedsQL total and physical health scores are independently associated with greater utilization of physician services and informal care. Moreover, longitudinal trajectories of PedsQL scores become less favorable with increased physician services. Results can inform future applications of PedsQL instruments.
Asunto(s)
Calidad de Vida , Humanos , Masculino , Femenino , Niño , Preescolar , Adolescente , Enfermedades Genéticas Congénitas/psicología , Encuestas y Cuestionarios , Estudios Longitudinales , Cuidadores/psicología , Lactante , Atención al Paciente , Aceptación de la Atención de Salud/estadística & datos numéricos , Aceptación de la Atención de Salud/psicología , Médicos/psicología , Médicos/estadística & datos numéricosRESUMEN
SIGNIFICANCE STATEMENT: Pathogenic structural genetic variants, also known as genomic disorders, have been associated with pediatric CKD. This study extends those results across the lifespan, with genomic disorders enriched in both pediatric and adult patients compared with controls. In the Chronic Renal Insufficiency Cohort study, genomic disorders were also associated with lower serum Mg, lower educational performance, and a higher risk of death. A phenome-wide association study confirmed the link between kidney disease and genomic disorders in an unbiased way. Systematic detection of genomic disorders can provide a molecular diagnosis and refine prediction of risk and prognosis. BACKGROUND: Genomic disorders (GDs) are associated with many comorbid outcomes, including CKD. Identification of GDs has diagnostic utility. METHODS: We examined the prevalence of GDs among participants in the Chronic Kidney Disease in Children (CKiD) cohort II ( n =248), Chronic Renal Insufficiency Cohort (CRIC) study ( n =3375), Columbia University CKD Biobank (CU-CKD; n =1986), and the Family Investigation of Nephropathy and Diabetes (FIND; n =1318) compared with 30,746 controls. We also performed a phenome-wide association analysis (PheWAS) of GDs in the electronic MEdical Records and GEnomics (eMERGE; n =11,146) cohort. RESULTS: We found nine out of 248 (3.6%) CKiD II participants carried a GD, replicating prior findings in pediatric CKD. We also identified GDs in 72 out of 6679 (1.1%) adult patients with CKD in the CRIC, CU-CKD, and FIND cohorts, compared with 199 out of 30,746 (0.65%) GDs in controls (OR, 1.7; 95% CI, 1.3 to 2.2). Among adults with CKD, we found recurrent GDs at the 1q21.1, 16p11.2, 17q12, and 22q11.2 loci. The 17q12 GD (diagnostic of renal cyst and diabetes syndrome) was most frequent, present in 1:252 patients with CKD and diabetes. In the PheWAS, dialysis and neuropsychiatric phenotypes were the top associations with GDs. In CRIC participants, GDs were associated with lower serum magnesium, lower educational achievement, and higher mortality risk. CONCLUSION: Undiagnosed GDs are detected both in children and adults with CKD. Identification of GDs in these patients can enable a precise genetic diagnosis, inform prognosis, and help stratify risk in clinical studies. GDs could also provide a molecular explanation for nephropathy and comorbidities, such as poorer neurocognition for a subset of patients.
Asunto(s)
Longevidad , Insuficiencia Renal Crónica , Humanos , Estudios de Cohortes , Estudios Prospectivos , Insuficiencia Renal Crónica/epidemiología , Insuficiencia Renal Crónica/genética , Insuficiencia Renal Crónica/complicaciones , Genómica , Progresión de la Enfermedad , Factores de RiesgoRESUMEN
Inadequate representation of non-European ancestry populations in genome-wide association studies (GWAS) has limited opportunities to isolate functional variants. Fine-mapping in multi-ancestry populations should improve the efficiency of prioritizing variants for functional interrogation. To evaluate this hypothesis, we leveraged ancestry architecture to perform comparative GWAS and fine-mapping of obesity-related phenotypes in European ancestry populations from the UK Biobank (UKBB) and multi-ancestry samples from the Population Architecture for Genetic Epidemiology (PAGE) consortium with comparable sample sizes. In the investigated regions with genome-wide significant associations for obesity-related traits, fine-mapping in our ancestrally diverse sample led to 95% and 99% credible sets (CS) with fewer variants than in the European ancestry sample. Lead fine-mapped variants in PAGE regions had higher average coding scores, and higher average posterior probabilities for causality compared to UKBB. Importantly, 99% CS in PAGE loci contained strong expression quantitative trait loci (eQTLs) in adipose tissues or harbored more variants in tighter linkage disequilibrium (LD) with eQTLs. Leveraging ancestrally diverse populations with heterogeneous ancestry architectures, coupled with functional annotation, increased fine-mapping efficiency and performance, and reduced the set of candidate variants for consideration for future functional studies. Significant overlap in genetic causal variants across populations suggests generalizability of genetic mechanisms underpinning obesity-related traits across populations.