1.
Am J Hum Genet ; 110(4): 575-591, 2023 04 06.
Article in English | MEDLINE | ID: mdl-37028392

ABSTRACT

Leveraging linkage disequilibrium (LD) patterns as representative of population substructure enables the discovery of additive association signals in genome-wide association studies (GWASs). Standard GWASs are well-powered to interrogate additive models; however, new approaches are required for investigating other modes of inheritance such as dominance and epistasis. Epistasis, or non-additive interaction between genes, exists across the genome but often goes undetected because of a lack of statistical power. Furthermore, the adoption of LD pruning as customary in standard GWASs excludes detection of sites that are in LD but might underlie the genetic architecture of complex traits. We hypothesize that uncovering long-range interactions between loci with strong LD due to epistatic selection can elucidate genetic mechanisms underlying common diseases. To investigate this hypothesis, we tested for associations between 23 common diseases and 5,625,845 epistatic SNP-SNP pairs (determined by Ohta's D statistics) in long-range LD (>0.25 cM). Across five disease phenotypes, we identified one significant and four near-significant associations that replicated in two large genotype-phenotype datasets (UK Biobank and eMERGE). The genes that were most likely involved in the replicated associations were (1) members of highly conserved gene families with complex roles in multiple pathways, (2) essential genes, and/or (3) genes that were associated in the literature with complex traits that display variable expressivity. These results support the highly pleiotropic and conserved nature of variants in long-range LD under epistatic selection. Our work supports the hypothesis that epistatic interactions regulate diverse clinical mechanisms and might especially be driving factors in conditions with a wide range of phenotypic outcomes.


Subject(s)
Epistasis, Genetic; Genome-Wide Association Study; Linkage Disequilibrium/genetics; Genotype; Biological Specimen Banks; United Kingdom; Polymorphism, Single Nucleotide/genetics
2.
Genet Epidemiol ; 46(1): 32-50, 2022 02.
Article in English | MEDLINE | ID: mdl-34664742

ABSTRACT

Statistical methods to integrate multiple layers of data, from exposures to intermediate traits to outcome variables, are needed to guide interpretation of complex data sets for which variables are likely contributing in a causal pathway from exposure to outcome. Statistical mediation analyses based on structural equation models provide a general modeling framework, yet they can be difficult to apply to high-dimensional data and they are not automated to select the best-fitting model. To overcome these limitations, we developed novel algorithms and software to simultaneously evaluate multiple exposure variables, multiple intermediate traits, and multiple outcome variables. Our penalized mediation models are computationally efficient and simulations demonstrate that they produce reliable results for large data sets. Application of our methods to a study of vascular disease demonstrates their utility to identify novel direct effects of single-nucleotide polymorphisms (SNPs) on coronary heart disease and peripheral artery disease, while disentangling the effects of SNPs on the intermediate risk factors including lipids, cigarette smoking, systolic blood pressure, and type 2 diabetes.


Subject(s)
Diabetes Mellitus, Type 2; Algorithms; Diabetes Mellitus, Type 2/genetics; Humans; Models, Genetic; Models, Statistical; Phenotype; Software
3.
Circulation ; 145(12): 877-891, 2022 03 22.
Article in English | MEDLINE | ID: mdl-34930020

ABSTRACT

BACKGROUND: Sequencing Mendelian arrhythmia genes in individuals without an indication for arrhythmia genetic testing can identify carriers of pathogenic or likely pathogenic (P/LP) variants. However, the extent to which these variants are associated with clinically meaningful phenotypes before or after return of variant results is unclear. In addition, the majority of discovered variants are currently classified as variants of uncertain significance, limiting clinical actionability. METHODS: The eMERGE-III study (Electronic Medical Records and Genomics Phase III) is a multicenter prospective cohort that included 21 846 participants without previous indication for cardiac genetic testing. Participants were sequenced for 109 Mendelian disease genes, including 10 linked to arrhythmia syndromes. Variant carriers were assessed with electronic health record-derived phenotypes and follow-up clinical examination. Selected variants of uncertain significance (n=50) were characterized in vitro with automated electrophysiology experiments in HEK293 cells. RESULTS: As previously reported, 3.0% of participants had P/LP variants in the 109 genes. Herein, we report 120 participants (0.6%) with P/LP arrhythmia variants. Compared with noncarriers, arrhythmia P/LP carriers had a significantly higher burden of arrhythmia phenotypes in their electronic health records. Fifty-four participants had variant results returned. Nineteen of these 54 participants had inherited arrhythmia syndrome diagnoses (primarily long-QT syndrome), and 12 of these 19 diagnoses were made only after variant results were returned (0.05%). After in vitro functional evaluation of 50 variants of uncertain significance, we reclassified 11 variants: 3 to likely benign and 8 to P/LP. CONCLUSIONS: Genome sequencing in a large population without indication for arrhythmia genetic testing identified phenotype-positive carriers of variants in congenital arrhythmia syndrome disease genes. 
As the genomes of large numbers of people are sequenced, the disease risk from rare variants in arrhythmia genes can be assessed by integrating genomic screening, electronic health record phenotypes, and in vitro functional studies. REGISTRATION: URL: https://www.clinicaltrials.gov; Unique identifier: NCT03394859.


Subject(s)
Arrhythmias, Cardiac; Genetic Testing; Arrhythmias, Cardiac/diagnosis; Arrhythmias, Cardiac/genetics; Genetic Predisposition to Disease; Genetic Testing/methods; Genomics; HEK293 Cells; Humans; Phenotype; Prospective Studies
4.
Am J Hum Genet ; 106(5): 707-716, 2020 05 07.
Article in English | MEDLINE | ID: mdl-32386537

ABSTRACT

Because polygenic risk scores (PRSs) for coronary heart disease (CHD) are derived from mainly European ancestry (EA) cohorts, their validity in African ancestry (AA) and Hispanic ethnicity (HE) individuals is unclear. We investigated associations of "restricted" and genome-wide PRSs with CHD in three major racial and ethnic groups in the U.S. The eMERGE cohort (mean age 48 ± 14 years, 58% female) included 45,645 EA, 7,597 AA, and 2,493 HE individuals. We assessed two restricted PRSs (PRS_Tikkanen and PRS_Tada; 28 and 50 variants, respectively) and two genome-wide PRSs (PRS_metaGRS and PRS_LDPred; 1.7 M and 6.6 M variants, respectively) derived from EA cohorts. Over a median follow-up of 11.1 years, 2,652 incident CHD events occurred. Hazard and odds ratios for the association of PRSs with CHD were similar in EA and HE cohorts but lower in AA cohorts. Genome-wide PRSs were more strongly associated with CHD than restricted PRSs were. PRS_metaGRS, the best performing PRS, was associated with CHD in all three cohorts; hazard ratios (95% CI) per 1 SD increase were 1.53 (1.46-1.60), 1.53 (1.23-1.90), and 1.27 (1.13-1.43) for incident CHD in EA, HE, and AA individuals, respectively. The hazard ratios were comparable in the EA and HE cohorts (p_interaction = 0.77) but were significantly attenuated in AA individuals (p_interaction = 2.9 × 10^-3). These results highlight the potential clinical utility of PRSs for CHD as well as the need to assemble diverse cohorts to generate ancestry- and ethnicity-specific PRSs.


Subject(s)
Black or African American/genetics; Coronary Disease/genetics; Genetic Predisposition to Disease; Hispanic or Latino/genetics; Multifactorial Inheritance/genetics; White People/genetics; Cohort Studies; Female; Humans; Male; Middle Aged; Odds Ratio
5.
Genet Med ; 25(4): 100006, 2023 04.
Article in English | MEDLINE | ID: mdl-36621880

ABSTRACT

PURPOSE: Assessing the risk of common, complex diseases requires consideration of clinical risk factors as well as monogenic and polygenic risks, which in turn may be reflected in family history. Returning risks to individuals and providers may influence preventive care or use of prophylactic therapies for those individuals at high genetic risk. METHODS: To enable integrated genetic risk assessment, the eMERGE (electronic MEdical Records and GEnomics) network is enrolling 25,000 diverse individuals in a prospective cohort study across 10 sites. The network developed methods to return cross-ancestry polygenic risk scores, monogenic risks, family history, and clinical risk assessments via a genome-informed risk assessment (GIRA) report and will assess uptake of care recommendations after return of results. RESULTS: GIRAs include summary care recommendations for 11 conditions, education pages, and clinical laboratory reports. The return of high-risk GIRA to individuals and providers includes guidelines for care and lifestyle recommendations. Assembling the GIRA required infrastructure and workflows for ingesting and presenting content from multiple sources. Recruitment began in February 2022. CONCLUSION: Return of a novel report for communicating monogenic, polygenic, and family history-based risk factors will inform the benefits of integrated genetic risk assessment for routine health care.


Subject(s)
Genome; Genomics; Humans; Prospective Studies; Genomics/methods; Risk Factors; Risk Assessment
6.
PLoS Genet ; 16(3): e1008684, 2020 03.
Article in English | MEDLINE | ID: mdl-32226016

ABSTRACT

Lipid levels are important markers for the development of cardio-metabolic diseases. Although hundreds of associated loci have been identified through genetic association studies, the contribution of genetic factors to variation in lipids is not fully understood, particularly in U.S. minority groups. We performed genome-wide association analyses for four lipid traits in over 45,000 ancestrally diverse participants from the Population Architecture using Genomics and Epidemiology (PAGE) Study, followed by a meta-analysis with several European ancestry studies. We identified nine novel lipid loci, five of which showed evidence of replication in independent studies. Furthermore, we discovered one novel gene in a PrediXcan analysis, minority-specific independent signals at eight previously reported loci, and potential functional variants at two known loci through fine-mapping. Systematic examination of known lipid loci revealed smaller effect estimates in African American and Hispanic ancestry populations than those in Europeans, and better performance of polygenic risk scores based on minority-specific effect estimates. Our findings provide new insight into the genetic architecture of lipid traits and highlight the importance of conducting genetic studies in diverse populations in the era of precision medicine.


Subject(s)
Lipids/blood; Lipids/genetics; Racial Groups/genetics; Databases, Genetic; Female; Genome-Wide Association Study/methods; Genotype; Humans; Lipids/analysis; Male; Metagenomics/methods; Minority Groups; Multifactorial Inheritance/genetics; Phenotype; Polymorphism, Single Nucleotide/genetics; United States/epidemiology
7.
J Allergy Clin Immunol ; 150(5): 1086-1096, 2022 Nov.
Article in English | MEDLINE | ID: mdl-35595084

ABSTRACT

BACKGROUND: Asthma is the most common chronic condition in children and the third leading cause of hospitalization in pediatrics. The genome-wide association study catalog reports 140 studies with genome-wide significance. A polygenic risk score (PRS) with predictive value across ancestries has not been evaluated for this important trait. OBJECTIVES: This study aimed to train and validate a PRS relying on genetic determinants for asthma to provide predictions for disease occurrence in pediatric cohorts of diverse ancestries. METHODS: This study applied a Bayesian regression framework method using the Trans-National Asthma Genetic Consortium genome-wide association study summary statistics to derive a multiancestral PRS score, used one Electronic Medical Records and Genomics (eMERGE) cohort as a training set, used a second independent eMERGE cohort to validate the score, and used the UK Biobank data to replicate the findings. A phenome-wide association study was performed using the PRS to identify shared genetic etiology with other phenotypes. RESULTS: The multiancestral asthma PRS was associated with asthma in the 2 pediatric validation datasets. Overall, the multiancestral asthma PRS has an area under the curve (AUC) of 0.70 (95% CI, 0.69-0.72) in the pediatric validation 1 and AUC of 0.66 (0.65-0.66) in the pediatric validation 2 datasets. We found significant discrimination across pediatric subcohorts of European (AUC, 95% CI, 0.60 and 0.66), African (AUC, 95% CI, 0.61 and 0.66), admixed American (AUC, 0.64 and 0.70), Southeast Asian (AUC, 0.65), and East Asian (AUC, 0.73) ancestry. Pediatric participants with the top 5% PRS had 2.80 to 5.82 increased odds of asthma compared to the bottom 5% across the training, validation 1, and validation 2 cohorts when adjusted for ancestry. Phenome-wide association study analysis confirmed the strong association of the identified PRS with asthma (odds ratio, 2.71, P_FDR = 3.71 × 10^-65) and related phenotypes.
CONCLUSIONS: A multiancestral PRS for asthma based on Bayesian posterior genomic effect sizes identifies increased odds of pediatric asthma.


Subject(s)
Asthma; Genome-Wide Association Study; Humans; Child; Genome-Wide Association Study/methods; Multifactorial Inheritance; Genetic Predisposition to Disease; Polymorphism, Single Nucleotide; Bayes Theorem; Risk Factors; Asthma/genetics
8.
J Lipid Res ; 63(1): 100160, 2022 01.
Article in English | MEDLINE | ID: mdl-34902367

ABSTRACT

A significant proportion of patients with elevated LDL and a clinical presentation of familial hypercholesterolemia do not carry known genetic mutations associated with hypercholesterolemia, such as defects in the LDL receptor. To identify new genes involved in the cellular uptake of LDL, we developed a novel whole-genome clustered regularly interspaced short palindromic repeat-Cas9 KO screen in HepG2 cells. We identified transgelin (TAGLN), an actin-binding protein, as a potentially new gene involved in LDL endocytosis. In silico validation demonstrated that genetically predicted differences in expression of TAGLN in human populations were significantly associated with elevated plasma lipids (triglycerides, total cholesterol, and LDL-C) in the Global Lipids Genetics Consortium and lipid-related phenotypes in the UK Biobank. In biochemical studies, TAGLN-KO HepG2 cells showed a reduction in cellular LDL uptake, as measured by flow cytometry. In confocal microscopy imaging, TAGLN-KO cells had disrupted actin filaments as well as an accumulation of LDL receptor on their surface because of decreased receptor internalization. Furthermore, TAGLN-KO cells exhibited a reduction in total and free cholesterol content, activation of SREBP2, and a compensatory increase in cholesterol biosynthesis. TAGLN deficiency also disrupted the uptake of VLDL and transferrin, other known cargoes for receptors that depend upon clathrin-mediated endocytosis. Our data suggest that TAGLN is a novel factor involved in the actin-dependent phase of clathrin-mediated endocytosis of LDL. The identification of novel genes involved in the endocytic uptake of LDL may improve the diagnosis of hypercholesterolemia and provide future therapeutic targets for the prevention of cardiovascular disease.


Subject(s)
Microfilament Proteins; Muscle Proteins
9.
Hum Genet ; 141(11): 1739-1748, 2022 Nov.
Article in English | MEDLINE | ID: mdl-35226188

ABSTRACT

Uterine fibroids (UF) are common pelvic tumors in women, heritable, and genome-wide association studies (GWAS) have identified ~30 loci associated with increased risk in UF. Using summary statistics from a previously published UF GWAS performed in a non-Hispanic European Ancestry (NHW) female subset from the Electronic Medical Records and Genomics (eMERGE) Network, we constructed a polygenic risk score (PRS) for UF. UF-PRS was developed using PRSice and optimized in the separate clinical population of BioVU. PRS was validated using parallel methods of 10-fold cross-validation logistic regression and phenome-wide association study (PheWAS) in a separate subset of eMERGE NHW females (validation set), excluding samples used in GWAS. PRSice determined p_t < 0.001 and after linkage disequilibrium pruning (r^2 < 0.2), 4458 variants were in the PRS which was significant (pseudo-R^2 = 0.0018, p = 0.041). 10-fold cross-validation logistic regression modeling of validation set revealed the model had an area under the curve (AUC) value of 0.60 (95% confidence interval [CI] 0.58-0.62) when plotted in a receiver operator curve (ROC). PheWAS identified six phecodes associated with the PRS with the most significant phenotypes being 218 'benign neoplasm of uterus' and 218.1 'uterine leiomyoma' (p = 1.94 × 10^-23, OR 1.31 [95% CI 1.26-1.37] and p = 3.50 × 10^-23, OR 1.32 [95% CI 1.26-1.37]). We have developed and validated the first PRS for UF. We find our PRS has predictive ability for UF and captures genetic architecture of increased risk for UF that can be used in further studies.


Subject(s)
Genome-Wide Association Study; Leiomyoma; Female; Genetic Predisposition to Disease; Genomics; Humans; Leiomyoma/genetics; Linkage Disequilibrium; Risk Factors
10.
Curr Cardiol Rep ; 24(9): 1169-1177, 2022 09.
Article in English | MEDLINE | ID: mdl-35796859

ABSTRACT

PURPOSE OF REVIEW: A polygenic risk score (PRS) is a measure of genetic liability to a disease and is typically normally distributed in a population. Individuals in the upper tail of this distribution often have relative risk equivalent to that of monogenic form of the disease. The majority of currently available PRSs for coronary heart disease (CHD) have been generated from cohorts of European ancestry (EUR) and vary in their applicability to other ancestry groups. In this report, we review the performance of PRSs for CHD across different ancestries and efforts to reduce variability in performance including novel population and statistical genetics approaches. RECENT FINDINGS: PRSs for CHD perform robustly in EUR populations but lag in performance in non-EUR groups, particularly individuals of African ancestry. Several large consortia have been established to enable genomic studies in diverse ancestry groups and develop methods to improve PRS performance in multi-ancestry contexts as well as admixed individuals. These include fine-mapping to ascertain causal variants, trans ancestry meta-analyses, and ancestry deconvolution in admixed individuals. PRSs are being used in the clinical setting but enthusiasm has been tempered by the variable performance in non-EUR ancestry groups. Increasing diversity in genomic association studies and continued innovation in methodological approaches are needed to improve PRS performance in non-EUR individuals for equitable implementation of genomic medicine.
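The notion of a PRS used in this review can be illustrated with a minimal Python sketch. It assumes the common weighted-sum form (per-variant effect size multiplied by risk-allele dosage), which the abstract does not spell out; the function names, weights, and genotypes are hypothetical and are not taken from any published score.

```python
from statistics import mean, pstdev

def polygenic_risk_score(dosages, weights):
    """Weighted sum of risk-allele dosages (0, 1, or 2) for one individual."""
    if len(dosages) != len(weights):
        raise ValueError("one weight per variant is required")
    return sum(d * w for d, w in zip(dosages, weights))

def standardize(scores):
    """Express raw scores in SD units relative to the supplied cohort."""
    mu, sigma = mean(scores), pstdev(scores)
    return [(s - mu) / sigma for s in scores]

# Hypothetical per-variant effect sizes (for example, log odds ratios).
weights = [0.10, 0.25, -0.05]
# Hypothetical genotypes: one row of allele dosages per individual.
cohort = [[0, 1, 2], [2, 2, 0], [1, 0, 1], [0, 0, 0]]

raw = [polygenic_risk_score(person, weights) for person in cohort]
z = standardize(raw)
```

Standardizing against a reference cohort is what makes statements such as "per SD increase" or "upper tail of the distribution" meaningful; if the reference cohort differs in ancestry from the individual being scored, the resulting position in the distribution may not transfer.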


Subject(s)
Coronary Disease; Genome-Wide Association Study; Coronary Disease/genetics; Genetic Predisposition to Disease; Humans; Risk Factors
11.
BMC Med Inform Decis Mak ; 22(1): 23, 2022 01 28.
Article in English | MEDLINE | ID: mdl-35090449

ABSTRACT

INTRODUCTION: Currently, one of the commonly used methods for disseminating electronic health record (EHR)-based phenotype algorithms is providing a narrative description of the algorithm logic, often accompanied by flowcharts. A challenge with this mode of dissemination is the potential for under-specification in the algorithm definition, which leads to ambiguity and vagueness. METHODS: This study examines incidents of under-specification that occurred during the implementation of 34 narrative phenotyping algorithms in the electronic Medical Record and Genomics (eMERGE) network. We reviewed the online communication history between algorithm developers and implementers within the Phenotype Knowledge Base (PheKB) platform, where questions could be raised and answered regarding the intended implementation of a phenotype algorithm. RESULTS: We developed a taxonomy of under-specification categories via an iterative review process between two groups of annotators. Under-specifications that lead to ambiguity and vagueness were consistently found across narrative phenotype algorithms developed by all involved eMERGE sites. DISCUSSION AND CONCLUSION: Our findings highlight that under-specification is an impediment to the accuracy and efficiency of the implementation of current narrative phenotyping algorithms, and we propose approaches for mitigating these issues and improved methods for disseminating EHR phenotyping algorithms.


Subject(s)
Algorithms; Electronic Health Records; Genomics; Humans; Knowledge Bases; Phenotype
12.
Circulation ; 142(17): 1633-1646, 2020 10 27.
Article in English | MEDLINE | ID: mdl-32981348

ABSTRACT

BACKGROUND: Abdominal aortic aneurysm (AAA) is an important cause of cardiovascular mortality; however, its genetic determinants remain incompletely defined. In total, 10 previously identified risk loci explain a small fraction of AAA heritability. METHODS: We performed a genome-wide association study in the Million Veteran Program testing ≈18 million DNA sequence variants with AAA (7642 cases and 172 172 controls) in veterans of European ancestry with independent replication in up to 4972 cases and 99 858 controls. We then used mendelian randomization to examine the causal effects of blood pressure on AAA. We examined the association of AAA risk variants with aneurysms in the lower extremity, cerebral, and iliac arterial beds, and derived a genome-wide polygenic risk score (PRS) to identify a subset of the population at greater risk for disease. RESULTS: Through a genome-wide association study, we identified 14 novel loci, bringing the total number of known significant AAA loci to 24. In our mendelian randomization analysis, we demonstrate that a genetic increase of 10 mm Hg in diastolic blood pressure (odds ratio, 1.43 [95% CI, 1.24-1.66]; P=1.6×10^-6), as opposed to systolic blood pressure (odds ratio, 1.06 [95% CI, 0.97-1.15]; P=0.2), likely has a causal relationship with AAA development. We observed that 19 of 24 AAA risk variants associate with aneurysms in at least 1 other vascular territory. A 29-variant PRS was strongly associated with AAA (odds ratio_PRS, 1.26 [95% CI, 1.18-1.36]; P_PRS=2.7×10^-11 per SD increase in PRS), independent of family history and smoking risk factors (odds ratio_PRS+family history+smoking, 1.24 [95% CI, 1.14-1.35]; P_PRS=1.27×10^-6). Using this PRS, we identified a subset of the population with AAA prevalence greater than that observed in screening trials informing current guidelines.
CONCLUSIONS: We identify novel AAA genetic associations with therapeutic implications and identify a subset of the population at significantly increased genetic risk of AAA independent of family history. Our data suggest that extending current screening guidelines to include testing to identify those with high polygenic AAA risk, once the cost of genotyping becomes comparable with that of screening ultrasound, would significantly increase the yield of current screening at reasonable cost.


Subject(s)
Aortic Aneurysm, Abdominal/genetics; Humans; Veterans
13.
Int J Obes (Lond) ; 45(1): 155-169, 2021 01.
Article in English | MEDLINE | ID: mdl-32952152

ABSTRACT

BACKGROUND/OBJECTIVES: Melanocortin-4 receptor (MC4R) plays an essential role in food intake and energy homeostasis. More than 170 MC4R variants have been described over the past two decades, with conflicting reports regarding the prevalence and phenotypic effects of these variants in diverse cohorts. To determine the frequency of MC4R variants in a large cohort of different ancestries, we evaluated the MC4R coding region for 20,537 eMERGE participants with sequencing data plus additional 77,454 independent individuals with genome-wide genotyping data at this locus. SUBJECTS/METHODS: The sequencing data were obtained from the eMERGE phase III study, in which multisample variant call format calls have been generated, curated, and annotated. In addition to penetrance estimation using body mass index (BMI) as a binary outcome, GWAS and PheWAS were performed using median BMI in linear regression analyses. All results were adjusted for principal components, age, sex, and sites of genotyping. RESULTS: Targeted sequencing data of MC4R revealed 125 coding variants in 1839 eMERGE participants including 30 unreported coding variants that were predicted to be functionally damaging. Highly penetrant unreported variants included (L325I, E308K, D298N, S270F, F261L, T248A, D111V, and Y80F) in which seven participants had obesity class III defined as BMI ≥ 40 kg/m^2. In GWAS analysis, in addition to known risk haplotype upstream of MC4R (best variant rs6567160; P = 5.36 × 10^-25, Beta = 0.37), a novel rare haplotype was detected which was protective against obesity and encompassed the V103I variant with known gain-of-function properties (P = 6.23 × 10^-08, Beta = -0.62). PheWAS analyses extended this protective effect of V103I to type 2 diabetes, diabetic nephropathy, and chronic renal failure independent of BMI.
CONCLUSIONS: MC4R screening in a large eMERGE cohort confirmed many previous findings, extended the MC4R pleiotropic effects, and discovered additional MC4R rare alleles that probably contribute to obesity.


Subject(s)
Genetic Variation/genetics; Genome-Wide Association Study; Obesity; Receptor, Melanocortin, Type 4/genetics; Adult; Aged; Body Mass Index; Cohort Studies; Female; Humans; Male; Middle Aged; Obesity/epidemiology; Obesity/genetics
14.
J Biomed Inform ; 99: 103293, 2019 11.
Article in English | MEDLINE | ID: mdl-31542521

ABSTRACT

BACKGROUND: Implementation of phenotype algorithms requires phenotype engineers to interpret human-readable algorithms and translate the description (text and flowcharts) into computable phenotypes - a process that can be labor intensive and error prone. To address the critical need for reducing the implementation efforts, it is important to develop portable algorithms. METHODS: We conducted a retrospective analysis of phenotype algorithms developed in the Electronic Medical Records and Genomics (eMERGE) network and identified common customization tasks required for implementation. A novel scoring system was developed to quantify portability from three aspects: Knowledge conversion, clause Interpretation, and Programming (KIP). Tasks were grouped into twenty representative categories. Experienced phenotype engineers were asked to estimate the average time spent on each category and evaluate time saving enabled by a common data model (CDM), specifically the Observational Medical Outcomes Partnership (OMOP) model, for each category. RESULTS: A total of 485 distinct clauses (phenotype criteria) were identified from 55 phenotype algorithms, corresponding to 1153 customization tasks. In addition to 25 non-phenotype-specific tasks, 46 tasks are related to interpretation, 613 tasks are related to knowledge conversion, and 469 tasks are related to programming. A score between 0 and 2 (0 for easy, 1 for moderate, and 2 for difficult portability) is assigned for each aspect, yielding a total KIP score range of 0 to 6. The average clause-wise KIP score to reflect portability is 1.37 ± 1.38. Specifically, the average knowledge (K) score is 0.64 ± 0.66, interpretation (I) score is 0.33 ± 0.55, and programming (P) score is 0.40 ± 0.64. 5% of the categories can be completed within one hour (median). 70% of the categories take from days to months to complete. The OMOP model can assist with vocabulary mapping tasks.
CONCLUSION: This study presents firsthand knowledge of the substantial implementation efforts in phenotyping and introduces a novel metric (KIP) to measure portability of phenotype algorithms for quantifying such efforts across the eMERGE Network. Phenotype developers are encouraged to analyze and optimize the portability with regard to knowledge, interpretation, and programming. CDMs can be used to improve the portability for some 'knowledge-oriented' tasks.
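The KIP scoring described in this abstract can be sketched in a few lines of Python. This is an illustrative reading of the abstract, not the authors' implementation: the class name, field names, and example clauses are hypothetical, and only the 0 to 2 scale per aspect and the 0 to 6 total come from the text.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class KipScore:
    """Portability score for one clause; each aspect is 0 (easy), 1 (moderate), or 2 (difficult)."""
    knowledge: int
    interpretation: int
    programming: int

    def __post_init__(self):
        # Reject values outside the 0-2 scale stated in the abstract.
        for name in ("knowledge", "interpretation", "programming"):
            value = getattr(self, name)
            if value not in (0, 1, 2):
                raise ValueError(f"{name} must be 0, 1, or 2, got {value}")

    @property
    def total(self) -> int:
        """Total KIP score, ranging from 0 to 6."""
        return self.knowledge + self.interpretation + self.programming

# Hypothetical clauses, for illustration only.
clauses = [KipScore(2, 0, 1), KipScore(0, 0, 0), KipScore(1, 1, 0)]
mean_total = sum(c.total for c in clauses) / len(clauses)
```

Because the total is a plain sum, the mean total equals the sum of the per-aspect means, which agrees with the reported figures (0.64 + 0.33 + 0.40 = 1.37). The task counts are likewise consistent: 25 + 46 + 613 + 469 = 1153.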


Subject(s)
Electronic Health Records/classification; Medical Informatics/methods; Algorithms; Genomics; Humans; Phenotype; Retrospective Studies
15.
J Biomed Inform ; 96: 103253, 2019 08.
Article in English | MEDLINE | ID: mdl-31325501

ABSTRACT

BACKGROUND: Implementing clinical phenotypes across a network is labor intensive and potentially error prone. Use of a common data model may facilitate the process. METHODS: Electronic Medical Records and Genomics (eMERGE) sites implemented the Observational Health Data Sciences and Informatics (OHDSI) Observational Medical Outcomes Partnership (OMOP) Common Data Model across their electronic health record (EHR)-linked DNA biobanks. Two previously implemented eMERGE phenotypes were converted to OMOP and implemented across the network. RESULTS: It was feasible to implement the common data model across sites, with laboratory data producing the greatest challenge due to local encoding. Sites were then able to execute the OMOP phenotype in less than one day, as opposed to weeks of effort to manually implement an eMERGE phenotype in their bespoke research EHR databases. Of the sites that could compare the current OMOP phenotype implementation with the original eMERGE phenotype implementation, specific agreement ranged from 100% to 43%, with disagreements due to the original phenotype, the OMOP phenotype, changes in data, and issues in the databases. Using the OMOP query as a standard comparison revealed differences in the original implementations despite starting from the same definitions, code lists, flowcharts, and pseudocode. CONCLUSION: Using a common data model can dramatically speed phenotype implementation at the cost of having to populate that data model, though this will produce a net benefit as the number of phenotype implementations increases. Inconsistencies among the implementations of the original queries point to a potential benefit of using a common data model so that actual phenotype code and logic can be shared, mitigating human error in reinterpretation of a narrative phenotype definition.


Subject(s)
Attention Deficit Disorder with Hyperactivity/diagnosis; Databases, Factual; Diabetes Mellitus, Type 2/diagnosis; Electronic Health Records; Data Collection; Humans; Medical Informatics; National Human Genome Research Institute (U.S.); Observational Studies as Topic; Outcome Assessment, Health Care; Phenotype; Research Design; Software; United States
16.
Mayo Clin Proc Innov Qual Outcomes ; 8(1): 45-52, 2024 Feb.
Article in English | MEDLINE | ID: mdl-38274333

ABSTRACT

We investigated the association of daylight saving time (DST) transitions with the rates of adverse cardiovascular events in a large, US-based nationwide study. The study cohort included 36,116,951 unique individuals from deidentified administrative claims data of the OptumLabs Data Warehouse. There were 74,722 total adverse cardiovascular events during DST transition and the control weeks (2 weeks before and after) in spring and autumn of 2015-2019. We used Bayesian hierarchical Poisson regression models to estimate event rate ratios representing the ratio of composite adverse cardiovascular event rates between DST transition and control weeks. There was an average increase of 3% (95% uncertainty interval, -3% to 10%) and 4% (95% uncertainty interval, -2% to 12%) in adverse cardiovascular event rates during Monday and Friday of the spring DST transition, respectively. The probability of this being associated with a moderate-to-large increase in the event rates (estimate event rate ratio, >1.10) was estimated to be less than 6% for Monday and Friday, and less than 1% for the remaining days. During autumn DST transition, the probability of any decrease in adverse cardiovascular event rates was estimated to be less than 46% and a moderate-to-large decrease in the event rates to be less than 4% across all days. Results were similar when adjusted by age. In conclusion, spring DST transition had a suggestive association with a minor increase in adverse cardiovascular event rates but with a very low estimated probability to be of clinical importance. Our findings suggest that DST transitions are unlikely to meaningfully impact the rate of cardiovascular events.

17.
JAMA Cardiol ; 9(6): 497-506, 2024 Jun 01.
Article in English | MEDLINE | ID: mdl-38598228

ABSTRACT

Importance: Clonal hematopoiesis of indeterminate potential (CHIP) may contribute to the risk of atrial fibrillation (AF) through its association with inflammation and cardiac remodeling. Objective: To determine whether CHIP was associated with AF, inflammatory and cardiac biomarkers, and cardiac structural changes. Design, Setting, and Participants: This was a population-based, prospective cohort study in participants of the Atherosclerosis Risk in Communities (ARIC) study and UK Biobank (UKB) cohort. Samples were collected and echocardiography was performed from 2011 to 2013 in the ARIC cohort, and samples were collected from 2006 to 2010 in the UKB cohort. Included in this study were adults without hematologic malignancies, mitral valve stenosis, or previous mitral valve procedure from both the ARIC and UKB cohorts; additionally, participants without hypertrophic cardiomyopathy and congenital heart disease from the UKB cohort were also included. Data analysis was completed in 2023. Exposures: CHIP (variant allele frequency [VAF] ≥2%), common gene-specific CHIP subtypes (DNMT3A, TET2, ASXL1), large CHIP (VAF ≥10%), inflammatory and cardiac biomarkers (high-sensitivity C-reactive protein, interleukin 6 [IL-6], IL-18, high-sensitivity troponin T [hs-TnT] and hs-TnI, N-terminal pro-B-type natriuretic peptide), and echocardiographic indices. Main Outcome Measure: Incident AF. Results: A total of 199 982 adults were included in this study. In ARIC participants (4131 [2.1%]; mean [SD] age, 76 [5] years; 2449 female [59%]; 1682 male [41%]; 935 Black [23%] and 3196 White [77%]), 1019 had any CHIP (24.7%), and 478 had large CHIP (11.6%). In UKB participants (195 851 [97.9%]; mean [SD] age, 56 [8] years; 108 370 female [55%]; 87 481 male [45%]; 3154 Black [2%], 183 747 White [94%], and 7971 other race [4%]), 11 328 had any CHIP (5.8%), and 5189 had large CHIP (2.6%). 
ARIC participants were followed up for a median (IQR) period of 7.0 (5.3-7.7) years, and UKB participants were followed up for a median (IQR) period of 12.2 (11.3-13.0) years. Meta-analyzed hazard ratios for AF were 1.12 (95% CI, 1.01-1.25; P = .04) for participants with vs without large CHIP, 1.29 (95% CI, 1.05-1.59; P = .02) for those with vs without large TET2 CHIP (seen in 1340 of 197 209 [0.67%]), and 1.45 (95% CI, 1.02-2.07; P = .04) for those with vs without large ASXL1 CHIP (seen in 314 of 197 209 [0.16%]). Large TET2 CHIP was associated with higher IL-6 levels. Additionally, large ASXL1 was associated with higher hs-TnT level and increased left ventricular mass index. Conclusions and Relevance: Large TET2 and ASXL1, but not DNMT3A, CHIP was associated with higher IL-6 level, indices of cardiac remodeling, and increased risk for AF. Future research is needed to elaborate on the mechanisms driving the associations and to investigate potential interventions to reduce the risk.


Subject(s)
Atrial Fibrillation , Clonal Hematopoiesis , DNA-Binding Proteins , Dioxygenases , Proto-Oncogene Proteins , Repressor Proteins , Humans , Female , Male , Atrial Fibrillation/genetics , Clonal Hematopoiesis/genetics , Repressor Proteins/genetics , DNA-Binding Proteins/genetics , DNA-Binding Proteins/metabolism , Middle Aged , Proto-Oncogene Proteins/genetics , Prospective Studies , Aged , DNA Methyltransferase 3A , DNA (Cytosine-5-)-Methyltransferases/genetics , Biomarkers/blood , Biomarkers/metabolism , C-Reactive Protein/metabolism , C-Reactive Protein/genetics , Interleukin-6/genetics , Interleukin-6/metabolism , Troponin T/genetics , Troponin T/blood , Troponin T/metabolism , Echocardiography , United Kingdom/epidemiology
18.
Circ Genom Precis Med ; 17(3): e004272, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38380516

ABSTRACT

BACKGROUND: Predictive performance of polygenic risk scores (PRS) varies across populations. To facilitate equitable clinical use, we developed PRS for coronary heart disease (CHD; PRSCHD) for 5 genetic ancestry groups. METHODS: We derived ancestry-specific and multi-ancestry PRSCHD based on pruning and thresholding (PRSPT) and ancestry-based continuous shrinkage priors (PRSCSx) applied to summary statistics from the largest multi-ancestry genome-wide association study meta-analysis for CHD to date, including 1.1 million participants from 5 major genetic ancestry groups. Following training and optimization in the Million Veteran Program, we evaluated the best-performing PRSCHD in 176,988 individuals across 9 diverse cohorts. RESULTS: Multi-ancestry PRSPT and PRSCSx outperformed ancestry-specific PRSPT and PRSCSx across a range of tuning values. Two best-performing multi-ancestry PRSCHD (ie, PRSPTmult and PRSCSxmult) and 1 ancestry-specific PRSCHD (PRSCSxEUR) were taken forward for validation. PRSPTmult demonstrated the strongest association with CHD in individuals of South Asian ancestry and European ancestry (odds ratio per 1 SD [95% CI], 2.75 [2.41-3.14] and 1.65 [1.59-1.72], respectively), followed by East Asian ancestry (1.56 [1.50-1.61]), Hispanic/Latino ancestry (1.38 [1.24-1.54]), and African ancestry (1.16 [1.11-1.21]). PRSCSxmult showed the strongest associations in South Asian ancestry (2.67 [2.38-3.00]) and European ancestry (1.65 [1.59-1.71]), lower in East Asian ancestry (1.59 [1.54-1.64]), Hispanic/Latino ancestry (1.51 [1.35-1.69]), and the lowest in African ancestry (1.20 [1.15-1.26]). CONCLUSIONS: The use of summary statistics from a large multi-ancestry genome-wide meta-analysis improved the performance of PRSCHD in most ancestry groups compared with single-ancestry methods. Despite the use of one of the largest and most diverse sets of training and validation cohorts to date, improvement of predictive performance was limited in African ancestry. This highlights the need for larger genome-wide association study datasets of underrepresented populations to enhance the performance of PRSCHD.


Subject(s)
Coronary Disease , Genome-Wide Association Study , Multifactorial Inheritance , Humans , Coronary Disease/genetics , Male , Female , Genetic Predisposition to Disease , Polymorphism, Single Nucleotide , Risk Factors , Middle Aged , Genetic Risk Score
19.
Nat Med ; 30(2): 480-487, 2024 Feb.
Article in English | MEDLINE | ID: mdl-38374346

ABSTRACT

Polygenic risk scores (PRSs) have improved in predictive performance, but several challenges remain to be addressed before PRSs can be implemented in the clinic, including reduced predictive performance of PRSs in diverse populations, and the interpretation and communication of genetic results to both providers and patients. To address these challenges, the National Human Genome Research Institute-funded Electronic Medical Records and Genomics (eMERGE) Network has developed a framework and pipeline for return of a PRS-based genome-informed risk assessment to 25,000 diverse adults and children as part of a clinical study. From an initial list of 23 conditions, ten were selected for implementation based on PRS performance, medical actionability and potential clinical utility, including cardiometabolic diseases and cancer. Standardized metrics were considered in the selection process, with additional consideration given to strength of evidence in African and Hispanic populations. We then developed a pipeline for clinical PRS implementation (score transfer to a clinical laboratory, validation and verification of score performance), and used genetic ancestry to calibrate PRS mean and variance, utilizing genetically diverse data from 13,475 participants of the All of Us Research Program cohort to train and test model parameters. Finally, we created a framework for regulatory compliance and developed a PRS clinical report for return to providers and for inclusion in an additional genome-informed risk assessment. The initial experience from eMERGE can inform the approach needed to implement PRS-based testing in diverse clinical settings.


Subject(s)
Chronic Disease , Genetic Risk Score , Population Health , Adult , Child , Humans , Communication , Genetic Predisposition to Disease , Genome-Wide Association Study , Risk Factors , United States
20.
JACC Adv ; 2(7): 100567, 2023 Sep.
Article in English | MEDLINE | ID: mdl-38939477

ABSTRACT

Background: Genetic factors are not included in prediction models for coronary heart disease (CHD). Objectives: The authors assessed the predictive utility of a polygenic risk score (PRS) for CHD (defined as myocardial infarction, coronary revascularization, or cardiovascular death) and whether the risks due to monogenic familial hypercholesterolemia (FH) and family history (FamHx) are independent of and additive to the PRS. Methods: In UK Biobank participants, PRSCHD was calculated using metaGRS, and 10-year risk for incident CHD was estimated using the pooled cohort equations (PCE). The area under the curve (AUC) of the receiver operating characteristic curve and net reclassification improvement (NRI) were assessed. FH was defined as the presence of a pathogenic or likely pathogenic variant in LDLR, APOB, or PCSK9. FamHx was defined as a diagnosis of CHD in first-degree relatives. Independent and additive effects of PRSCHD, FH, and FamHx were evaluated in stratified analyses. Results: In 323,373 participants with genotype data, the addition of PRSCHD to PCE increased the AUC from 0.759 (95% CI: 0.755-0.763) to 0.773 (95% CI: 0.769-0.777). The AUC and NRIEvent for PRSCHD were higher before the age of 55 years. Of 199,997 participants with exome sequence data, 10,000 had a PRSCHD ≥95th percentile (PRSP95), 673 had FH, and 46,163 had FamHx. The CHD risk associated with PRSP95 was independent of FH and FamHx. The risks associated with combinations of PRSCHD, FH, and FamHx were additive, and comprehensive estimates could be obtained by multiplying the risk from each genetic factor. Conclusions: Incorporating PRSCHD into the PCE improves risk prediction for CHD, especially at younger ages. The associations of PRSCHD, FH, and FamHx with CHD were independent and additive.
