Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 35
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Hum Genet ; 141(11): 1739-1748, 2022 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-35226188

RESUMEN

Uterine fibroids (UF) are common pelvic tumors in women, heritable, and genome-wide association studies (GWAS) have identified ~ 30 loci associated with increased risk in UF. Using summary statistics from a previously published UF GWAS performed in a non-Hispanic European Ancestry (NHW) female subset from the Electronic Medical Records and Genomics (eMERGE) Network, we constructed a polygenic risk score (PRS) for UF. UF-PRS was developed using PRSice and optimized in the separate clinical population of BioVU. PRS was validated using parallel methods of 10-fold cross-validation logistic regression and phenome-wide association study (PheWAS) in a seperate subset of eMERGE NHW females (validation set), excluding samples used in GWAS. PRSice determined pt < 0.001 and after linkage disequilibrium pruning (r2 < 0.2), 4458 variants were in the PRS which was significant (pseudo-R2 = 0.0018, p = 0.041). 10-fold cross-validation logistic regression modeling of validation set revealed the model had an area under the curve (AUC) value of 0.60 (95% confidence interval [CI] 0.58-0.62) when plotted in a receiver operator curve (ROC). PheWAS identified six phecodes associated with the PRS with the most significant phenotypes being 218 'benign neoplasm of uterus' and 218.1 'uterine leiomyoma' (p = 1.94 × 10-23, OR 1.31 [95% CI 1.26-1.37] and p = 3.50 × 10-23, OR 1.32 [95% CI 1.26-1.37]). We have developed and validated the first PRS for UF. We find our PRS has predictive ability for UF and captures genetic architecture of increased risk for UF that can be used in further studies.


Asunto(s)
Estudio de Asociación del Genoma Completo , Leiomioma , Femenino , Predisposición Genética a la Enfermedad , Genómica , Humanos , Leiomioma/genética , Desequilibrio de Ligamiento , Factores de Riesgo
2.
Genet Med ; 24(5): 1130-1138, 2022 05.
Artículo en Inglés | MEDLINE | ID: mdl-35216901

RESUMEN

PURPOSE: The goal of Electronic Medical Records and Genomics (eMERGE) Phase III Network was to return actionable sequence variants to 25,084 consenting participants from 10 different health care institutions across the United States. The purpose of this study was to evaluate system-based issues relating to the return of results (RoR) disclosure process for clinical grade research genomic tests to eMERGE3 participants. METHODS: RoR processes were developed and approved by each eMERGE institution's internal review board. Investigators at each eMERGE3 site were surveyed for RoR processes related to the participant's disclosure of pathogenic or likely pathogenic variants and engagement with genetic counseling. Standard statistical analysis was performed. RESULTS: Of the 25,084 eMERGE participants, 1444 had a pathogenic or likely pathogenic variant identified on the eMERGEseq panel of 67 genes and 14 single nucleotide variants. Of these, 1077 (74.6%) participants had results disclosed, with 562 (38.9%) participants provided with variant-specific genetic counseling. Site-specific processes that either offered or required genetic counseling in their RoR process had an effect on whether a participant ultimately engaged with genetic counseling (P = .0052). CONCLUSION: The real-life experience of the multiarm eMERGE3 RoR study for returning actionable genomic results to consented research participants showed the impact of consent, method of disclosure, and genetic counseling on RoR.


Asunto(s)
Genoma , Genómica , Revelación , Asesoramiento Genético , Humanos , Grupos de Población
3.
Circulation ; 142(17): 1633-1646, 2020 10 27.
Artículo en Inglés | MEDLINE | ID: mdl-32981348

RESUMEN

BACKGROUND: Abdominal aortic aneurysm (AAA) is an important cause of cardiovascular mortality; however, its genetic determinants remain incompletely defined. In total, 10 previously identified risk loci explain a small fraction of AAA heritability. METHODS: We performed a genome-wide association study in the Million Veteran Program testing ≈18 million DNA sequence variants with AAA (7642 cases and 172 172 controls) in veterans of European ancestry with independent replication in up to 4972 cases and 99 858 controls. We then used mendelian randomization to examine the causal effects of blood pressure on AAA. We examined the association of AAA risk variants with aneurysms in the lower extremity, cerebral, and iliac arterial beds, and derived a genome-wide polygenic risk score (PRS) to identify a subset of the population at greater risk for disease. RESULTS: Through a genome-wide association study, we identified 14 novel loci, bringing the total number of known significant AAA loci to 24. In our mendelian randomization analysis, we demonstrate that a genetic increase of 10 mm Hg in diastolic blood pressure (odds ratio, 1.43 [95% CI, 1.24-1.66]; P=1.6×10-6), as opposed to systolic blood pressure (odds ratio, 1.06 [95% CI, 0.97-1.15]; P=0.2), likely has a causal relationship with AAA development. We observed that 19 of 24 AAA risk variants associate with aneurysms in at least 1 other vascular territory. A 29-variant PRS was strongly associated with AAA (odds ratioPRS, 1.26 [95% CI, 1.18-1.36]; PPRS=2.7×10-11 per SD increase in PRS), independent of family history and smoking risk factors (odds ratioPRS+family history+smoking, 1.24 [95% CI, 1.14-1.35]; PPRS=1.27×10-6). Using this PRS, we identified a subset of the population with AAA prevalence greater than that observed in screening trials informing current guidelines. CONCLUSIONS: We identify novel AAA genetic associations with therapeutic implications and identify a subset of the population at significantly increased genetic risk of AAA independent of family history. Our data suggest that extending current screening guidelines to include testing to identify those with high polygenic AAA risk, once the cost of genotyping becomes comparable with that of screening ultrasound, would significantly increase the yield of current screening at reasonable cost.


Asunto(s)
Aneurisma de la Aorta Abdominal/genética , Humanos , Veteranos
4.
Circulation ; 138(22): 2469-2481, 2018 11 27.
Artículo en Inglés | MEDLINE | ID: mdl-30571344

RESUMEN

BACKGROUND: Proteomic approaches allow measurement of thousands of proteins in a single specimen, which can accelerate biomarker discovery. However, applying these technologies to massive biobanks is not currently feasible because of the practical barriers and costs of implementing such assays at scale. To overcome these challenges, we used a "virtual proteomic" approach, linking genetically predicted protein levels to clinical diagnoses in >40 000 individuals. METHODS: We used genome-wide association data from the Framingham Heart Study (n=759) to construct genetic predictors for 1129 plasma protein levels. We validated the genetic predictors for 268 proteins and used them to compute predicted protein levels in 41 288 genotyped individuals in the Electronic Medical Records and Genomics (eMERGE) cohort. We tested associations for each predicted protein with 1128 clinical phenotypes. Lead associations were validated with directly measured protein levels and either low-density lipoprotein cholesterol or subclinical atherosclerosis in the MDCS (Malmö Diet and Cancer Study; n=651). RESULTS: In the virtual proteomic analysis in eMERGE, 55 proteins were associated with 89 distinct diagnoses at a false discovery rate q<0.1. Among these, 13 associations involved lipid (n=7) or atherosclerosis (n=6) phenotypes. We tested each association for validation in MDCS using directly measured protein levels. At Bonferroni-adjusted significance thresholds, levels of apolipoprotein E isoforms were associated with hyperlipidemia, and circulating C-type lectin domain family 1 member B and platelet-derived growth factor receptor-ß predicted subclinical atherosclerosis. Odds ratios for carotid atherosclerosis were 1.31 (95% CI, 1.08-1.58; P=0.006) per 1-SD increment in C-type lectin domain family 1 member B and 0.79 (0.66-0.94; P=0.008) per 1-SD increment in platelet-derived growth factor receptor-ß. CONCLUSIONS: We demonstrate a biomarker discovery paradigm to identify candidate biomarkers of cardiovascular and other diseases.


Asunto(s)
Biomarcadores/sangre , Enfermedades de las Arterias Carótidas/diagnóstico , Estudio de Asociación del Genoma Completo , Proteoma/análisis , Adulto , Anciano , Anciano de 80 o más Años , Enfermedades de las Arterias Carótidas/genética , Femenino , Genotipo , Humanos , Lectinas Tipo C/análisis , Masculino , Persona de Mediana Edad , Oportunidad Relativa , Fenotipo , Polimorfismo de Nucleótido Simple , Proteómica , Receptor beta de Factor de Crecimiento Derivado de Plaquetas/sangre
5.
N Engl J Med ; 374(12): 1134-44, 2016 03 24.
Artículo en Inglés | MEDLINE | ID: mdl-26934567

RESUMEN

BACKGROUND: The discovery of low-frequency coding variants affecting the risk of coronary artery disease has facilitated the identification of therapeutic targets. METHODS: Through DNA genotyping, we tested 54,003 coding-sequence variants covering 13,715 human genes in up to 72,868 patients with coronary artery disease and 120,770 controls who did not have coronary artery disease. Through DNA sequencing, we studied the effects of loss-of-function mutations in selected genes. RESULTS: We confirmed previously observed significant associations between coronary artery disease and low-frequency missense variants in the genes LPA and PCSK9. We also found significant associations between coronary artery disease and low-frequency missense variants in the genes SVEP1 (p.D2702G; minor-allele frequency, 3.60%; odds ratio for disease, 1.14; P=4.2×10(-10)) and ANGPTL4 (p.E40K; minor-allele frequency, 2.01%; odds ratio, 0.86; P=4.0×10(-8)), which encodes angiopoietin-like 4. Through sequencing of ANGPTL4, we identified 9 carriers of loss-of-function mutations among 6924 patients with myocardial infarction, as compared with 19 carriers among 6834 controls (odds ratio, 0.47; P=0.04); carriers of ANGPTL4 loss-of-function alleles had triglyceride levels that were 35% lower than the levels among persons who did not carry a loss-of-function allele (P=0.003). ANGPTL4 inhibits lipoprotein lipase; we therefore searched for mutations in LPL and identified a loss-of-function variant that was associated with an increased risk of coronary artery disease (p.D36N; minor-allele frequency, 1.9%; odds ratio, 1.13; P=2.0×10(-4)) and a gain-of-function variant that was associated with protection from coronary artery disease (p.S447*; minor-allele frequency, 9.9%; odds ratio, 0.94; P=2.5×10(-7)). CONCLUSIONS: We found that carriers of loss-of-function mutations in ANGPTL4 had triglyceride levels that were lower than those among noncarriers; these mutations were also associated with protection from coronary artery disease. (Funded by the National Institutes of Health and others.).


Asunto(s)
Angiopoyetinas/genética , Moléculas de Adhesión Celular/genética , Enfermedad de la Arteria Coronaria/genética , Lipoproteína Lipasa/genética , Mutación , Triglicéridos/sangre , Anciano , Proteína 4 Similar a la Angiopoyetina , Femenino , Técnicas de Genotipaje , Humanos , Lipoproteína Lipasa/antagonistas & inhibidores , Lipoproteína Lipasa/metabolismo , Masculino , Persona de Mediana Edad , Mutación Missense , Factores de Riesgo , Análisis de Secuencia de ADN , Triglicéridos/genética
6.
J Thromb Thrombolysis ; 41(4): 656-62, 2016 May.
Artículo en Inglés | MEDLINE | ID: mdl-26446588

RESUMEN

Abnormal platelet reactivity is associated with recurrent ischemia and bleeding following percutaneous coronary intervention (PCI). Protease-activated receptor-1 (PAR1), encoded by F2R, is a high affinity thrombin receptor on platelets and the target of the antiplatelet drug vorapaxar. The intronic single nucleotide polymorphism F2R IVS-14 A/T affects PAR1 receptor density and function. We hypothesized that carriers of the T allele, who have been shown to have decreased platelet reactivity, would be at lower risk for thrombotic events, but higher risk for bleeding following PCI. Using BioVU, the Vanderbilt DNA repository linked to the electronic medical record, we studied 660 patients who underwent PCI for unstable or stable coronary artery disease. Primary outcome measures were major adverse cardiovascular events (MACE, composite of revascularization, MI, stroke, death) and bleeding (assessed by Bleeding Academic Research Consortium scale) over 24 months. The minor allele (T) frequency was 14.8 %. There were no genotypic differences in the frequency of MACE (33.7, 28.8, and 31.6 % for A/A, A/T, and T/T respectively, P = 0.50) or bleeding (15.7, 14.7, and 18.8 % for A/A, A/T, and T/T respectively, P = 0.90). In a Cox regression model, fully adjusted for age, race, sex, BMI, and smoking status, carrying a T allele was not associated with MACE (HR 1.19, 95 % CI 0.89-1.59, P = 0.23) or bleeding (HR 0.73, 95 % CI 0.37-1.4, P = 0.34). In conclusion, in our population, F2R IVS-14 PAR1 variability does not affect risk of MACE or bleeding following PCI.


Asunto(s)
Enfermedad de la Arteria Coronaria/genética , Enfermedad de la Arteria Coronaria/cirugía , Intervención Coronaria Percutánea/efectos adversos , Polimorfismo Genético , Hemorragia Posoperatoria/genética , Receptor PAR-1/genética , Anciano , Anciano de 80 o más Años , Enfermedad de la Arteria Coronaria/mortalidad , Femenino , Humanos , Masculino , Persona de Mediana Edad , Infarto del Miocardio/etiología , Infarto del Miocardio/genética , Infarto del Miocardio/mortalidad , Hemorragia Posoperatoria/etiología , Hemorragia Posoperatoria/mortalidad , Accidente Cerebrovascular/etiología , Accidente Cerebrovascular/genética , Accidente Cerebrovascular/mortalidad
7.
J Allergy Clin Immunol ; 136(6): 1503-1510, 2015 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-26073756

RESUMEN

BACKGROUND: Asthma exacerbations are a major cause of morbidity and medical cost. OBJECTIVE: The objective of this study was to identify genetic predictors of exacerbations in asthmatic subjects. METHODS: We performed a genome-wide association study meta-analysis of acute asthma exacerbation in 2 pediatric clinical trials: the Childhood Asthma Management Program (n = 581) and the Childhood Asthma Research and Education (n = 205) network. Acute asthma exacerbations were defined as treatment with a 5-day course of oral steroids. We obtained a replication cohort from Biobank of Vanderbilt University Medical Center (BioVU; n = 786), the Vanderbilt University electronic medical record-linked DNA biobank. We used CD4(+) lymphocyte genome-wide mRNA expression profiling to identify associations of top single nucleotide polymorphisms with mRNA abundance of nearby genes. RESULTS: A locus in catenin (cadherin-associated protein), alpha 3 (CTNNA3), reached genome-wide significance (rs7915695, P = 2.19 × 10(-8); mean exacerbations, 6.05 for minor alleles vs 3.71 for homozygous major alleles). Among the 4 top single nucleotide polymorphisms replicated in BioVU, rs993312 in Sema domain, immunoglobulin domain (Ig), short basic domain, secreted, (semaphorin) 3D (SEMA3D) was significant (P = .0083) and displayed stronger association among African Americans (P = .0004 in BioVU [mean exacerbations, 3.91 vs 1.53]; P = .0089 in the Childhood Asthma Management Program [mean exacerbations, 6.0 vs 3.25]). CTNNA3 variants did not replicate in BioVU. A regulatory variant in the CTNNA3 locus was associated with CTNNA3 mRNA expression in CD4(+) cells from asthmatic patients (P = .00079). CTNNA3 appears to be active in the immune response, and SEMA3D has a plausible role in airway remodeling. We also provide a replication of a previous association of purinergic receptor P2X, ligand-gated ion channel, 7 (P2RX7), with asthma exacerbation. CONCLUSIONS: We identified 2 loci associated with exacerbations through a genome-wide association study. CTNNA3 met genome-wide significance thresholds, and SEMA3D replicated in a clinical biobank database.


Asunto(s)
Asma/genética , Semaforinas/genética , alfa Catenina/genética , Adolescente , Adulto , Niño , Preescolar , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Lactante , Masculino , Polimorfismo de Nucleótido Simple , ARN Mensajero/metabolismo , Análisis de Secuencia de ARN , Adulto Joven
8.
BMC Med Inform Decis Mak ; 15 Suppl 1: S2, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26045009

RESUMEN

BACKGROUND: Parsing, which generates a syntactic structure of a sentence (a parse tree), is a critical component of natural language processing (NLP) research in any domain including medicine. Although parsers developed in the general English domain, such as the Stanford parser, have been applied to clinical text, there are no formal evaluations and comparisons of their performance in the medical domain. METHODS: In this study, we investigated the performance of three state-of-the-art parsers: the Stanford parser, the Bikel parser, and the Charniak parser, using following two datasets: (1) A Treebank containing 1,100 sentences that were randomly selected from progress notes used in the 2010 i2b2 NLP challenge and manually annotated according to a Penn Treebank based guideline; and (2) the MiPACQ Treebank, which is developed based on pathology notes and clinical notes, containing 13,091 sentences. We conducted three experiments on both datasets. First, we measured the performance of the three state-of-the-art parsers on the clinical Treebanks with their default settings. Then we re-trained the parsers using the clinical Treebanks and evaluated their performance using the 10-fold cross validation method. Finally we re-trained the parsers by combining the clinical Treebanks with the Penn Treebank. RESULTS: Our results showed that the original parsers achieved lower performance in clinical text (Bracketing F-measure in the range of 66.6%-70.3%) compared to general English text. After retraining on the clinical Treebank, all parsers achieved better performance, with the best performance from the Stanford parser that reached the highest Bracketing F-measure of 73.68% on progress notes and 83.72% on the MiPACQ corpus using 10-fold cross validation. When the combined clinical Treebanks and Penn Treebank was used, of the three parsers, the Charniak parser achieved the highest Bracketing F-measure of 73.53% on progress notes and the Stanford parser reached the highest F-measure of 84.15% on the MiPACQ corpus. CONCLUSIONS: Our study demonstrates that re-training using clinical Treebanks is critical for improving general English parsers' performance on clinical text, and combining clinical and open domain corpora might achieve optimal performance for parsing clinical text.


Asunto(s)
Lingüística/métodos , Informática Médica/métodos , Procesamiento de Lenguaje Natural , Humanos
9.
J Am Med Inform Assoc ; 31(4): 846-854, 2024 Apr 03.
Artículo en Inglés | MEDLINE | ID: mdl-38263490

RESUMEN

IMPORTANCE: Knowledge gained from cohort studies has dramatically advanced both public and precision health. The All of Us Research Program seeks to enroll 1 million diverse participants who share multiple sources of data, providing unique opportunities for research. It is important to understand the phenomic profiles of its participants to conduct research in this cohort. OBJECTIVES: More than 280 000 participants have shared their electronic health records (EHRs) in the All of Us Research Program. We aim to understand the phenomic profiles of this cohort through comparisons with those in the US general population and a well-established nation-wide cohort, UK Biobank, and to test whether association results of selected commonly studied diseases in the All of Us cohort were comparable to those in UK Biobank. MATERIALS AND METHODS: We included participants with EHRs in All of Us and participants with health records from UK Biobank. The estimates of prevalence of diseases in the US general population were obtained from the Global Burden of Diseases (GBD) study. We conducted phenome-wide association studies (PheWAS) of 9 commonly studied diseases in both cohorts. RESULTS: This study included 287 012 participants from the All of Us EHR cohort and 502 477 participants from the UK Biobank. A total of 314 diseases curated by the GBD were evaluated in All of Us, 80.9% (N = 254) of which were more common in All of Us than in the US general population [prevalence ratio (PR) >1.1, P < 2 × 10-5]. Among 2515 diseases and phenotypes evaluated in both All of Us and UK Biobank, 85.6% (N = 2152) were more common in All of Us (PR >1.1, P < 2 × 10-5). The Pearson correlation coefficients of effect sizes from PheWAS between All of Us and UK Biobank were 0.61, 0.50, 0.60, 0.57, 0.40, 0.53, 0.46, 0.47, and 0.24 for ischemic heart diseases, lung cancer, chronic obstructive pulmonary disease, dementia, colorectal cancer, lower back pain, multiple sclerosis, lupus, and cystic fibrosis, respectively. DISCUSSION: Despite the differences in prevalence of diseases in All of Us compared to the US general population or the UK Biobank, our study supports that All of Us can facilitate rapid investigation of a broad range of diseases. CONCLUSION: Most diseases were more common in All of Us than in the general US population or the UK Biobank. Results of disease-disease association tests from All of Us are comparable to those estimated in another well-studied national cohort.


Asunto(s)
Fenómica , Salud Poblacional , Humanos , Bancos de Muestras Biológicas , Biobanco del Reino Unido , Fenotipo , Reino Unido/epidemiología
10.
Genet Med ; 15(10): 833-41, 2013 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-24009000

RESUMEN

PURPOSE: The design of electronic health records to translate genomic medicine into clinical care is crucial to successful introduction of new genomic services, yet there are few published guides to implementation. METHODS: The design, implemented features, and evolution of a locally developed electronic health record that supports a large pharmacogenomics program at a tertiary-care academic medical center was tracked over a 4-year development period. RESULTS: Developers and program staff created electronic health record mechanisms for ordering a pharmacogenomics panel in advance of clinical need (preemptive genotyping) and in response to a specific drug indication. Genetic data from panel-based genotyping were sequestered from the electronic health record until drug-gene interactions met evidentiary standards and deemed clinically actionable. A service to translate genotype to predicted drug-response phenotype populated a summary of drug-gene interactions, triggered inpatient and outpatient clinical decision support, updated laboratory records, and created gene results within online personal health records. CONCLUSION: The design of a locally developed electronic health record supporting pharmacogenomics has generalizable utility. The challenge of representing genomic data in a comprehensible and clinically actionable format is discussed along with reflection on the scalability of the model to larger sets of genomic data.


Asunto(s)
Registros Electrónicos de Salud , Farmacogenética , Centros Médicos Académicos , Sistemas de Apoyo a Decisiones Clínicas , Registros Electrónicos de Salud/normas , Registros Electrónicos de Salud/tendencias , Genotipo , Implementación de Plan de Salud , Humanos , Fenotipo , Investigación Biomédica Traslacional
11.
J Am Med Inform Assoc ; 30(3): 427-437, 2023 02 16.
Artículo en Inglés | MEDLINE | ID: mdl-36474423

RESUMEN

OBJECTIVE: The aim of this study was to analyze a publicly available sample of rule-based phenotype definitions to characterize and evaluate the variability of logical constructs used. MATERIALS AND METHODS: A sample of 33 preexisting phenotype definitions used in research that are represented using Fast Healthcare Interoperability Resources and Clinical Quality Language (CQL) was analyzed using automated analysis of the computable representation of the CQL libraries. RESULTS: Most of the phenotype definitions include narrative descriptions and flowcharts, while few provide pseudocode or executable artifacts. Most use 4 or fewer medical terminologies. The number of codes used ranges from 5 to 6865, and value sets from 1 to 19. We found that the most common expressions used were literal, data, and logical expressions. Aggregate and arithmetic expressions are the least common. Expression depth ranges from 4 to 27. DISCUSSION: Despite the range of conditions, we found that all of the phenotype definitions consisted of logical criteria, representing both clinical and operational logic, and tabular data, consisting of codes from standard terminologies and keywords for natural language processing. The total number and variety of expressions are low, which may be to simplify implementation, or authors may limit complexity due to data availability constraints. CONCLUSIONS: The phenotype definitions analyzed show significant variation in specific logical, arithmetic, and other operators but are all composed of the same high-level components, namely tabular data and logical expressions. A standard representation for phenotype definitions should support these formats and be modular to support localization and shared logic.


Asunto(s)
Registros Electrónicos de Salud , Lenguaje , Fenotipo , Narración
12.
Pharmacogenet Genomics ; 22(1): 32-42, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22108237

RESUMEN

OBJECTIVE: Tacrolimus, an immunosuppressive drug widely prescribed in kidney transplantation, requires therapeutic drug monitoring due to its marked interindividual pharmacokinetic variability and narrow therapeutic index. Previous studies have established that CYP3A5 rs776746 is associated with tacrolimus clearance, blood concentration, and dose requirement. The importance of other drug absorption, distribution, metabolism, and elimination (ADME) gene variants has not been well characterized. METHODS: We used novel DNA biobank and electronic medical record resources to identify ADME variants associated with tacrolimus dose requirement. Broad ADME genotyping was performed on 446 kidney transplant recipients, who had been dosed to a steady state with tacrolimus. The cohort was obtained from Vanderbilt's DNA biobank, BioVU, which contains linked deidentified electronic medical record data. Genotyping included Affymetrix drug-metabolizing enzymes and transporters Plus (1936 polymorphisms), custom Sequenom Massarray iPLEX Gold assay (95 polymorphisms), and ancestry-informative markers. The primary outcome was tacrolimus dose requirement defined as blood concentration to dose ratio. RESULTS: In analyses, which adjusted for race and other clinical factors, we replicated the association of tacrolimus blood concentration to dose ratio with CYP3A5 rs776746 (P=7.15×10), and identified associations with nine variants in linkage disequilibrium with rs776746, including eight CYP3A4 variants. No NR1I2 variants were significantly associated. Age, weight, and hemoglobin were also significantly associated with the outcome. In final models, rs776746 explained 39% of variability in dose requirement and 46% was explained by the model containing clinical covariates. CONCLUSION: This study highlights the utility of DNA biobanks and electronic medical records for tacrolimus pharmacogenomic research.


Asunto(s)
Citocromo P-450 CYP3A/genética , Registros Electrónicos de Salud , Inmunosupresores/farmacocinética , Trasplante de Riñón/inmunología , Tacrolimus/farmacocinética , Subfamilia B de Transportador de Casetes de Unión a ATP , Miembro 1 de la Subfamilia B de Casetes de Unión a ATP/genética , Adulto , Factores de Edad , Peso Corporal/genética , Bases de Datos de Ácidos Nucleicos , Relación Dosis-Respuesta a Droga , Monitoreo de Drogas , Femenino , Estudios de Asociación Genética , Genotipo , Hemoglobinas/genética , Humanos , Inmunosupresores/administración & dosificación , Inmunosupresores/sangre , Desequilibrio de Ligamiento , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple/genética , Receptor X de Pregnano , Receptores de Esteroides/genética , Tacrolimus/administración & dosificación , Tacrolimus/sangre
13.
Nat Commun ; 12(1): 168, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33420026

RESUMEN

Increasingly, clinical phenotypes with matched genetic data from bio-bank linked electronic health records (EHRs) have been used for pleiotropy analyses. Thus far, pleiotropy analysis using individual-level EHR data has been limited to data from one site. However, it is desirable to integrate EHR data from multiple sites to improve the detection power and generalizability of the results. Due to privacy concerns, individual-level patients' data are not easily shared across institutions. As a result, we introduce Sum-Share, a method designed to efficiently integrate EHR and genetic data from multiple sites to perform pleiotropy analysis. Sum-Share requires only summary-level data and one round of communication from each site, yet it produces identical test statistics compared with that of pooled individual-level data. Consequently, Sum-Share can achieve lossless integration of multiple datasets. Using real EHR data from eMERGE, Sum-Share is able to identify 1734 potential pleiotropic SNPs for five cardiovascular diseases.


Asunto(s)
Registros Electrónicos de Salud/estadística & datos numéricos , Pleiotropía Genética , Comunicación , Bases de Datos Factuales , Estudio de Asociación del Genoma Completo/estadística & datos numéricos , Humanos , Modelos Biológicos , Fenotipo , Polimorfismo de Nucleótido Simple , Privacidad
14.
Sci Rep ; 9(1): 6077, 2019 04 15.
Artículo en Inglés | MEDLINE | ID: mdl-30988330

RESUMEN

Benign prostatic hyperplasia (BPH) results in a significant public health burden due to the morbidity caused by the disease and many of the available remedies. As much as 70% of men over 70 will develop BPH. Few studies have been conducted to discover the genetic determinants of BPH risk. Understanding the biological basis for this condition may provide necessary insight for development of novel pharmaceutical therapies or risk prediction. We have evaluated SNP-based heritability of BPH in two cohorts and conducted a genome-wide association study (GWAS) of BPH risk using 2,656 cases and 7,763 controls identified from the Electronic Medical Records and Genomics (eMERGE) network. SNP-based heritability estimates suggest that roughly 60% of the phenotypic variation in BPH is accounted for by genetic factors. We used logistic regression to model BPH risk as a function of principal components of ancestry, age, and imputed genotype data, with meta-analysis performed using METAL. The top result was on chromosome 22 in SYN3 at rs2710383 (p-value = 4.6 × 10-7; Odds Ratio = 0.69, 95% confidence interval = 0.55-0.83). Other suggestive signals were near genes GLGC, UNCA13, SORCS1 and between BTBD3 and SPTLC3. We also evaluated genetically-predicted gene expression in prostate tissue. The most significant result was with increasing predicted expression of ETV4 (chr17; p-value = 0.0015). Overexpression of this gene has been associated with poor prognosis in prostate cancer. In conclusion, although there were no genome-wide significant variants identified for BPH susceptibility, we present evidence supporting the heritability of this phenotype, have identified suggestive signals, and evaluated the association between BPH and genetically-predicted gene expression in prostate.


Asunto(s)
Predisposición Genética a la Enfermedad , Patrón de Herencia , Hiperplasia Prostática/genética , Anciano , Anciano de 80 o más Años , Biomarcadores/metabolismo , Estudios de Casos y Controles , Registros Electrónicos de Salud/estadística & datos numéricos , Perfilación de la Expresión Génica , Estudio de Asociación del Genoma Completo , Técnicas de Genotipaje , Humanos , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple , Próstata/patología , Hiperplasia Prostática/epidemiología , Hiperplasia Prostática/patología
15.
Am J Cardiol ; 121(12): 1552-1557, 2018 06 15.
Artículo en Inglés | MEDLINE | ID: mdl-29627106

RESUMEN

Diastolic dysfunction (DD), an abnormality in cardiac left ventricular (LV) chamber compliance, is associated with increased morbidity and mortality. Although DD has been extensively studied in older populations, co-morbidity patterns are less well characterized in middle-aged subjects. We screened 156,434 subjects with transthoracic echocardiogram reports available through Vanderbilt's electronic heath record and identified 6,612 subjects 40 to 55 years old with an LV ejection fraction ≥50% and diastolic function staging. We tested 452 incident and prevalent clinical diagnoses for associations with early-stage DD (n = 1,676) versus normal function. There were 44 co-morbid diagnoses associated with grade 1 DD including hypertension (odds ratio [OR] = 2.02, 95% confidence interval [CI] 1.78 to 2.28, p <5.3 × 10-29), type 2 diabetes (OR 1.96, 95% CI 1.68 to 2.29, p = 2.1 × 10-17), tachycardia (OR 1.38, 95% CI 0.53 to 2.19, p = 2.9 × 10-6), obesity (OR 1.76, 95% CI 1.51 to 2.06, p = 1.7 × 10-12), and clinical end points, including end-stage renal disease (OR 3.29, 95% CI 2.19 to 4.96, p = 1.2 × 10-8) and stroke (OR 1.5, 95% CI 1.12 to 2.02, p = 6.9 × 10-3). Among the 60 incident diagnoses associated with DD, heart failure with preserved ejection fraction (OR 4.63, 95% CI 3.39 to 6.32, p = 6.3 × 10-22) had the most significant association. Among subjects with normal diastolic function and blood pressure at baseline, a blood pressure measurement in the hypertensive range at the time of the second echocardiogram was associated with progression to stage 1 DD (p = 0.04). In conclusion, DD was common among subjects 40 to 55 years old and was associated with a heavy burden of co-morbid disease.


Asunto(s)
Insuficiencia Cardíaca Diastólica/diagnóstico por imagen , Insuficiencia Cardíaca Diastólica/epidemiología , Volumen Sistólico/fisiología , Disfunción Ventricular Izquierda/diagnóstico por imagen , Disfunción Ventricular Izquierda/epidemiología , Adulto , Distribución por Edad , Estudios de Cohortes , Bases de Datos Factuales , Diabetes Mellitus Tipo 2/diagnóstico , Diabetes Mellitus Tipo 2/epidemiología , Ecocardiografía/métodos , Femenino , Insuficiencia Cardíaca Diastólica/fisiopatología , Humanos , Hipertensión/diagnóstico , Hipertensión/epidemiología , Incidencia , Estimación de Kaplan-Meier , Modelos Lineales , Masculino , Persona de Mediana Edad , Análisis Multivariante , Pronóstico , Estudios Retrospectivos , Medición de Riesgo , Índice de Severidad de la Enfermedad , Distribución por Sexo , Análisis de Supervivencia , Estados Unidos/epidemiología , Disfunción Ventricular Izquierda/fisiopatología
16.
J Clin Endocrinol Metab ; 103(6): 2234-2243, 2018 06 01.
Artículo en Inglés | MEDLINE | ID: mdl-29659871

RESUMEN

Context: Mutations in alkaline phosphatase (AlkP), liver/bone/kidney (ALPL), which encodes tissue-nonspecific isozyme AlkP, cause hypophosphatasia (HPP). HPP is suspected by a low-serum AlkP. We hypothesized that some patients with bone or dental disease have undiagnosed HPP, caused by ALPL variants. Objective: Our objective was to discover the prevalence of these gene variants in the Vanderbilt University DNA Biobank (BioVU) and to assess phenotypic associations. Design: We identified subjects in BioVU, a repository of DNA, that had at least one of three known, rare HPP disease-causing variants in ALPL: rs199669988, rs121918007, and/or rs121918002. To evaluate for phenotypic associations, we conducted a sequential phenome-wide association study of ALPL variants and then performed a de-identified manual record review to refine the phenotype. Results: Out of 25,822 genotyped individuals, we identified 52 women and 53 men with HPP disease-causing variants in ALPL, 7/1000. None had a clinical diagnosis of HPP. For patients with ALPL variants, the average serum AlkP levels were in the lower range of normal or lower. Forty percent of men and 62% of women had documented bone and/or dental disease, compatible with the diagnosis of HPP. Forty percent of the female patients had ovarian pathology or other gynecological abnormalities compared with 15% seen in controls. Conclusions: Variants in the ALPL gene cause bone and dental disease in patients with and without the standard biomarker, low plasma AlkP. ALPL gene variants are more prevalent than currently reported and underdiagnosed. Gynecologic disease appears to be associated with HPP-causing variants in ALPL.


Asunto(s)
Fosfatasa Alcalina/genética , Hipofosfatasia/genética , Enfermedades del Ovario/genética , Polimorfismo de Nucleótido Simple , Enfermedades Uterinas/genética , Adulto , Anciano , Anciano de 80 o más Años , Alelos , Análisis Mutacional de ADN , Femenino , Frecuencia de los Genes , Predisposición Genética a la Enfermedad , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Mutación , Fenotipo
17.
J Am Med Inform Assoc ; 25(3): 275-288, 2018 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-29036387

RESUMEN

OBJECTIVE: Birth month and climate impact lifetime disease risk, while the underlying exposures remain largely elusive. We seek to uncover distal risk factors underlying these relationships by probing the relationship between global exposure variance and disease risk variance by birth season. MATERIAL AND METHODS: This study utilizes electronic health record data from 6 sites representing 10.5 million individuals in 3 countries (United States, South Korea, and Taiwan). We obtained birth month-disease risk curves from each site in a case-control manner. Next, we correlated each birth month-disease risk curve with each exposure. A meta-analysis was then performed of correlations across sites. This allowed us to identify the most significant birth month-exposure relationships supported by all 6 sites while adjusting for multiplicity. We also successfully distinguish relative age effects (a cultural effect) from environmental exposures. RESULTS: Attention deficit hyperactivity disorder was the only identified relative age association. Our methods identified several culprit exposures that correspond well with the literature in the field. These include a link between first-trimester exposure to carbon monoxide and increased risk of depressive disorder (R = 0.725, confidence interval [95% CI], 0.529-0.847), first-trimester exposure to fine air particulates and increased risk of atrial fibrillation (R = 0.564, 95% CI, 0.363-0.715), and decreased exposure to sunlight during the third trimester and increased risk of type 2 diabetes mellitus (R = -0.816, 95% CI, -0.5767, -0.929). CONCLUSION: A global study of birth month-disease relationships reveals distal risk factors involved in causal biological pathways that underlie them.

18.
Nat Commun ; 9(1): 3522, 2018 08 30.
Artículo en Inglés | MEDLINE | ID: mdl-30166544

RESUMEN

Defining the full spectrum of human disease associated with a biomarker is necessary to advance the biomarker into clinical practice. We hypothesize that associating biomarker measurements with electronic health record (EHR) populations based on shared genetic architectures would establish the clinical epidemiology of the biomarker. We use Bayesian sparse linear mixed modeling to calculate SNP weightings for 53 biomarkers from the Atherosclerosis Risk in Communities study. We use the SNP weightings to computed predicted biomarker values in an EHR population and test associations with 1139 diagnoses. Here we report 116 associations meeting a Bonferroni level of significance. A false discovery rate (FDR)-based significance threshold reveals more known and undescribed associations across a broad range of biomarkers, including biometric measures, plasma proteins and metabolites, functional assays, and behaviors. We confirm an inverse association between LDL-cholesterol level and septicemia risk in an independent epidemiological cohort. This approach efficiently discovers biomarker-disease associations.


Asunto(s)
Biomarcadores/análisis , Registros Electrónicos de Salud , Estudio de Asociación del Genoma Completo/métodos , Teorema de Bayes , Biomarcadores/sangre , LDL-Colesterol/sangre , Humanos , Estudios Prospectivos , Factores de Riesgo
19.
Nat Commun ; 9(1): 2904, 2018 07 25.
Artículo en Inglés | MEDLINE | ID: mdl-30046033

RESUMEN

Electrocardiographic PR interval measures atrio-ventricular depolarization and conduction, and abnormal PR interval is a risk factor for atrial fibrillation and heart block. Our genome-wide association study of over 92,000 European-descent individuals identifies 44 PR interval loci (34 novel). Examination of these loci reveals known and previously not-yet-reported biological processes involved in cardiac atrial electrical activity. Genes in these loci are over-represented in cardiac disease processes including heart block and atrial fibrillation. Variants in over half of the 44 loci were associated with atrial or blood transcript expression levels, or were in high linkage disequilibrium with missense variants. Six additional loci were identified either by meta-analysis of ~105,000 African and European-descent individuals and/or by pleiotropic analyses combining PR interval with heart rate, QRS interval, and atrial fibrillation. These findings implicate developmental pathways, and identify transcription factors, ion-channel genes, and cell-junction/cell-signaling proteins in atrio-ventricular conduction, identifying potential targets for drug development.


Asunto(s)
Función Atrial/fisiología , Nodo Atrioventricular/fisiología , Fenómenos Electrofisiológicos/genética , Estudio de Asociación del Genoma Completo , Electrocardiografía , Femenino , Humanos , Desequilibrio de Ligamiento/genética , Masculino , Mutación Missense/genética , Factores de Riesgo
20.
Pac Symp Biocomput ; 22: 348-355, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-27896988

RESUMEN

The major goal of precision medicine is to improve human health. A feature that unites much research in the field is the use of large datasets such as genomic data and electronic health records. Research in this field includes examination of variation in the core bases of DNA and their methylation status, through variations in metabolic and signaling molecules, all the way up to broader systems level changes in physiology and disease presentation. Intermediate goals include understanding the individual drivers of disease that differentiate the cause of disease in each individual. To match this development of approaches to physical and activitybased measurements, computational approaches to using these new streams of data to better understand improve human health are being rapidly developed by the thriving biomedical informatics research community. This session of the 2017 Pacific Symposium of Biocomputing presents some of the latest advances in the capture, analysis and use of diverse biomedical data in precision medicine.


Asunto(s)
Medicina de Precisión/estadística & datos numéricos , Biología Computacional , Bases de Datos Factuales , Registros Electrónicos de Salud , Genómica , Humanos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA