Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 90
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Trends Genet ; 35(1): 29-41, 2019 01.
Artículo en Inglés | MEDLINE | ID: mdl-30342790

RESUMEN

The field of human genomics has changed dramatically over time. Initial genomic studies were predominantly restricted to rare disorders in small families. Over the past decade, researchers changed course from family-based studies and instead focused on common diseases and traits in populations of unrelated individuals. With further advancements in biobanking, computer science, electronic health record (EHR) data, and more affordable high-throughput genomics, we are experiencing a new paradigm in human genomic research. Rapidly changing technologies and resources now make it possible to study thousands of diseases simultaneously at the genomic level. This review will focus on these advancements as scientists begin to incorporate phenome-wide strategies in human genomic research to understand the etiology of human diseases and develop new drugs to treat them.


Asunto(s)
Enfermedades Genéticas Congénitas/genética , Genoma Humano/genética , Genómica/tendencias , Bancos de Muestras Biológicas , Registros Electrónicos de Salud , Enfermedades Genéticas Congénitas/terapia , Estudio de Asociación del Genoma Completo , Humanos , Fenotipo , Polimorfismo de Nucleótido Simple/genética
2.
Pharmacogenet Genomics ; 32(1): 1-9, 2022 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-34380996

RESUMEN

OBJECTIVES: Primary nonresponse (PNR) to antitumor necrosis factor-α (TNFα) biologics is a serious concern in patients with inflammatory bowel disease (IBD). We aimed to identify the genetic variants associated with PNR. PATIENTS AND METHODS: Patients were recruited from outpatient GI clinics and PNR was determined using both clinical and endoscopic findings. A case-control genome-wide association study was performed in 589 IBD patients and associations were replicated in an independent cohort of 293 patients. Effect of the associated variant on gene expression and TNFα secretion was assessed by cell-based assays. Pleiotropic effects were investigated by Phenome-wide association study (PheWAS). RESULTS: We identified rs34767465 as associated with PNR to anti-TNFα therapy (odds ratio: 2.07, 95% CI, 1.46-2.94, P = 2.43 × 10-7, [replication odds ratio: 1.8, 95% CI, 1.04-3.16, P = 0.03]). rs34767465 is a multiple-tissue expression quantitative trait loci for FAM114A2. Using RNA-sequencing and protein quantification from HapMap lymphoblastoid cell lines (LCLs), we found a significant decrease in FAM114A2 mRNA and protein expression in both heterozygous and homozygous genotypes when compared to wild type LCLs. TNFα secretion was significantly higher in THP-1 cells [differentiated into macrophages] with FAM114A2 knockdown versus controls. Immunoblotting experiments showed that depletion of FAM114A2 impaired autophagy-related pathway genes suggesting autophagy-mediated TNFα secretion as a potential mechanism. PheWAS showed rs34767465 was associated with comorbid conditions found in IBD patients (derangement of joints [P = 3.7 × 10-4], pigmentary iris degeneration [P = 5.9 × 10-4], diverticulum of esophagus [P = 7 × 10-4]). CONCLUSIONS: We identified a variant rs34767465 associated with PNR to anti-TNFα biologics, which increases TNFα secretion through mechanism related to autophagy. rs34767465 may also explain the comorbidities associated with IBD.


Asunto(s)
Estudio de Asociación del Genoma Completo , Enfermedades Inflamatorias del Intestino , Estudios de Casos y Controles , Estudios de Cohortes , Humanos , Enfermedades Inflamatorias del Intestino/tratamiento farmacológico , Enfermedades Inflamatorias del Intestino/genética , Factor de Necrosis Tumoral alfa/genética
3.
Bioinformatics ; 37(21): 3966-3968, 2021 11 05.
Artículo en Inglés | MEDLINE | ID: mdl-34086863

RESUMEN

MOTIVATION: The use and functionality of Electronic Health Records (EHR) have increased rapidly in the past few decades. EHRs are becoming an important depository of patient health information and can capture family data. Pedigree analysis is a longstanding and powerful approach that can gain insight into the underlying genetic and environmental factors in human health, but traditional approaches to identifying and recruiting families are low-throughput and labor-intensive. Therefore, high-throughput methods to automatically construct family pedigrees are needed. RESULTS: We developed a stand-alone application: Electronic Pedigrees, or E-Pedigrees, which combines two validated family prediction algorithms into a single software package for high throughput pedigrees construction. The convenient platform considers patients' basic demographic information and/or emergency contact data to infer high-accuracy parent-child relationship. Importantly, E-Pedigrees allows users to layer in additional pedigree data when available and provides options for applying different logical rules to improve accuracy of inferred family relationships. This software is fast and easy to use, is compatible with different EHR data sources, and its output is a standard PED file appropriate for multiple downstream analyses. AVAILABILITY AND IMPLEMENTATION: The Python 3.3+ version E-Pedigrees application is freely available on: https://github.com/xiayuan-huang/E-pedigrees.


Asunto(s)
Algoritmos , Programas Informáticos , Humanos , Linaje , Registros Electrónicos de Salud
4.
Clin Genet ; 101(4): 429-441, 2022 04.
Artículo en Inglés | MEDLINE | ID: mdl-35112343

RESUMEN

The aim of this study was to explore kidney failure (KF) in Bardet-Biedl syndrome (BBS), focusing on high-risk gene variants, demographics, and morbidity. We employed the Clinical Registry Investigating BBS (CRIBBS) to identify 44 (7.2%) individuals with KF out of 607 subjects. Molecularly confirmed BBS was identified in 37 KF subjects and 364 CRIBBS registrants. KF was concomitant with recessive causal variants in 12 genes, with BBS10 the most predominant causal gene (26.6%), while disease penetrance was highest in SDCCAG8 (100%). Two truncating variants were present in 67.6% of KF cases. KF incidence was increased in genes not belonging to the BBSome or chaperonin-like genes (p < 0.001), including TTC21B, a new candidate BBS gene. Median age of KF was 12.5 years, with the vast majority of KF occurring by 30 years (86.3%). Females were disproportionately affected (77.3%). Diverse uropathies were identified, but were not more common in the KF group (p = 0.672). Kidney failure was evident in 11 of 15 (73.3%) deaths outside infancy. We conclude that KF poses a significant risk for premature morbidity in BBS. Risk factors for KF include female sex, truncating variants, and genes other than BBSome/chaperonin-like genes highlighting the value of comprehensive genetic investigation.


Asunto(s)
Síndrome de Bardet-Biedl , Insuficiencia Renal , Síndrome de Bardet-Biedl/complicaciones , Síndrome de Bardet-Biedl/genética , Chaperoninas/genética , Niño , Femenino , Humanos , Masculino , Mutación , Penetrancia , Insuficiencia Renal/genética
5.
J Am Soc Nephrol ; 32(8): 2031-2047, 2021 08.
Artículo en Inglés | MEDLINE | ID: mdl-33941608

RESUMEN

BACKGROUND: Genetic variants in complement genes have been associated with a wide range of human disease states, but well-powered genetic association studies of complement activation have not been performed in large multiethnic cohorts. METHODS: We performed medical records-based genome-wide and phenome-wide association studies for plasma C3 and C4 levels among participants of the Electronic Medical Records and Genomics (eMERGE) network. RESULTS: In a GWAS for C3 levels in 3949 individuals, we detected two genome-wide significant loci: chr.1q31.3 (CFH locus; rs3753396-A; ß=0.20; 95% CI, 0.14 to 0.25; P=1.52x10-11) and chr.19p13.3 (C3 locus; rs11569470-G; ß=0.19; 95% CI, 0.13 to 0.24; P=1.29x10-8). These two loci explained approximately 2% of variance in C3 levels. GWAS for C4 levels involved 3998 individuals and revealed a genome-wide significant locus at chr.6p21.32 (C4 locus; rs3135353-C; ß=0.40; 95% CI, 0.34 to 0.45; P=4.58x10-35). This locus explained approximately 13% of variance in C4 levels. The multiallelic copy number variant analysis defined two structural genomic C4 variants with large effect on blood C4 levels: C4-BS (ß=-0.36; 95% CI, -0.42 to -0.30; P=2.98x10-22) and C4-AL-BS (ß=0.25; 95% CI, 0.21 to 0.29; P=8.11x10-23). Overall, C4 levels were strongly correlated with copy numbers of C4A and C4B genes. In comprehensive phenome-wide association studies involving 102,138 eMERGE participants, we cataloged a full spectrum of autoimmune, cardiometabolic, and kidney diseases genetically related to systemic complement activation. CONCLUSIONS: We discovered genetic determinants of plasma C3 and C4 levels using eMERGE genomic data linked to electronic medical records. Genetic variants regulating C3 and C4 levels have large effects and multiple clinical correlations across the spectrum of complement-related diseases in humans.


Asunto(s)
Complemento C3/genética , Complemento C3/metabolismo , Complemento C4/genética , Complemento C4/metabolismo , Variación Genética , Registros Médicos , Adulto , Anciano , Alelos , Activación de Complemento/genética , Bases de Datos Genéticas , Estudios Epidemiológicos , Femenino , Dosificación de Gen , Sitios Genéticos , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Registro Médico Coordinado , Persona de Mediana Edad , Adulto Joven
6.
Am J Med Genet A ; 185(2): 508-516, 2021 02.
Artículo en Inglés | MEDLINE | ID: mdl-36046768

RESUMEN

Population-based genomic screening has the potential to improve health outcomes by identifying genetic causes of disease before they occur. While much attention has been paid to supporting the needs of the small percentage of patients who will receive a life-altering positive genomic screening result that requires medical attention, little attention has been given to the communication of negative screening results. As there are currently no best practices for returning negative genomic screening results, we drew on experiences across the electronic medical records and genomics (eMERGE) III Network to highlight the diversity of reporting methods employed, challenges encountered in reporting negative test results, and "lessons learned" across institutions. A 60-item survey that consisted of both multiple choice and open-ended questions was created to gather data across institutions. Even though institutions independently developed procedures for reporting negative results, and had very different study populations, we identified several similarities of approach, including but not limited to: returning results by mail, placing results in the electronic health record via an automated process, reporting results to participants' primary care provider, and providing genetic counseling to interested patients at no cost. Differences in procedures for reporting negative results included: differences in terminology used to describe negative results, definitions of negative results, guidance regarding the meaning of negative results for participants and their family members, and recommendations for clinical follow up. Our findings highlight emerging practices for reporting negative genomic screening results and highlight the need to create patient education and clinical support tools for reporting negative screening results.

7.
Lupus ; 30(8): 1264-1272, 2021 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-33977795

RESUMEN

OBJECTIVES: To test the hypothesis that genetic predisposition to systemic lupus erythematosus (SLE) increases the risk of cardiometabolic disorders. METHODS: Using 41 single nucleotide polymorphisms (SNPs) associated with SLE, we calculated a weighted genetic risk score (wGRS) for SLE. In a large biobank we tested the association between this wGRS and 9 cardiometabolic phenotypes previously associated with SLE: atrial fibrillation, ischemic stroke, coronary artery disease, type 1 and type 2 diabetes, obesity, chronic kidney disease, hypertension, and hypercholesterolemia. Additionally, we performed a phenome-wide association analysis (pheWAS) to discover novel clinical associations with a genetic predisposition to SLE. Findings were replicated in the Electronic Medical Records and Genomics (eMERGE) Network. To further define the association between SLE-related risk alleles and the selected cardiometabolic phenotypes, we performed an inverse variance weighted regression (IVWR) meta-analysis. RESULTS: The wGRS for SLE was calculated in 74,759 individuals of European ancestry. Among the pre-selected phenotypes, the wGRS was significantly associated with type 1 diabetes (OR [95%CI] =1.11 [1.06, 1.17], P-value = 1.05x10-5). In the PheWAS, the wGRS was associated with several autoimmune phenotypes, kidney disorders, and skin neoplasm; but only the associations with autoimmune phenotypes were replicated. In the IVWR meta-analysis, SLE-related risk alleles were nominally associated with type 1 diabetes (P = 0.048) but the associations were heterogeneous and did not meet the adjusted significance threshold. CONCLUSION: A weighted GRS for SLE was associated with an increased risk of several autoimmune-related phenotypes including type I diabetes but not with cardiometabolic disorders.


Asunto(s)
Enfermedades Cardiovasculares , Lupus Eritematoso Sistémico , Enfermedades Metabólicas , Alelos , Enfermedades Cardiovasculares/genética , Diabetes Mellitus Tipo 1/genética , Diabetes Mellitus Tipo 2 , Predisposición Genética a la Enfermedad , Humanos , Lupus Eritematoso Sistémico/genética , Polimorfismo de Nucleótido Simple
8.
World J Surg ; 44(1): 84-94, 2020 01.
Artículo en Inglés | MEDLINE | ID: mdl-31605180

RESUMEN

BACKGROUND: The extent to which obesity and genetics determine postoperative complications is incompletely understood. METHODS: We performed a retrospective study using two population cohorts with electronic health record (EHR) data. The first included 736,726 adults with body mass index (BMI) recorded between 1990 and 2017 at Vanderbilt University Medical Center. The second cohort consisted of 65,174 individuals from 12 institutions contributing EHR and genome-wide genotyping data to the Electronic Medical Records and Genomics (eMERGE) Network. Pairwise logistic regression analyses were used to measure the association of BMI categories with postoperative complications derived from International Classification of Disease-9 codes, including postoperative infection, incisional hernia, and intestinal obstruction. A genetic risk score was constructed from 97 obesity-risk single-nucleotide polymorphisms for a Mendelian randomization study to determine the association of genetic risk of obesity on postoperative complications. Logistic regression analyses were adjusted for sex, age, site, and race/principal components. RESULTS: Individuals with overweight or obese BMI (≥25 kg/m2) had increased risk of incisional hernia (odds ratio [OR] 1.7-5.5, p < 3.1 × 10-20), and people with obesity (BMI ≥ 30 kg/m2) had increased risk of postoperative infection (OR 1.2-2.3, p < 2.5 × 10-5). In the eMERGE cohort, genetically predicted BMI was associated with incisional hernia (OR 2.1 [95% CI 1.8-2.5], p = 1.4 × 10-6) and postoperative infection (OR 1.6 [95% CI 1.4-1.9], p = 3.1 × 10-6). Association findings were similar after limitation of the cohorts to those who underwent abdominal procedures. CONCLUSIONS: Clinical and Mendelian randomization studies suggest that obesity, as measured by BMI, is associated with the development of postoperative incisional hernia and infection.


Asunto(s)
Análisis de la Aleatorización Mendeliana/métodos , Obesidad/complicaciones , Complicaciones Posoperatorias/genética , Adulto , Índice de Masa Corporal , Femenino , Humanos , Modelos Logísticos , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple , Complicaciones Posoperatorias/etiología , Estudios Retrospectivos , Factores de Riesgo
9.
Clin Med Res ; 18(2-3): 58-67, 2020 08.
Artículo en Inglés | MEDLINE | ID: mdl-31959670

RESUMEN

BACKGROUND: Cardiovascular anomalies are more common in monochorionic twins, especially with twin-twin transfusion, compared to other twin types and to singletons. Because previous studies are based on fetal and neonatal echocardiography, more information is needed to study prevalence of cardiac anomalies in twin miscarriages, stillbirths, and children after the immediate neonatal period. METHODS: With specific attention to cardiac anomalies, we reviewed the medical records of 335 selected liveborn twin pairs from the Marshfield Clinic Twin Cohort (enriched for twin-twin transfusion) and all twins (175 pairs) identified in the Wisconsin Stillbirth Service Program cohort of late miscarriages and stillbirths. RESULTS: Structural cardiac defects occurred in 12% of liveborn monochorionic twin infants and 7.5% of stillborn infants with twin-twin transfusion compared to only 2% of liveborn dizygotic twins and no stillborn dizygotic infants. The most common cardiac lesion in liveborn twins was ventricular septal defect, which was usually isolated and discordant, preferentially affecting the smaller twin in monochorionic pairs. Among stillborn and miscarried monochorionic twins, the most common cardiac lesion was acardia. CONCLUSIONS: Monochorionic twins, particularly those with TTT, are at increased risk for a spectrum of structural cardiac malformations which we suggest may be related to asymmetry of the inner cell mass resulting in a smaller poorly perfused twin. In severe cases, limited cardiac and circulatory development in the affected twin leads to acardia. In less severe cases, the smaller infant has deficient septal growth that sometimes results in ventricular septal defect.


Asunto(s)
Transfusión Feto-Fetal , Cardiopatías Congénitas , Nacimiento Vivo/epidemiología , Mortinato/epidemiología , Gemelos Monocigóticos , Adolescente , Adulto , Niño , Preescolar , Femenino , Transfusión Feto-Fetal/epidemiología , Transfusión Feto-Fetal/patología , Estudios de Seguimiento , Cardiopatías Congénitas/epidemiología , Cardiopatías Congénitas/patología , Humanos , Lactante , Recién Nacido , Masculino , Embarazo , Factores de Riesgo
10.
Hum Genet ; 138(10): 1155-1169, 2019 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-31342140

RESUMEN

Vitamin D inadequacy, assessed by 25-hydroxyvitamin D [25(OH)D], affects around 50% of adults in the United States and is associated with numerous adverse health outcomes. Blood 25(OH)D concentrations are influenced by genetic factors that may determine how much vitamin D intake is required to reach optimal 25(OH)D. Despite large genome-wide association studies (GWASs), only a small portion of the genetic factors contributing to differences in 25(OH)D has been discovered. Therefore, knowledge of a fuller set of genetic factors could be useful for risk prediction of 25(OH)D inadequacy, personalized vitamin D supplementation, and prevention of downstream morbidity and mortality. Using PRSice and weights from published African- and European-ancestry GWAS summary statistics, ancestry-specific polygenic scores (PGSs) were created to capture a more complete set of genetic factors in those of European (n = 9569) or African ancestry (n = 2761) from three cohort studies. The PGS for African ancestry was derived using all input SNPs (a p value cutoff of 1.0) and had an R2 of 0.3%; for European ancestry, the optimal PGS used a p value cutoff of 3.5 × 10-4 in the target/tuning dataset and had an R2 of 1.0% in the validation cohort. Those with highest genetic risk had 25(OH)D that was 2.8-3.0 ng/mL lower than those with lowest genetic risk (p = 0.0463-3.2 × 10-13), requiring an additional 467-500 IU of vitamin D intake to maintain equivalent 25(OH)D. PGSs are a powerful predictive tool that could be leveraged for personalized vitamin D supplementation to prevent the negative downstream effects of 25(OH)D inadequacy.


Asunto(s)
Población Negra/genética , Genética de Población , Patrón de Herencia , Herencia Multifactorial , Polimorfismo de Nucleótido Simple , Vitamina D/análogos & derivados , Población Blanca/genética , Estudios de Cohortes , Bases de Datos Genéticas , Suplementos Dietéticos , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Persona de Mediana Edad , Rayos Ultravioleta , Vitamina D/sangre
11.
Bioinformatics ; 34(4): 635-642, 2018 02 15.
Artículo en Inglés | MEDLINE | ID: mdl-28968884

RESUMEN

Motivation: Pedigree analysis is a longstanding and powerful approach to gain insight into the underlying genetic factors in human health, but identifying, recruiting and genotyping families can be difficult, time consuming and costly. Development of high throughput methods to identify families and foster downstream analyses are necessary. Results: This paper describes simple methods that allowed us to identify 173 368 family pedigrees with high probability using basic demographic data available in most electronic health records (EHRs). We further developed and validate a novel statistical method that uses EHR data to identify families more likely to have a major genetic component to their diseases risk. Lastly, we showed that incorporating EHR-linked family data into genetic association testing may provide added power for genetic mapping without additional recruitment or genotyping. The totality of these results suggests that EHR-linked families can enable classical genetic analyses in a high-throughput manner. Availability and implementation: Pseudocode is provided as supplementary information. Contact: HEBBRING.SCOTT@marshfieldresearch.org. Supplementary information: Supplementary data are available at Bioinformatics online.


Asunto(s)
Registros Electrónicos de Salud , Investigación Genética , Genoma Humano , Linaje , Algoritmos , Mapeo Cromosómico , Bases de Datos Factuales , Femenino , Estudios de Asociación Genética , Enfermedades Genéticas Congénitas , Humanos , Masculino , Persona de Mediana Edad
12.
J Med Virol ; 90(3): 436-446, 2018 03.
Artículo en Inglés | MEDLINE | ID: mdl-29053189

RESUMEN

Host response to influenza is highly variable, suggesting a potential role of host genetic variation. To investigate the host genetics of severe influenza in a targeted fashion, 32 single nucleotide polymorphisms (SNPs) within viral immune response genes were evaluated for association with seasonal influenza hospitalization in an adult study population with European ancestry. SNP allele and genotype frequencies were compared between hospitalized influenza patients (cases) and population controls in a case-control study that included a discovery group (26 cases and 993 controls) and two independent, validation groups (1 with 84 cases and 4076 controls; the other with 128 cases and 9187 controls). Cases and controls had similar allele frequencies for variant rs12252 in interferon-inducible transmembrane protein 3 (IFITM3) (P > 0.05), and the study did not replicate the previously reported association of rs12252 with hospitalized influenza. In the discovery group, the preliminary finding of an association with a nonsense polymorphism (rs8072510) within the schlafen family member 13 (SFLN13) gene (P = 0.0099) was not confirmed in either validation group. Neither rs12252 nor rs8072510 showed an association according to the presence of clinical risk factors for influenza complications (P > 0.05), suggesting that these factors did not modify associations between the SNPs and hospitalized influenza. No other SNPs showed a statistically significant association with hospitalized influenza. Further research is needed to identify genetic factors involved in host response to seasonal influenza infection and to assess whether rs12252, a low-frequency variant in Europeans, contributes to influenza severity in populations with European ancestry.


Asunto(s)
Predisposición Genética a la Enfermedad , Pruebas Genéticas , Hospitalización/estadística & datos numéricos , Gripe Humana/genética , Adulto , Anciano , Estudios de Casos y Controles , Registros Electrónicos de Salud , Femenino , Frecuencia de los Genes , Variación Genética , Genotipo , Humanos , Subtipo H1N1 del Virus de la Influenza A , Masculino , Proteínas de la Membrana/genética , Persona de Mediana Edad , Proyectos Piloto , Polimorfismo de Nucleótido Simple , Proteínas de Unión al ARN/genética
13.
J Med Genet ; 53(10): 681-9, 2016 10.
Artículo en Inglés | MEDLINE | ID: mdl-27287392

RESUMEN

BACKGROUND: Over 160 disease phenotypes have been mapped to the major histocompatibility complex (MHC) region on chromosome 6 by genome-wide association study (GWAS), suggesting that the MHC region as a whole may be involved in the aetiology of many phenotypes, including unstudied diseases. The phenome-wide association study (PheWAS), a powerful and complementary approach to GWAS, has demonstrated its ability to discover and rediscover genetic associations. The objective of this study is to comprehensively investigate the MHC region by PheWAS to identify new phenotypes mapped to this genetically important region. METHODS: In the current study, we systematically explored the MHC region using PheWAS to associate 2692 MHC-linked variants (minor allele frequency ≥0.01) with 6221 phenotypes in a cohort of 7481 subjects from the Marshfield Clinic Personalized Medicine Research Project. RESULTS: Findings showed that expected associations previously identified by GWAS could be identified by PheWAS (eg, psoriasis, ankylosing spondylitis, type I diabetes and coeliac disease) with some having strong cross-phenotype associations potentially driven by pleiotropic effects. Importantly, novel associations with eight diseases not previously assessed by GWAS (eg, lichen planus) were also identified and replicated in an independent population. Many of these associated diseases appear to be immune-related disorders. Further assessment of these diseases in 16 484 Marshfield Clinic twins suggests that some of these diseases, including lichen planus, may have genetic aetiologies. CONCLUSIONS: These results demonstrate that the PheWAS approach is a powerful and novel method to discover SNP-disease associations, and is ideal when characterising cross-phenotype associations, and further emphasise the importance of the MHC region in human health and disease.


Asunto(s)
Cromosomas Humanos Par 6 , Estudios de Asociación Genética/métodos , Enfermedades del Sistema Inmune/genética , Inflamación/genética , Complejo Mayor de Histocompatibilidad , Polimorfismo Genético , Adulto , Anciano , Femenino , Humanos , Liquen Plano/genética , Masculino , Persona de Mediana Edad , Fenotipo , Población Blanca/genética
14.
Bioinformatics ; 31(12): 1981-7, 2015 Jun 15.
Artículo en Inglés | MEDLINE | ID: mdl-25657332

RESUMEN

MOTIVATION: Genome-wide association studies (GWASs) are effective for describing genetic complexities of common diseases. Phenome-wide association studies (PheWASs) offer an alternative and complementary approach to GWAS using data embedded in the electronic health record (EHR) to define the phenome. International Classification of Disease version 9 (ICD9) codes are used frequently to define the phenome, but using ICD9 codes alone misses other clinically relevant information from the EHR that can be used for PheWAS analyses and discovery. RESULTS: As an alternative to ICD9 coding, a text-based phenome was defined by 23 384 clinically relevant terms extracted from Marshfield Clinic's EHR. Five single nucleotide polymorphisms (SNPs) with known phenotypic associations were genotyped in 4235 individuals and associated across the text-based phenome. All five SNPs genotyped were associated with expected terms (P<0.02), most at or near the top of their respective PheWAS ranking. Raw association results indicate that text data performed equivalently to ICD9 coding and demonstrate the utility of information beyond ICD9 coding for application in PheWAS.


Asunto(s)
Biología Computacional/métodos , Enfermedad/genética , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Genotipo , Humanos
15.
Circ Res ; 115(12): 1017-25, 2014 Dec 05.
Artículo en Inglés | MEDLINE | ID: mdl-25326128

RESUMEN

RATIONALE: Dopamine ß-hydroxylase (DBH) catalyzes the conversion of dopamine to norepinephrine in the central nervous system and peripherally. DBH variants are associated with large changes in circulating DBH and implicated in multiple disorders; yet causal relationships and tissue-specific effects remain unresolved. OBJECTIVE: To characterize regulatory variants in DBH, effect on mRNA expression, and role in modulating sympathetic tone and disease risk. METHODS AND RESULTS: Analysis of DBH mRNA in human tissues confirmed high expression in the locus coeruleus and adrenal gland, but also in sympathetically innervated organs (liver>lung>heart). Allele-specific mRNA assays revealed pronounced allelic expression differences in the liver (2- to 11-fold) attributable to promoter rs1611115 and exon 2 rs1108580, but only small differences in locus coeruleus and adrenals. These alleles were also associated with significantly reduced mRNA expression in liver and lung. Although DBH protein is expressed in other sympathetically innervated organs, mRNA levels were too low for analysis. In mice, hepatic Dbh mRNA levels correlated with cardiovascular risk phenotypes. The minor alleles of rs1611115 and rs1108580 were associated with sympathetic phenotypes, including angina pectoris. Testing combined effects of these variants suggested protection against myocardial infarction in 3 separate clinical cohorts. CONCLUSIONS: We demonstrate profound effects of DBH variants on expression in 2 sympathetically innervated organs, liver and lung, but not in adrenals and brain. Preliminary results demonstrate an association of these variants with clinical phenotypes responsive to peripheral sympathetic tone. We hypothesize that in addition to endocrine effects via circulating DBH and norepinephrine, the variants act in sympathetically innervated target organs.


Asunto(s)
Enfermedades Cardiovasculares/genética , Dopamina beta-Hidroxilasa/genética , Corazón/inervación , Hígado/inervación , Pulmón/inervación , Polimorfismo de Nucleótido Simple , Sistema Nervioso Simpático/enzimología , Desequilibrio Alélico , Animales , Enfermedades Cardiovasculares/enzimología , Enfermedades Cardiovasculares/fisiopatología , Dopamina beta-Hidroxilasa/metabolismo , Exones , Femenino , Regulación Enzimológica de la Expresión Génica , Predisposición Genética a la Enfermedad , Humanos , Masculino , Ratones , Fenotipo , Regiones Promotoras Genéticas , Factores Protectores , ARN Mensajero/metabolismo , Factores de Riesgo , Sistema Nervioso Simpático/fisiopatología , Adulto Joven
16.
J Med Genet ; 52(4): 282-8, 2015 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-25587064

RESUMEN

BACKGROUND: Whole-genome sequencing (WGS) and whole-exome sequencing (WES) technologies are increasingly used to identify disease-contributing mutations in human genomic studies. It can be a significant challenge to process such data, especially when a large family or cohort is sequenced. Our objective was to develop a big data toolset to efficiently manipulate genome-wide variants, functional annotations and coverage, together with conducting family based sequencing data analysis. METHODS: Hadoop is a framework for reliable, scalable, distributed processing of large data sets using MapReduce programming models. Based on Hadoop and HBase, we developed SeqHBase, a big data-based toolset for analysing family based sequencing data to detect de novo, inherited homozygous, or compound heterozygous mutations that may contribute to disease manifestations. SeqHBase takes as input BAM files (for coverage at every site), variant call format (VCF) files (for variant calls) and functional annotations (for variant prioritisation). RESULTS: We applied SeqHBase to a 5-member nuclear family and a 10-member 3-generation family with WGS data, as well as a 4-member nuclear family with WES data. Analysis times were almost linearly scalable with number of data nodes. With 20 data nodes, SeqHBase took about 5 secs to analyse WES familial data and approximately 1 min to analyse WGS familial data. CONCLUSIONS: These results demonstrate SeqHBase's high efficiency and scalability, which is necessary as WGS and WES are rapidly becoming standard methods to study the genetics of familial disorders.


Asunto(s)
Genómica/métodos , Análisis de Secuencia de ADN/métodos , Programas Informáticos , Conjuntos de Datos como Asunto , Exoma , Genoma Humano , Humanos , Mutación
17.
Genet Epidemiol ; 38(8): 692-8, 2014 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-25250975

RESUMEN

Population-based genetic analyses, such as the Genome-Wide Association Study (GWAS), have proven powerful for describing the genetic complexities of common disease in epidemiologic research. However, the significant challenges faced by population-based study designs have resulted in revitalization of family-based approaches, including twin studies. Twin studies are unique in their ability to ascertain both heritable and environmental contributions to human disease. Several regional and national twin registries have been constructed using a variety of methods to identify potential twins. A significant challenge in constructing these large twin registries includes the substantial resources required to recruit participants, collect phenotypic data, and update the registries as time progresses. Here we describe the use of the Marshfield Clinic electronic medical record (EMR) to identify a cohort of 19,226 patients enriched for twins or multiples. This cohort defines the Marshfield Clinic Twin/Multiple Birth Cohort (MCTC). An EMR system provides both a mechanism to identify potential twins and a source of detailed phenotypic data in near real time without the need for patient contact outside standard medical care. To demonstrate that the MCTC can be used for genetic-based epidemiologic research, concordance rates for muscular dystrophy (MD) and fragile-X syndrome-two highly heritable diseases-were assessed. Observations indicate that both MD and fragile-X syndrome are highly correlated among affected twins in the MCTC (P ≅ 3.7 × 10(-6) and 1.1 × 10(-4) , respectively). These findings suggest that EMR systems may not only be an effective resource for predicting families of twins, but can also be rapidly applied to epidemiologic research.


Asunto(s)
Enfermedades en Gemelos/genética , Registros Electrónicos de Salud , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Niño , Preescolar , Estudios de Cohortes , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Lactante , Recién Nacido , Masculino , Persona de Mediana Edad , Progenie de Nacimiento Múltiple , Sistema de Registros , Gemelos/genética , Adulto Joven
18.
Immunology ; 141(2): 157-65, 2014 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-24147732

RESUMEN

Over the last decade, significant technological breakthroughs have revolutionized human genomic research in the form of genome-wide association studies (GWASs). GWASs have identified thousands of statistically significant genetic variants associated with hundreds of human conditions including many with immunological aetiologies (e.g. multiple sclerosis, ankylosing spondylitis and rheumatoid arthritis). Unfortunately, most GWASs fail to identify clinically significant associations. Identifying biologically significant variants by GWAS also presents a challenge. The GWAS is a phenotype-to-genotype approach. As a complementary/alternative approach to the GWAS, investigators have begun to exploit extensive electronic medical record systems to conduct a genotype-to-phenotype approach when studying human disease - specifically, the phenome-wide association study (PheWAS). Although the PheWAS approach is in its infancy, this method has already demonstrated its capacity to rediscover important genetic associations related to immunological diseases/conditions. Furthermore, PheWAS has the advantage of identifying genetic variants with pleiotropic properties. This is particularly relevant for HLA variants. For example, PheWAS results have demonstrated that the HLA-DRB1 variant associated with multiple sclerosis may also be associated with erythematous conditions including rosacea. Likewise, PheWAS has demonstrated that the HLA-B genotype is not only associated with spondylopathies, uveitis, and variability in platelet count, but may also play an important role in other conditions, such as mastoiditis. This review will discuss and compare general PheWAS methodologies, describe both the challenges and advantages of the PheWAS, and provide insight into the potential directions in which PheWAS may lead.


Asunto(s)
Estudio de Asociación del Genoma Completo , Fenotipo , Animales , Genotipo , Humanos , Clasificación Internacional de Enfermedades , Polimorfismo de Nucleótido Simple
19.
AMIA Jt Summits Transl Sci Proc ; 2024: 221-229, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38827091

RESUMEN

We recently demonstrated that electronically constructed family pedigrees (e-pedigrees) have great value in epidemiologic research using electronic health record (EHR) data. Prior to this work, it has been well accepted that family health history is a major predictor for a wide spectrum of diseases, reflecting shared effects of genetics, environment, and lifestyle. With the widespread digitalization of patient data via EHRs, there is an unprecedented opportunity to use machine learning algorithms to better predict disease risk. Although predictive models have previously been constructed for a few important diseases, we currently know very little about how accurately the risk for most diseases can be predicted. It is further unknown if the incorporation of e-pedigrees in machine learning can improve the value of these models. In this study, we devised a family pedigree-driven high-throughput machine learning pipeline to simultaneously predict risks for thousands of diagnosis codes using thousands of input features. Models were built to predict future disease risk for three time windows using both Logistic Regression and XGBoost. For example, we achieved average areas under the receiver operating characteristic curves (AUCs) of 0.82, 0.77 and 0.71 for 1, 6, and 24 months, respectively using XGBoost and without e-pedigrees. When adding e-pedigree features to the XGBoost pipeline, AUCs increased to 0.83, 0.79 and 0.74 for the same three time periods, respectively. E-pedigrees similarly improved the predictions when using Logistic Regression. These results emphasize the potential value of incorporating family health history via e-pedigrees into machine learning with no further human time.

20.
NAR Genom Bioinform ; 6(1): lqae022, 2024 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-38406797

RESUMEN

Breast cancer (BC) is one of the most commonly diagnosed cancers worldwide. As key regulatory molecules in several biological processes, microRNAs (miRNAs) are potential biomarkers for cancer. Understanding the miRNA markers that can detect BC may improve survival rates and develop new targeted therapeutic strategies. To identify a circulating miRNA signature for diagnostic prediction in patients with BC, we developed an evolutionary learning-based method called BSig. BSig established a compact set of miRNAs as potential markers from 1280 patients with BC and 2686 healthy controls retrieved from the serum miRNA expression profiles for the diagnostic prediction. BSig demonstrated outstanding prediction performance, with an independent test accuracy and area under the receiver operating characteristic curve were 99.90% and 0.99, respectively. We identified 12 miRNAs, including hsa-miR-3185, hsa-miR-3648, hsa-miR-4530, hsa-miR-4763-5p, hsa-miR-5100, hsa-miR-5698, hsa-miR-6124, hsa-miR-6768-5p, hsa-miR-6800-5p, hsa-miR-6807-5p, hsa-miR-642a-3p, and hsa-miR-6836-3p, which significantly contributed towards diagnostic prediction in BC. Moreover, through bioinformatics analysis, this study identified 65 miRNA-target genes specific to BC cell lines. A comprehensive gene-set enrichment analysis was also performed to understand the underlying mechanisms of these target genes. BSig, a tool capable of BC detection and facilitating therapeutic selection, is publicly available at https://github.com/mingjutsai/BSig.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA