Búsqueda | Portal de Búsqueda de la BVS Ecuador

1.

Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.

Taliun, Daniel; Harris, Daniel N; Kessler, Michael D; Carlson, Jedidiah; Szpiech, Zachary A; Torres, Raul; Taliun, Sarah A Gagliano; Corvelo, André; Gogarten, Stephanie M; Kang, Hyun Min; Pitsillides, Achilleas N; LeFaive, Jonathon; Lee, Seung-Been; Tian, Xiaowen; Browning, Brian L; Das, Sayantan; Emde, Anne-Katrin; Clarke, Wayne E; Loesch, Douglas P; Shetty, Amol C; Blackwell, Thomas W; Smith, Albert V; Wong, Quenna; Liu, Xiaoming; Conomos, Matthew P; Bobo, Dean M; Aguet, François; Albert, Christine; Alonso, Alvaro; Ardlie, Kristin G; Arking, Dan E; Aslibekyan, Stella; Auer, Paul L; Barnard, John; Barr, R Graham; Barwick, Lucas; Becker, Lewis C; Beer, Rebecca L; Benjamin, Emelia J; Bielak, Lawrence F; Blangero, John; Boehnke, Michael; Bowden, Donald W; Brody, Jennifer A; Burchard, Esteban G; Cade, Brian E; Casella, James F; Chalazan, Brandon; Chasman, Daniel I; Chen, Yii-Der Ida.

Nature ; 590(7845): 290-299, 2021 02.

Artículo en Inglés | MEDLINE | ID: mdl-33568819

RESUMEN

The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes)1. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.

Asunto(s)

Variación Genética/genética , Genoma Humano/genética , Genómica , National Heart, Lung, and Blood Institute (U.S.) , Medicina de Precisión , Citocromo P-450 CYP2D6/genética , Haplotipos/genética , Heterocigoto , Humanos , Mutación INDEL , Mutación con Pérdida de Función , Mutagénesis , Fenotipo , Polimorfismo de Nucleótido Simple , Densidad de Población , Medicina de Precisión/normas , Control de Calidad , Tamaño de la Muestra , Estados Unidos , Secuenciación Completa del Genoma/normas

2.

Whole genome sequencing based analysis of inflammation biomarkers in the Trans-Omics for Precision Medicine (TOPMed) consortium.

Jiang, Min-Zhi; Gaynor, Sheila M; Li, Xihao; Van Buren, Eric; Stilp, Adrienne; Buth, Erin; Wang, Fei Fei; Manansala, Regina; Gogarten, Stephanie M; Li, Zilin; Polfus, Linda M; Salimi, Shabnam; Bis, Joshua C; Pankratz, Nathan; Yanek, Lisa R; Durda, Peter; Tracy, Russell P; Rich, Stephen S; Rotter, Jerome I; Mitchell, Braxton D; Lewis, Joshua P; Psaty, Bruce M; Pratte, Katherine A; Silverman, Edwin K; Kaplan, Robert C; Avery, Christy; North, Kari E; Mathias, Rasika A; Faraday, Nauder; Lin, Honghuang; Wang, Biqi; Carson, April P; Norwood, Arnita F; Gibbs, Richard A; Kooperberg, Charles; Lundin, Jessica; Peters, Ulrike; Dupuis, Josée; Hou, Lifang; Fornage, Myriam; Benjamin, Emelia J; Reiner, Alexander P; Bowler, Russell P; Lin, Xihong; Auer, Paul L; Raffield, Laura M.

Hum Mol Genet ; 2024 May 15.

Artículo en Inglés | MEDLINE | ID: mdl-38747556

RESUMEN

Inflammation biomarkers can provide valuable insight into the role of inflammatory processes in many diseases and conditions. Sequencing based analyses of such biomarkers can also serve as an exemplar of the genetic architecture of quantitative traits. To evaluate the biological insight, which can be provided by a multi-ancestry, whole-genome based association study, we performed a comprehensive analysis of 21 inflammation biomarkers from up to 38 465 individuals with whole-genome sequencing from the Trans-Omics for Precision Medicine (TOPMed) program (with varying sample size by trait, where the minimum sample size was n = 737 for MMP-1). We identified 22 distinct single-variant associations across 6 traits-E-selectin, intercellular adhesion molecule 1, interleukin-6, lipoprotein-associated phospholipase A2 activity and mass, and P-selectin-that remained significant after conditioning on previously identified associations for these inflammatory biomarkers. We further expanded upon known biomarker associations by pairing the single-variant analysis with a rare variant set-based analysis that further identified 19 significant rare variant set-based associations with 5 traits. These signals were distinct from both significant single variant association signals within TOPMed and genetic signals observed in prior studies, demonstrating the complementary value of performing both single and rare variant analyses when analyzing quantitative traits. We also confirm several previously reported signals from semi-quantitative proteomics platforms. Many of these signals demonstrate the extensive allelic heterogeneity and ancestry-differentiated variant-trait associations common for inflammation biomarkers, a characteristic we hypothesize will be increasingly observed with well-powered, large-scale analyses of complex traits.

3.

Inherited causes of clonal haematopoiesis in 97,691 whole genomes.

Bick, Alexander G; Weinstock, Joshua S; Nandakumar, Satish K; Fulco, Charles P; Bao, Erik L; Zekavat, Seyedeh M; Szeto, Mindy D; Liao, Xiaotian; Leventhal, Matthew J; Nasser, Joseph; Chang, Kyle; Laurie, Cecelia; Burugula, Bala Bharathi; Gibson, Christopher J; Lin, Amy E; Taub, Margaret A; Aguet, Francois; Ardlie, Kristin; Mitchell, Braxton D; Barnes, Kathleen C; Moscati, Arden; Fornage, Myriam; Redline, Susan; Psaty, Bruce M; Silverman, Edwin K; Weiss, Scott T; Palmer, Nicholette D; Vasan, Ramachandran S; Burchard, Esteban G; Kardia, Sharon L R; He, Jiang; Kaplan, Robert C; Smith, Nicholas L; Arnett, Donna K; Schwartz, David A; Correa, Adolfo; de Andrade, Mariza; Guo, Xiuqing; Konkle, Barbara A; Custer, Brian; Peralta, Juan M; Gui, Hongsheng; Meyers, Deborah A; McGarvey, Stephen T; Chen, Ida Yii-Der; Shoemaker, M Benjamin; Peyser, Patricia A; Broome, Jai G; Gogarten, Stephanie M; Wang, Fei Fei.

Nature ; 586(7831): 763-768, 2020 10.

Artículo en Inglés | MEDLINE | ID: mdl-33057201

RESUMEN

Age is the dominant risk factor for most chronic human diseases, but the mechanisms through which ageing confers this risk are largely unknown1. The age-related acquisition of somatic mutations that lead to clonal expansion in regenerating haematopoietic stem cell populations has recently been associated with both haematological cancer2-4 and coronary heart disease5-this phenomenon is termed clonal haematopoiesis of indeterminate potential (CHIP)6. Simultaneous analyses of germline and somatic whole-genome sequences provide the opportunity to identify root causes of CHIP. Here we analyse high-coverage whole-genome sequences from 97,691 participants of diverse ancestries in the National Heart, Lung, and Blood Institute Trans-omics for Precision Medicine (TOPMed) programme, and identify 4,229 individuals with CHIP. We identify associations with blood cell, lipid and inflammatory traits that are specific to different CHIP driver genes. Association of a genome-wide set of germline genetic variants enabled the identification of three genetic loci associated with CHIP status, including one locus at TET2 that was specific to individuals of African ancestry. In silico-informed in vitro evaluation of the TET2 germline locus enabled the identification of a causal variant that disrupts a TET2 distal enhancer, resulting in increased self-renewal of haematopoietic stem cells. Overall, we observe that germline genetic variation shapes haematopoietic stem cell function, leading to CHIP through mechanisms that are specific to clonal haematopoiesis as well as shared mechanisms that lead to somatic mutations across tissues.

Asunto(s)

Hematopoyesis Clonal/genética , Predisposición Genética a la Enfermedad , Genoma Humano/genética , Secuenciación Completa del Genoma , Adulto , África/etnología , Anciano , Anciano de 80 o más Años , Población Negra/genética , Autorrenovación de las Células/genética , Proteínas de Unión al ADN/genética , Dioxigenasas , Femenino , Mutación de Línea Germinal/genética , Células Madre Hematopoyéticas/citología , Células Madre Hematopoyéticas/metabolismo , Humanos , Péptidos y Proteínas de Señalización Intracelular/genética , Masculino , Persona de Mediana Edad , National Heart, Lung, and Blood Institute (U.S.) , Fenotipo , Medicina de Precisión , Proteínas Proto-Oncogénicas/genética , Proteínas de Motivos Tripartitos/genética , Estados Unidos , alfa Carioferinas/genética

4.

Social and scientific motivations to move beyond groups in allele frequencies: The TOPMed experience.

Nelson, Sarah C; Gogarten, Stephanie M; Fullerton, Stephanie M; Isasi, Carmen R; Mitchell, Braxton D; North, Kari E; Rich, Stephen S; Taylor, Matthew R G; Zöllner, Sebastian; Sofer, Tamar.

Am J Hum Genet ; 109(9): 1582-1590, 2022 09 01.

Artículo en Inglés | MEDLINE | ID: mdl-36055210

RESUMEN

For the genomics community, allele frequencies within defined groups (or "strata") are useful across multiple research and clinical contexts. Benefits include allowing researchers to identify populations for replication or "look up" studies, enabling researchers to compare population-specific frequencies to validate findings, and facilitating assessment of variant pathogenicity in clinical contexts. However, there are potential concerns with stratified allele frequencies. These include potential re-identification (determining whether or not an individual participated in a given research study based on allele frequencies and individual-level genetic data), harm from associating stigmatizing variants with specific groups, potential reification of race as a biological rather than a socio-political category, and whether presenting stratified frequencies-and the downstream applications that this presentation enables-is consistent with participants' informed consents. The NHLBI Trans-Omics for Precision Medicine (TOPMed) program considered the scientific and social implications of different approaches for adding stratified frequencies to the TOPMed BRAVO (Browse All Variants Online) variant server. We recommend a novel approach of presenting ancestry-specific allele frequencies using a statistical method based upon local genetic ancestry inference. Notably, this approach does not require grouping individuals by either predominant global ancestry or race/ethnicity and, therefore, mitigates re-identification and other concerns as the mixture distribution of ancestral allele frequencies varies across the genome. Here we describe our considerations and approach, which can assist other genomics research programs facing similar issues of how to define and present stratified frequencies in publicly available variant databases.

Asunto(s)

Motivación , Medicina de Precisión , Etnicidad/genética , Frecuencia de los Genes/genética , Genómica/métodos , Humanos

5.

Admix-kit: an integrated toolkit and pipeline for genetic analyses of admixed populations.

Hou, Kangcheng; Gogarten, Stephanie; Kim, Joohyun; Hua, Xing; Dias, Julie-Alexia; Sun, Quan; Wang, Ying; Tan, Taotao; Atkinson, Elizabeth G; Martin, Alicia; Shortt, Jonathan; Hirbo, Jibril; Li, Yun; Pasaniuc, Bogdan; Zhang, Haoyu.

Bioinformatics ; 40(4)2024 Mar 29.

Artículo en Inglés | MEDLINE | ID: mdl-38490256

RESUMEN

SUMMARY: Admixed populations, with their unique and diverse genetic backgrounds, are often underrepresented in genetic studies. This oversight not only limits our understanding but also exacerbates existing health disparities. One major barrier has been the lack of efficient tools tailored for the special challenges of genetic studies of admixed populations. Here, we present admix-kit, an integrated toolkit and pipeline for genetic analyses of admixed populations. Admix-kit implements a suite of methods to facilitate genotype and phenotype simulation, association testing, genetic architecture inference, and polygenic scoring in admixed populations. AVAILABILITY AND IMPLEMENTATION: Admix-kit package is open-source and available at https://github.com/KangchengHou/admix-kit. Additionally, users can use the pipeline designed for admixed genotype simulation available at https://github.com/UW-GAC/admix-kit_workflow.

Asunto(s)

Programas Informáticos , Genotipo , Fenotipo

6.

Association of clonal hematopoiesis with chronic obstructive pulmonary disease.

Miller, Peter G; Qiao, Dandi; Rojas-Quintero, Joselyn; Honigberg, Michael C; Sperling, Adam S; Gibson, Christopher J; Bick, Alexander G; Niroula, Abhishek; McConkey, Marie E; Sandoval, Brittany; Miller, Brian C; Shi, Weiwei; Viswanathan, Kaushik; Leventhal, Matthew; Werner, Lillian; Moll, Matthew; Cade, Brian E; Barr, R Graham; Correa, Adolfo; Cupples, L Adrienne; Gharib, Sina A; Jain, Deepti; Gogarten, Stephanie M; Lange, Leslie A; London, Stephanie J; Manichaikul, Ani; O'Connor, George T; Oelsner, Elizabeth C; Redline, Susan; Rich, Stephen S; Rotter, Jerome I; Ramachandran, Vasan; Yu, Bing; Sholl, Lynette; Neuberg, Donna; Jaiswal, Siddhartha; Levy, Bruce D; Owen, Caroline A; Natarajan, Pradeep; Silverman, Edwin K; van Galen, Peter; Tesfaigzi, Yohannes; Cho, Michael H; Ebert, Benjamin L.

Blood ; 139(3): 357-368, 2022 01 20.

Artículo en Inglés | MEDLINE | ID: mdl-34855941

RESUMEN

Chronic obstructive pulmonary disease (COPD) is associated with age and smoking, but other determinants of the disease are incompletely understood. Clonal hematopoiesis of indeterminate potential (CHIP) is a common, age-related state in which somatic mutations in clonal blood populations induce aberrant inflammatory responses. Patients with CHIP have an elevated risk for cardiovascular disease, but the association of CHIP with COPD remains unclear. We analyzed whole-genome sequencing and whole-exome sequencing data to detect CHIP in 48 835 patients, of whom 8444 had moderate to very severe COPD, from four separate cohorts with COPD phenotyping and smoking history. We measured emphysema in murine models in which Tet2 was deleted in hematopoietic cells. In the COPDGene cohort, individuals with CHIP had risks of moderate-to-severe, severe, or very severe COPD that were 1.6 (adjusted 95% confidence interval [CI], 1.1-2.2) and 2.2 (adjusted 95% CI, 1.5-3.2) times greater than those for noncarriers. These findings were consistently observed in three additional cohorts and meta-analyses of all patients. CHIP was also associated with decreased FEV1% predicted in the COPDGene cohort (mean between-group differences, -5.7%; adjusted 95% CI, -8.8% to -2.6%), a finding replicated in additional cohorts. Smoke exposure was associated with a small but significant increased risk of having CHIP (odds ratio, 1.03 per 10 pack-years; 95% CI, 1.01-1.05 per 10 pack-years) in the meta-analysis of all patients. Inactivation of Tet2 in mouse hematopoietic cells exacerbated the development of emphysema and inflammation in models of cigarette smoke exposure. Somatic mutations in blood cells are associated with the development and severity of COPD, independent of age and cumulative smoke exposure.

Asunto(s)

Hematopoyesis Clonal , Enfermedad Pulmonar Obstructiva Crónica/genética , Animales , Femenino , Humanos , Masculino , Ratones , Persona de Mediana Edad , Oportunidad Relativa , Enfermedad Pulmonar Obstructiva Crónica/etiología , Factores de Riesgo , Fumar/efectos adversos , Secuenciación del Exoma

7.

Genome-wide association study of body fat distribution traits in Hispanics/Latinos from the HCHS/SOL.

Justice, Anne E; Young, Kristin; Gogarten, Stephanie M; Sofer, Tamar; Graff, Misa; Love, Shelly Ann M; Wang, Yujie; Klimentidis, Yann C; Cruz, Miguel; Guo, Xiuqing; Hartwig, Fernando; Petty, Lauren; Yao, Jie; Allison, Matthew A; Below, Jennifer E; Buchanan, Thomas A; Chen, Yii-Der Ida; Goodarzi, Mark O; Hanis, Craig; Highland, Heather M; Hsueh, Willa A; Ipp, Eli; Parra, Esteban; Palmas, Walter; Raffel, Leslie J; Rotter, Jerome I; Tan, Jingyi; Taylor, Kent D; Valladares, Adan; Xiang, Anny H; Sánchez-Johnsen, Lisa; Isasi, Carmen R; North, Kari E.

Hum Mol Genet ; 30(22): 2190-2204, 2021 11 01.

Artículo en Inglés | MEDLINE | ID: mdl-34165540

RESUMEN

Central obesity is a leading health concern with a great burden carried by ethnic minority populations, especially Hispanics/Latinos. Genetic factors contribute to the obesity burden overall and to inter-population differences. We aimed to identify the loci associated with central adiposity measured as waist-to-hip ratio (WHR), waist circumference (WC) and hip circumference (HIP) adjusted for body mass index (adjBMI) by using the Hispanic Community Health Study/Study of Latinos (HCHS/SOL); determine if differences in associations differ by background group within HCHS/SOL and determine whether previously reported associations generalize to HCHS/SOL. Our analyses included 7472 women and 5200 men of mainland (Mexican, Central and South American) and Caribbean (Puerto Rican, Cuban and Dominican) background residing in the USA. We performed genome-wide association analyses stratified and combined across sexes using linear mixed-model regression. We identified 16 variants for waist-to-hip ratio adjusted for body mass index (WHRadjBMI), 22 for waist circumference adjusted for body mass index (WCadjBMI) and 28 for hip circumference adjusted for body mass index (HIPadjBMI), which reached suggestive significance (P < 1 × 10-6). Many loci exhibited differences in strength of associations by ethnic background and sex. We brought a total of 66 variants forward for validation in cohorts (N = 34 161) with participants of Hispanic/Latino, African and European descent. We confirmed four novel loci (P < 0.05 and consistent direction of effect, and P < 5 × 10-8 after meta-analysis), including two for WHRadjBMI (rs13301996, rs79478137); one for WCadjBMI (rs3168072) and one for HIPadjBMI (rs28692724). Also, we generalized previously reported associations to HCHS/SOL, (8 for WHRadjBMI, 10 for WCadjBMI and 12 for HIPadjBMI). Our study highlights the importance of large-scale genomic studies in ancestrally diverse Hispanic/Latino populations for identifying and characterizing central obesity susceptibility that may be ancestry-specific.

Asunto(s)

Adiposidad/genética , Distribución de la Grasa Corporal , Estudio de Asociación del Genoma Completo , Hispánicos o Latinos/genética , Carácter Cuantitativo Heredable , Alelos , Humanos , Polimorfismo de Nucleótido Simple

8.

Allelic Heterogeneity at the CRP Locus Identified by Whole-Genome Sequencing in Multi-ancestry Cohorts.

Raffield, Laura M; Iyengar, Apoorva K; Wang, Biqi; Gaynor, Sheila M; Spracklen, Cassandra N; Zhong, Xue; Kowalski, Madeline H; Salimi, Shabnam; Polfus, Linda M; Benjamin, Emelia J; Bis, Joshua C; Bowler, Russell; Cade, Brian E; Choi, Won Jung; Comellas, Alejandro P; Correa, Adolfo; Cruz, Pedro; Doddapaneni, Harsha; Durda, Peter; Gogarten, Stephanie M; Jain, Deepti; Kim, Ryan W; Kral, Brian G; Lange, Leslie A; Larson, Martin G; Laurie, Cecelia; Lee, Jiwon; Lee, Seonwook; Lewis, Joshua P; Metcalf, Ginger A; Mitchell, Braxton D; Momin, Zeineen; Muzny, Donna M; Pankratz, Nathan; Park, Cheol Joo; Rich, Stephen S; Rotter, Jerome I; Ryan, Kathleen; Seo, Daekwan; Tracy, Russell P; Viaud-Martinez, Karine A; Yanek, Lisa R; Zhao, Lue Ping; Lin, Xihong; Li, Bingshan; Li, Yun; Dupuis, Josée; Reiner, Alexander P; Mohlke, Karen L; Auer, Paul L.

Am J Hum Genet ; 106(1): 112-120, 2020 01 02.

Artículo en Inglés | MEDLINE | ID: mdl-31883642

RESUMEN

Whole-genome sequencing (WGS) can improve assessment of low-frequency and rare variants, particularly in non-European populations that have been underrepresented in existing genomic studies. The genetic determinants of C-reactive protein (CRP), a biomarker of chronic inflammation, have been extensively studied, with existing genome-wide association studies (GWASs) conducted in >200,000 individuals of European ancestry. In order to discover novel loci associated with CRP levels, we examined a multi-ancestry population (n = 23,279) with WGS (â¼38× coverage) from the Trans-Omics for Precision Medicine (TOPMed) program. We found evidence for eight distinct associations at the CRP locus, including two variants that have not been identified previously (rs11265259 and rs181704186), both of which are non-coding and more common in individuals of African ancestry (â¼10% and â¼1% minor allele frequency, respectively, and rare or monomorphic in 1000 Genomes populations of East Asian, South Asian, and European ancestry). We show that the minor (G) allele of rs181704186 is associated with lower CRP levels and decreased transcriptional activity and protein binding in vitro, providing a plausible molecular mechanism for this African ancestry-specific signal. The individuals homozygous for rs181704186-G have a mean CRP level of 0.23 mg/L, in contrast to individuals heterozygous for rs181704186 with mean CRP of 2.97 mg/L and major allele homozygotes with mean CRP of 4.11 mg/L. This study demonstrates the utility of WGS in multi-ethnic populations to drive discovery of complex trait associations of large effect and to identify functional alleles in noncoding regulatory regions.

Asunto(s)

Pueblo Asiatico/genética , Población Negra/genética , Proteína C-Reactiva/genética , Predisposición Genética a la Enfermedad , Polimorfismo de Nucleótido Simple , Población Blanca/genética , Secuenciación Completa del Genoma/métodos , Estudios de Cohortes , Frecuencia de los Genes , Estudio de Asociación del Genoma Completo , Humanos , Desequilibrio de Ligamiento

9.

Efficient Variant Set Mixed Model Association Tests for Continuous and Binary Traits in Large-Scale Whole-Genome Sequencing Studies.

Chen, Han; Huffman, Jennifer E; Brody, Jennifer A; Wang, Chaolong; Lee, Seunggeun; Li, Zilin; Gogarten, Stephanie M; Sofer, Tamar; Bielak, Lawrence F; Bis, Joshua C; Blangero, John; Bowler, Russell P; Cade, Brian E; Cho, Michael H; Correa, Adolfo; Curran, Joanne E; de Vries, Paul S; Glahn, David C; Guo, Xiuqing; Johnson, Andrew D; Kardia, Sharon; Kooperberg, Charles; Lewis, Joshua P; Liu, Xiaoming; Mathias, Rasika A; Mitchell, Braxton D; O'Connell, Jeffrey R; Peyser, Patricia A; Post, Wendy S; Reiner, Alex P; Rich, Stephen S; Rotter, Jerome I; Silverman, Edwin K; Smith, Jennifer A; Vasan, Ramachandran S; Wilson, James G; Yanek, Lisa R; Redline, Susan; Smith, Nicholas L; Boerwinkle, Eric; Borecki, Ingrid B; Cupples, L Adrienne; Laurie, Cathy C; Morrison, Alanna C; Rice, Kenneth M; Lin, Xihong.

Am J Hum Genet ; 104(2): 260-274, 2019 02 07.

Artículo en Inglés | MEDLINE | ID: mdl-30639324

RESUMEN

With advances in whole-genome sequencing (WGS) technology, more advanced statistical methods for testing genetic association with rare variants are being developed. Methods in which variants are grouped for analysis are also known as variant-set, gene-based, and aggregate unit tests. The burden test and sequence kernel association test (SKAT) are two widely used variant-set tests, which were originally developed for samples of unrelated individuals and later have been extended to family data with known pedigree structures. However, computationally efficient and powerful variant-set tests are needed to make analyses tractable in large-scale WGS studies with complex study samples. In this paper, we propose the variant-set mixed model association tests (SMMAT) for continuous and binary traits using the generalized linear mixed model framework. These tests can be applied to large-scale WGS studies involving samples with population structure and relatedness, such as in the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program. SMMATs share the same null model for different variant sets, and a virtue of this null model, which includes covariates only, is that it needs to be fit only once for all tests in each genome-wide analysis. Simulation studies show that all the proposed SMMATs correctly control type I error rates for both continuous and binary traits in the presence of population structure and relatedness. We also illustrate our tests in a real data example of analysis of plasma fibrinogen levels in the TOPMed program (n = 23,763), using the Analysis Commons, a cloud-based computing platform.

Asunto(s)

Estudios de Asociación Genética , Modelos Genéticos , Secuenciación Completa del Genoma , Cromosomas Humanos Par 4/genética , Nube Computacional , Femenino , Fibrinógeno/análisis , Fibrinógeno/genética , Genética de Población , Humanos , Masculino , National Heart, Lung, and Blood Institute (U.S.) , Medicina de Precisión , Proyectos de Investigación , Factores de Tiempo , Estados Unidos

10.

Author Correction: Inherited causes of clonal haematopoiesis in 97,691 whole genomes.

Bick, Alexander G; Weinstock, Joshua S; Nandakumar, Satish K; Fulco, Charles P; Bao, Erik L; Zekavat, Seyedeh M; Szeto, Mindy D; Liao, Xiaotian; Leventhal, Matthew J; Nasser, Joseph; Chang, Kyle; Laurie, Cecelia; Burugula, Bala Bharathi; Gibson, Christopher J; Niroula, Abhishek; Lin, Amy E; Taub, Margaret A; Aguet, Francois; Ardlie, Kristin; Mitchell, Braxton D; Barnes, Kathleen C; Moscati, Arden; Fornage, Myriam; Redline, Susan; Psaty, Bruce M; Silverman, Edwin K; Weiss, Scott T; Palmer, Nicholette D; Vasan, Ramachandran S; Burchard, Esteban G; Kardia, Sharon L R; He, Jiang; Kaplan, Robert C; Smith, Nicholas L; Arnett, Donna K; Schwartz, David A; Correa, Adolfo; de Andrade, Mariza; Guo, Xiuqing; Konkle, Barbara A; Custer, Brian; Peralta, Juan M; Gui, Hongsheng; Meyers, Deborah A; McGarvey, Stephen T; Chen, Ida Yii-Der; Shoemaker, M Benjamin; Peyser, Patricia A; Broome, Jai G; Gogarten, Stephanie M.

Nature ; 591(7851): E27, 2021 Mar.

Artículo en Inglés | MEDLINE | ID: mdl-33707633

11.

A fully adjusted two-stage procedure for rank-normalization in genetic association studies.

Sofer, Tamar; Zheng, Xiuwen; Gogarten, Stephanie M; Laurie, Cecelia A; Grinde, Kelsey; Shaffer, John R; Shungin, Dmitry; O'Connell, Jeffrey R; Durazo-Arvizo, Ramon A; Raffield, Laura; Lange, Leslie; Musani, Solomon; Vasan, Ramachandran S; Cupples, L Adrienne; Reiner, Alexander P; Laurie, Cathy C; Rice, Kenneth M.

Genet Epidemiol ; 43(3): 263-275, 2019 04.

Artículo en Inglés | MEDLINE | ID: mdl-30653739

RESUMEN

When testing genotype-phenotype associations using linear regression, departure of the trait distribution from normality can impact both Type I error rate control and statistical power, with worse consequences for rarer variants. Because genotypes are expected to have small effects (if any) investigators now routinely use a two-stage method, in which they first regress the trait on covariates, obtain residuals, rank-normalize them, and then use the rank-normalized residuals in association analysis with the genotypes. Potential confounding signals are assumed to be removed at the first stage, so in practice, no further adjustment is done in the second stage. Here, we show that this widely used approach can lead to tests with undesirable statistical properties, due to both combination of a mis-specified mean-variance relationship and remaining covariate associations between the rank-normalized residuals and genotypes. We demonstrate these properties theoretically, and also in applications to genome-wide and whole-genome sequencing association studies. We further propose and evaluate an alternative fully adjusted two-stage approach that adjusts for covariates both when residuals are obtained and in the subsequent association test. This method can reduce excess Type I errors and improve statistical power.

Asunto(s)

Estudios de Asociación Genética , Modelos Genéticos , Simulación por Computador , Estudio de Asociación del Genoma Completo , Genotipo , Hemoglobinas/metabolismo , Hispánicos o Latinos , Humanos , Modelos Lineales , Fenotipo

12.

Genetic association testing using the GENESIS R/Bioconductor package.

Gogarten, Stephanie M; Sofer, Tamar; Chen, Han; Yu, Chaoyu; Brody, Jennifer A; Thornton, Timothy A; Rice, Kenneth M; Conomos, Matthew P.

Bioinformatics ; 35(24): 5346-5348, 2019 12 15.

Artículo en Inglés | MEDLINE | ID: mdl-31329242

RESUMEN

SUMMARY: The Genomic Data Storage (GDS) format provides efficient storage and retrieval of genotypes measured by microarrays and sequencing. We developed GENESIS to perform various single- and aggregate-variant association tests using genotype data stored in GDS format. GENESIS implements highly flexible mixed models, allowing for different link functions, multiple variance components and phenotypic heteroskedasticity. GENESIS integrates cohesively with other R/Bioconductor packages to build a complete genomic analysis workflow entirely within the R environment. AVAILABILITY AND IMPLEMENTATION: https://bioconductor.org/packages/GENESIS; vignettes included. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Asunto(s)

Genómica , Programas Informáticos , Pruebas Genéticas , Genoma , Análisis de Secuencia

13.

Genetic Diversity and Association Studies in US Hispanic/Latino Populations: Applications in the Hispanic Community Health Study/Study of Latinos.

Conomos, Matthew P; Laurie, Cecelia A; Stilp, Adrienne M; Gogarten, Stephanie M; McHugh, Caitlin P; Nelson, Sarah C; Sofer, Tamar; Fernández-Rhodes, Lindsay; Justice, Anne E; Graff, Mariaelisa; Young, Kristin L; Seyerle, Amanda A; Avery, Christy L; Taylor, Kent D; Rotter, Jerome I; Talavera, Gregory A; Daviglus, Martha L; Wassertheil-Smoller, Sylvia; Schneiderman, Neil; Heiss, Gerardo; Kaplan, Robert C; Franceschini, Nora; Reiner, Alex P; Shaffer, John R; Barr, R Graham; Kerr, Kathleen F; Browning, Sharon R; Browning, Brian L; Weir, Bruce S; Avilés-Santa, M Larissa; Papanicolaou, George J; Lumley, Thomas; Szpiro, Adam A; North, Kari E; Rice, Ken; Thornton, Timothy A; Laurie, Cathy C.

Am J Hum Genet ; 98(1): 165-84, 2016 Jan 07.

Artículo en Inglés | MEDLINE | ID: mdl-26748518

RESUMEN

US Hispanic/Latino individuals are diverse in genetic ancestry, culture, and environmental exposures. Here, we characterized and controlled for this diversity in genome-wide association studies (GWASs) for the Hispanic Community Health Study/Study of Latinos (HCHS/SOL). We simultaneously estimated population-structure principal components (PCs) robust to familial relatedness and pairwise kinship coefficients (KCs) robust to population structure, admixture, and Hardy-Weinberg departures. The PCs revealed substantial genetic differentiation within and among six self-identified background groups (Cuban, Dominican, Puerto Rican, Mexican, and Central and South American). To control for variation among groups, we developed a multi-dimensional clustering method to define a "genetic-analysis group" variable that retains many properties of self-identified background while achieving substantially greater genetic homogeneity within groups and including participants with non-specific self-identification. In GWASs of 22 biomedical traits, we used a linear mixed model (LMM) including pairwise empirical KCs to account for familial relatedness, PCs for ancestry, and genetic-analysis groups for additional group-associated effects. Including the genetic-analysis group as a covariate accounted for significant trait variation in 8 of 22 traits, even after we fit 20 PCs. Additionally, genetic-analysis groups had significant heterogeneity of residual variance for 20 of 22 traits, and modeling this heteroscedasticity within the LMM reduced genomic inflation for 19 traits. Furthermore, fitting an LMM that utilized a genetic-analysis group rather than a self-identified background group achieved higher power to detect previously reported associations. We expect that the methods applied here will be useful in other studies with multiple ethnic groups, admixture, and relatedness.

Asunto(s)

Variación Genética , Hispánicos o Latinos/genética , Estudio de Asociación del Genoma Completo , Humanos , Estados Unidos

14.

Genome-wide Association Study of Platelet Count Identifies Ancestry-Specific Loci in Hispanic/Latino Americans.

Schick, Ursula M; Jain, Deepti; Hodonsky, Chani J; Morrison, Jean V; Davis, James P; Brown, Lisa; Sofer, Tamar; Conomos, Matthew P; Schurmann, Claudia; McHugh, Caitlin P; Nelson, Sarah C; Vadlamudi, Swarooparani; Stilp, Adrienne; Plantinga, Anna; Baier, Leslie; Bien, Stephanie A; Gogarten, Stephanie M; Laurie, Cecelia A; Taylor, Kent D; Liu, Yongmei; Auer, Paul L; Franceschini, Nora; Szpiro, Adam; Rice, Ken; Kerr, Kathleen F; Rotter, Jerome I; Hanson, Robert L; Papanicolaou, George; Rich, Stephen S; Loos, Ruth J F; Browning, Brian L; Browning, Sharon R; Weir, Bruce S; Laurie, Cathy C; Mohlke, Karen L; North, Kari E; Thornton, Timothy A; Reiner, Alex P.

Am J Hum Genet ; 98(2): 229-42, 2016 Feb 04.

Artículo en Inglés | MEDLINE | ID: mdl-26805783

RESUMEN

Platelets play an essential role in hemostasis and thrombosis. We performed a genome-wide association study of platelet count in 12,491 participants of the Hispanic Community Health Study/Study of Latinos by using a mixed-model method that accounts for admixture and family relationships. We discovered and replicated associations with five genes (ACTN1, ETV7, GABBR1-MOG, MEF2C, and ZBTB9-BAK1). Our strongest association was with Amerindian-specific variant rs117672662 (p value = 1.16 × 10(-28)) in ACTN1, a gene implicated in congenital macrothrombocytopenia. rs117672662 exhibited allelic differences in transcriptional activity and protein binding in hematopoietic cells. Our results underscore the value of diverse populations to extend insights into the allelic architecture of complex traits.

Asunto(s)

Estudios de Asociación Genética/métodos , Sitios Genéticos , Hispánicos o Latinos/genética , Recuento de Plaquetas , Actinina/genética , Adolescente , Adulto , Anciano , Alelos , Frecuencia de los Genes , Genotipo , Técnicas de Genotipaje , Humanos , Factores de Transcripción MEF2/genética , Proteínas de la Membrana/genética , Persona de Mediana Edad , Fenotipo , Polimorfismo de Nucleótido Simple , Receptores de GABA-B/genética , Adulto Joven

15.

SeqArray-a storage-efficient high-performance data format for WGS variant calls.

Zheng, Xiuwen; Gogarten, Stephanie M; Lawrence, Michael; Stilp, Adrienne; Conomos, Matthew P; Weir, Bruce S; Laurie, Cathy; Levine, David.

Bioinformatics ; 33(15): 2251-2257, 2017 Aug 01.

Artículo en Inglés | MEDLINE | ID: mdl-28334390

RESUMEN

MOTIVATION: Whole-genome sequencing (WGS) data are being generated at an unprecedented rate. Analysis of WGS data requires a flexible data format to store the different types of DNA variation. Variant call format (VCF) is a general text-based format developed to store variant genotypes and their annotations. However, VCF files are large and data retrieval is relatively slow. Here we introduce a new WGS variant data format implemented in the R/Bioconductor package 'SeqArray' for storing variant calls in an array-oriented manner which provides the same capabilities as VCF, but with multiple high compression options and data access using high-performance parallel computing. RESULTS: Benchmarks using 1000 Genomes Phase 3 data show file sizes are 14.0 Gb (VCF), 12.3 Gb (BCF, binary VCF), 3.5 Gb (BGT) and 2.6 Gb (SeqArray) respectively. Reading genotypes in the SeqArray package are two to three times faster compared with the htslib C library using BCF files. For the allele frequency calculation, the implementation in the SeqArray package is over 5 times faster than PLINK v1.9 with VCF and BCF files, and over 16 times faster than vcftools. When used in conjunction with R/Bioconductor packages, the SeqArray package provides users a flexible, feature-rich, high-performance programming environment for analysis of WGS variant data. AVAILABILITY AND IMPLEMENTATION: http://www.bioconductor.org/packages/SeqArray. CONTACT: zhengx@u.washington.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Asunto(s)

Compresión de Datos/métodos , Variación Genética , Programas Informáticos , Secuenciación Completa del Genoma/métodos , Genoma Humano , Genómica/métodos , Humanos

16.

Genome-Wide Association Study of Heavy Smoking and Daily/Nondaily Smoking in the Hispanic Community Health Study/Study of Latinos (HCHS/SOL).

Saccone, Nancy L; Emery, Leslie S; Sofer, Tamar; Gogarten, Stephanie M; Becker, Diane M; Bottinger, Erwin P; Chen, Li-Shiun; Culverhouse, Robert C; Duan, Weimin; Hancock, Dana B; Hosgood, H Dean; Johnson, Eric O; Loos, Ruth J F; Louie, Tin; Papanicolaou, George; Perreira, Krista M; Rodriquez, Erik J; Schurmann, Claudia; Stilp, Adrienne M; Szpiro, Adam A; Talavera, Gregory A; Taylor, Kent D; Thrasher, James F; Yanek, Lisa R; Laurie, Cathy C; Pérez-Stable, Eliseo J; Bierut, Laura J; Kaplan, Robert C.

Nicotine Tob Res ; 20(4): 448-457, 2018 03 06.

Artículo en Inglés | MEDLINE | ID: mdl-28520984

RESUMEN

Introduction: Genetic variants associated with nicotine dependence have previously been identified, primarily in European-ancestry populations. No genome-wide association studies (GWAS) have been reported for smoking behaviors in Hispanics/Latinos in the United States and Latin America, who are of mixed ancestry with European, African, and American Indigenous components. Methods: We examined genetic associations with smoking behaviors in the Hispanic Community Health Study/Study of Latinos (HCHS/SOL) (N = 12 741 with smoking data, 5119 ever-smokers), using ~2.3 million genotyped variants imputed to the 1000 Genomes Project phase 3. Mixed logistic regression models accounted for population structure, sampling, relatedness, sex, and age. Results: The known region of CHRNA5, which encodes the α5 cholinergic nicotinic receptor subunit, was associated with heavy smoking at genome-wide significance (p ≤ 5 × 10-8) in a comparison of 1929 ever-smokers reporting cigarettes per day (CPD) > 10 versus 3156 reporting CPD ≤ 10. The functional variant rs16969968 in CHRNA5 had a p value of 2.20 × 10-7 and odds ratio (OR) of 1.32 for the minor allele (A); its minor allele frequency was 0.22 overall and similar across Hispanic/Latino background groups (Central American = 0.17; South American = 0.19; Mexican = 0.18; Puerto Rican = 0.22; Cuban = 0.29; Dominican = 0.19). CHRNA4 on chromosome 20 attained p < 10-4, supporting prior findings in non-Hispanics. For nondaily smoking, which is prevalent in Hispanic/Latino smokers, compared to daily smoking, loci on chromosomes 2 and 4 achieved genome-wide significance; replication attempts were limited by small Hispanic/Latino sample sizes. Conclusions: Associations of nicotinic receptor gene variants with smoking, first reported in non-Hispanic European-ancestry populations, generalized to Hispanics/Latinos despite different patterns of smoking behavior. Implications: We conducted the first large-scale genome-wide association study (GWAS) of smoking behavior in a US Hispanic/Latino cohort, and the first GWAS of daily/nondaily smoking in any population. Results show that the region of the nicotinic receptor subunit gene CHRNA5, which in non-Hispanic European-ancestry smokers has been associated with heavy smoking as well as cessation and treatment efficacy, is also significantly associated with heavy smoking in this Hispanic/Latino cohort. The results are an important addition to understanding the impact of genetic variants in understudied Hispanic/Latino smokers.

Asunto(s)

Estudio de Asociación del Genoma Completo/métodos , Hispánicos o Latinos/genética , Proteínas del Tejido Nervioso/genética , Salud Pública/métodos , Receptores Nicotínicos/genética , Fumar/epidemiología , Fumar/genética , Adulto , Femenino , Frecuencia de los Genes , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Estados Unidos/epidemiología

17.

A genome-wide interaction analysis of tricyclic/tetracyclic antidepressants and RR and QT intervals: a pharmacogenomics study from the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium.

Noordam, Raymond; Sitlani, Colleen M; Avery, Christy L; Stewart, James D; Gogarten, Stephanie M; Wiggins, Kerri L; Trompet, Stella; Warren, Helen R; Sun, Fangui; Evans, Daniel S; Li, Xiaohui; Li, Jin; Smith, Albert V; Bis, Joshua C; Brody, Jennifer A; Busch, Evan L; Caulfield, Mark J; Chen, Yii-Der I; Cummings, Steven R; Cupples, L Adrienne; Duan, Qing; Franco, Oscar H; Méndez-Giráldez, Rául; Harris, Tamara B; Heckbert, Susan R; van Heemst, Diana; Hofman, Albert; Floyd, James S; Kors, Jan A; Launer, Lenore J; Li, Yun; Li-Gao, Ruifang; Lange, Leslie A; Lin, Henry J; de Mutsert, Renée; Napier, Melanie D; Newton-Cheh, Christopher; Poulter, Neil; Reiner, Alexander P; Rice, Kenneth M; Roach, Jeffrey; Rodriguez, Carlos J; Rosendaal, Frits R; Sattar, Naveed; Sever, Peter; Seyerle, Amanda A; Slagboom, P Eline; Soliman, Elsayed Z; Sotoodehnia, Nona; Stott, David J.

J Med Genet ; 54(5): 313-323, 2017 05.

Artículo en Inglés | MEDLINE | ID: mdl-28039329

RESUMEN

BACKGROUND: Increased heart rate and a prolonged QT interval are important risk factors for cardiovascular morbidity and mortality, and can be influenced by the use of various medications, including tricyclic/tetracyclic antidepressants (TCAs). We aim to identify genetic loci that modify the association between TCA use and RR and QT intervals. METHODS AND RESULTS: We conducted race/ethnic-specific genome-wide interaction analyses (with HapMap phase II imputed reference panel imputation) of TCAs and resting RR and QT intervals in cohorts of European (n=45â706; n=1417 TCA users), African (n=10â235; n=296 TCA users) and Hispanic/Latino (n=13â808; n=147 TCA users) ancestry, adjusted for clinical covariates. Among the populations of European ancestry, two genome-wide significant loci were identified for RR interval: rs6737205 in BRE (ß=56.3, pinteraction=3.9e-9) and rs9830388 in UBE2E2 (ß=25.2, pinteraction=1.7e-8). In Hispanic/Latino cohorts, rs2291477 in TGFBR3 significantly modified the association between TCAs and QT intervals (ß=9.3, pinteraction=2.55e-8). In the meta-analyses of the other ethnicities, these loci either were excluded from the meta-analyses (as part of quality control), or their effects did not reach the level of nominal statistical significance (pinteraction>0.05). No new variants were identified in these ethnicities. No additional loci were identified after inverse-variance-weighted meta-analysis of the three ancestries. CONCLUSIONS: Among Europeans, TCA interactions with variants in BRE and UBE2E2 were identified in relation to RR intervals. Among Hispanic/Latinos, variants in TGFBR3 modified the relation between TCAs and QT intervals. Future studies are required to confirm our results.

Asunto(s)

Envejecimiento/fisiología , Antidepresivos Tricíclicos/farmacología , Electrocardiografía , Estudio de Asociación del Genoma Completo , Corazón/fisiopatología , Farmacogenética , Anciano , Femenino , Sitios Genéticos , Corazón/efectos de los fármacos , Humanos , Masculino , Persona de Mediana Edad

18.

Association Between Titin Loss-of-Function Variants and Early-Onset Atrial Fibrillation.

Choi, Seung Hoan; Weng, Lu-Chen; Roselli, Carolina; Lin, Honghuang; Haggerty, Christopher M; Shoemaker, M Benjamin; Barnard, John; Arking, Dan E; Chasman, Daniel I; Albert, Christine M; Chaffin, Mark; Tucker, Nathan R; Smith, Jonathan D; Gupta, Namrata; Gabriel, Stacey; Margolin, Lauren; Shea, Marisa A; Shaffer, Christian M; Yoneda, Zachary T; Boerwinkle, Eric; Smith, Nicholas L; Silverman, Edwin K; Redline, Susan; Vasan, Ramachandran S; Burchard, Esteban G; Gogarten, Stephanie M; Laurie, Cecelia; Blackwell, Thomas W; Abecasis, Gonçalo; Carey, David J; Fornwalt, Brandon K; Smelser, Diane T; Baras, Aris; Dewey, Frederick E; Jaquish, Cashell E; Papanicolaou, George J; Sotoodehnia, Nona; Van Wagoner, David R; Psaty, Bruce M; Kathiresan, Sekar; Darbar, Dawood; Alonso, Alvaro; Heckbert, Susan R; Chung, Mina K; Roden, Dan M; Benjamin, Emelia J; Murray, Michael F; Lunetta, Kathryn L; Lubitz, Steven A; Ellinor, Patrick T.

JAMA ; 320(22): 2354-2364, 2018 12 11.

Artículo en Inglés | MEDLINE | ID: mdl-30535219

RESUMEN

Importance: Atrial fibrillation (AF) is the most common arrhythmia affecting 1% of the population. Young individuals with AF have a strong genetic association with the disease, but the mechanisms remain incompletely understood. Objective: To perform large-scale whole-genome sequencing to identify genetic variants related to AF. Design, Setting, and Participants: The National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine Program includes longitudinal and cohort studies that underwent high-depth whole-genome sequencing between 2014 and 2017 in 18â¯526 individuals from the United States, Mexico, Puerto Rico, Costa Rica, Barbados, and Samoa. This case-control study included 2781 patients with early-onset AF from 9 studies and identified 4959 controls of European ancestry from the remaining participants. Results were replicated in the UK Biobank (346â¯546 participants) and the MyCode Study (42â¯782 participants). Exposures: Loss-of-function (LOF) variants in genes at AF loci and common genetic variation across the whole genome. Main Outcomes and Measures: Early-onset AF (defined as AF onset in persons <66 years of age). Due to multiple testing, the significance threshold for the rare variant analysis was P = 4.55 × 10-3. Results: Among 2781 participants with early-onset AF (the case group), 72.1% were men, and the mean (SD) age of AF onset was 48.7 (10.2) years. Participants underwent whole-genome sequencing at a mean depth of 37.8 fold and mean genome coverage of 99.1%. At least 1 LOF variant in TTN, the gene encoding the sarcomeric protein titin, was present in 2.1% of case participants compared with 1.1% in control participants (odds ratio [OR], 1.76 [95% CI, 1.04-2.97]). The proportion of individuals with early-onset AF who carried a LOF variant in TTN increased with an earlier age of AF onset (P value for trend, 4.92 × 10-4), and 6.5% of individuals with AF onset prior to age 30 carried a TTN LOF variant (OR, 5.94 [95% CI, 2.64-13.35]; P = 1.65 × 10-5). The association between TTN LOF variants and AF was replicated in an independent study of 1582 patients with early-onset AF (cases) and 41â¯200 control participants (OR, 2.16 [95% CI, 1.19-3.92]; P = .01). Conclusions and Relevance: In a case-control study, there was a statistically significant association between an LOF variant in the TTN gene and early-onset AF, with the variant present in a small percentage of participants with early-onset AF (the case group). Further research is necessary to understand whether this is a causal relationship.

Asunto(s)

Fibrilación Atrial/genética , Conectina/genética , Mutación con Pérdida de Función , Adulto , Edad de Inicio , Estudios de Casos y Controles , Femenino , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Heterocigoto , Humanos , Masculino , Persona de Mediana Edad , Control de Calidad

19.

Meta-Analysis of Genome-Wide Association Studies with Correlated Individuals: Application to the Hispanic Community Health Study/Study of Latinos (HCHS/SOL).

Sofer, Tamar; Shaffer, John R; Graff, Mariaelisa; Qi, Qibin; Stilp, Adrienne M; Gogarten, Stephanie M; North, Kari E; Isasi, Carmen R; Laurie, Cathy C; Szpiro, Adam A.

Genet Epidemiol ; 40(6): 492-501, 2016 09.

Artículo en Inglés | MEDLINE | ID: mdl-27256683

RESUMEN

Investigators often meta-analyze multiple genome-wide association studies (GWASs) to increase the power to detect associations of single nucleotide polymorphisms (SNPs) with a trait. Meta-analysis is also performed within a single cohort that is stratified by, e.g., sex or ancestry group. Having correlated individuals among the strata may complicate meta-analyses, limit power, and inflate Type 1 error. For example, in the Hispanic Community Health Study/Study of Latinos (HCHS/SOL), sources of correlation include genetic relatedness, shared household, and shared community. We propose a novel mixed-effect model for meta-analysis, "MetaCor," which accounts for correlation between stratum-specific effect estimates. Simulations show that MetaCor controls inflation better than alternatives such as ignoring the correlation between the strata or analyzing all strata together in a "pooled" GWAS, especially with different minor allele frequencies (MAFs) between strata. We illustrate the benefits of MetaCor on two GWASs in the HCHS/SOL. Analysis of dental caries (tooth decay) stratified by ancestry group detected a genome-wide significant SNP (rs7791001, P-value = 3.66×10-8, compared to 4.67×10-7 in pooled), with different MAFs between strata. Stratified analysis of body mass index (BMI) by ancestry group and sex reduced overall inflation from λGC=1.050 (pooled) to λGC=1.028 (MetaCor). Furthermore, even after removing close relatives to obtain nearly uncorrelated strata, a naïve stratified analysis resulted in λGC=1.058 compared to λGC=1.027 for MetaCor.

Asunto(s)

Estudio de Asociación del Genoma Completo , Hispánicos o Latinos/genética , Índice de Masa Corporal , Caries Dental/genética , Caries Dental/patología , Frecuencia de los Genes , Humanos , Modelos Genéticos , Fenotipo , Polimorfismo de Nucleótido Simple , Salud Pública

20.

Genome-wide meta-analysis of homocysteine and methionine metabolism identifies five one carbon metabolism loci and a novel association of ALDH1L1 with ischemic stroke.

Williams, Stephen R; Yang, Qiong; Chen, Fang; Liu, Xuan; Keene, Keith L; Jacques, Paul; Chen, Wei-Min; Weinstein, Galit; Hsu, Fang-Chi; Beiser, Alexa; Wang, Liewei; Bookman, Ebony; Doheny, Kimberly F; Wolf, Philip A; Zilka, Michelle; Selhub, Jacob; Nelson, Sarah; Gogarten, Stephanie M; Worrall, Bradford B; Seshadri, Sudha; Sale, Michèle M.

PLoS Genet ; 10(3): e1004214, 2014 Mar.

Artículo en Inglés | MEDLINE | ID: mdl-24651765

RESUMEN

Circulating homocysteine levels (tHcy), a product of the folate one carbon metabolism pathway (FOCM) through the demethylation of methionine, are heritable and are associated with an increased risk of common diseases such as stroke, cardiovascular disease (CVD), cancer and dementia. The FOCM is the sole source of de novo methyl group synthesis, impacting many biological and epigenetic pathways. However, the genetic determinants of elevated tHcy (hyperhomocysteinemia), dysregulation of methionine metabolism and the underlying biological processes remain unclear. We conducted independent genome-wide association studies and a meta-analysis of methionine metabolism, characterized by post-methionine load test tHcy, in 2,710 participants from the Framingham Heart Study (FHS) and 2,100 participants from the Vitamin Intervention for Stroke Prevention (VISP) clinical trial, and then examined the association of the identified loci with incident stroke in FHS. Five genes in the FOCM pathway (GNMT [p = 1.60 × 10(-63)], CBS [p = 3.15 × 10(-26)], CPS1 [p = 9.10 × 10(-13)], ALDH1L1 [p = 7.3 × 10(-13)] and PSPH [p = 1.17 × 10(-16)]) were strongly associated with the difference between pre- and post-methionine load test tHcy levels (ΔPOST). Of these, one variant in the ALDH1L1 locus, rs2364368, was associated with incident ischemic stroke. Promoter analyses reveal genetic and epigenetic differences that may explain a direct effect on GNMT transcription and a downstream affect on methionine metabolism. Additionally, a genetic-score consisting of the five significant loci explains 13% of the variance of ΔPOST in FHS and 6% of the variance in VISP. Association between variants in FOCM genes with ΔPOST suggest novel mechanisms that lead to differences in methionine metabolism, and possibly the epigenome, impacting disease risk. These data emphasize the importance of a concerted effort to understand regulators of one carbon metabolism as potential therapeutic targets.

Asunto(s)

Aldehído Deshidrogenasa/genética , Homocisteína/metabolismo , Metionina/metabolismo , Accidente Cerebrovascular/genética , Carbono/metabolismo , Ácido Fólico/metabolismo , Estudio de Asociación del Genoma Completo , Genotipo , Homocisteína/genética , Humanos , Metionina/genética , Oxidorreductasas actuantes sobre Donantes de Grupo CH-NH , Accidente Cerebrovascular/patología , Vitamina B 12

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA