Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 44
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Hum Mol Genet ; 2024 May 15.
Artículo en Inglés | MEDLINE | ID: mdl-38747556

RESUMEN

Inflammation biomarkers can provide valuable insight into the role of inflammatory processes in many diseases and conditions. Sequencing based analyses of such biomarkers can also serve as an exemplar of the genetic architecture of quantitative traits. To evaluate the biological insight, which can be provided by a multi-ancestry, whole-genome based association study, we performed a comprehensive analysis of 21 inflammation biomarkers from up to 38 465 individuals with whole-genome sequencing from the Trans-Omics for Precision Medicine (TOPMed) program (with varying sample size by trait, where the minimum sample size was n = 737 for MMP-1). We identified 22 distinct single-variant associations across 6 traits-E-selectin, intercellular adhesion molecule 1, interleukin-6, lipoprotein-associated phospholipase A2 activity and mass, and P-selectin-that remained significant after conditioning on previously identified associations for these inflammatory biomarkers. We further expanded upon known biomarker associations by pairing the single-variant analysis with a rare variant set-based analysis that further identified 19 significant rare variant set-based associations with 5 traits. These signals were distinct from both significant single variant association signals within TOPMed and genetic signals observed in prior studies, demonstrating the complementary value of performing both single and rare variant analyses when analyzing quantitative traits. We also confirm several previously reported signals from semi-quantitative proteomics platforms. Many of these signals demonstrate the extensive allelic heterogeneity and ancestry-differentiated variant-trait associations common for inflammation biomarkers, a characteristic we hypothesize will be increasingly observed with well-powered, large-scale analyses of complex traits.

2.
Am J Hum Genet ; 109(10): 1894-1908, 2022 10 06.
Artículo en Inglés | MEDLINE | ID: mdl-36206743

RESUMEN

Individuals with cystic fibrosis (CF) develop complications of the gastrointestinal tract influenced by genetic variants outside of CFTR. Cystic fibrosis-related diabetes (CFRD) is a distinct form of diabetes with a variable age of onset that occurs frequently in individuals with CF, while meconium ileus (MI) is a severe neonatal intestinal obstruction affecting ∼20% of newborns with CF. CFRD and MI are slightly correlated traits with previous evidence of overlap in their genetic architectures. To better understand the genetic commonality between CFRD and MI, we used whole-genome-sequencing data from the CF Genome Project to perform genome-wide association. These analyses revealed variants at 11 loci (6 not previously identified) that associated with MI and at 12 loci (5 not previously identified) that associated with CFRD. Of these, variants at SLC26A9, CEBPB, and PRSS1 associated with both traits; variants at SLC26A9 and CEBPB increased risk for both traits, while variants at PRSS1, the higher-risk alleles for CFRD, conferred lower risk for MI. Furthermore, common and rare variants within the SLC26A9 locus associated with MI only or CFRD only. As expected, different loci modify risk of CFRD and MI; however, a subset exhibit pleiotropic effects indicating etiologic and mechanistic overlap between these two otherwise distinct complications of CF.


Asunto(s)
Fibrosis Quística , Diabetes Mellitus , Enfermedades del Recién Nacido , Obstrucción Intestinal , Fibrosis Quística/complicaciones , Fibrosis Quística/genética , Regulador de Conductancia de Transmembrana de Fibrosis Quística/genética , Diabetes Mellitus/genética , Estudio de Asociación del Genoma Completo , Humanos , Recién Nacido , Obstrucción Intestinal/complicaciones , Obstrucción Intestinal/genética
3.
Am J Hum Genet ; 109(6): 1175-1181, 2022 06 02.
Artículo en Inglés | MEDLINE | ID: mdl-35504290

RESUMEN

Current publicly available tools that allow rapid exploration of linkage disequilibrium (LD) between markers (e.g., HaploReg and LDlink) are based on whole-genome sequence (WGS) data from 2,504 individuals in the 1000 Genomes Project. Here, we present TOP-LD, an online tool to explore LD inferred with high-coverage (∼30×) WGS data from 15,578 individuals in the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. TOP-LD provides a significant upgrade compared to current LD tools, as the TOPMed WGS data provide a more comprehensive representation of genetic variation than the 1000 Genomes data, particularly for rare variants and in the specific populations that we analyzed. For example, TOP-LD encompasses LD information for 150.3, 62.2, and 36.7 million variants for European, African, and East Asian ancestral samples, respectively, offering 2.6- to 9.1-fold increase in variant coverage compared to HaploReg 4.0 or LDlink. In addition, TOP-LD includes tens of thousands of structural variants (SVs). We demonstrate the value of TOP-LD in fine-mapping at the GGT1 locus associated with gamma glutamyltransferase in the African ancestry participants in UK Biobank. Beyond fine-mapping, TOP-LD can facilitate a wide range of applications that are based on summary statistics and estimates of LD. TOP-LD is freely available online.


Asunto(s)
Estudio de Asociación del Genoma Completo , Medicina de Precisión , Pueblo Asiatico , Humanos , Desequilibrio de Ligamiento/genética , Polimorfismo de Nucleótido Simple/genética , Secuenciación Completa del Genoma
4.
Am J Hum Genet ; 108(5): 874-893, 2021 05 06.
Artículo en Inglés | MEDLINE | ID: mdl-33887194

RESUMEN

Whole-genome sequencing (WGS), a powerful tool for detecting novel coding and non-coding disease-causing variants, has largely been applied to clinical diagnosis of inherited disorders. Here we leveraged WGS data in up to 62,653 ethnically diverse participants from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program and assessed statistical association of variants with seven red blood cell (RBC) quantitative traits. We discovered 14 single variant-RBC trait associations at 12 genomic loci, which have not been reported previously. Several of the RBC trait-variant associations (RPN1, ELL2, MIDN, HBB, HBA1, PIEZO1, and G6PD) were replicated in independent GWAS datasets imputed to the TOPMed reference panel. Most of these discovered variants are rare/low frequency, and several are observed disproportionately among non-European Ancestry (African, Hispanic/Latino, or East Asian) populations. We identified a 3 bp indel p.Lys2169del (g.88717175_88717177TCT[4]) (common only in the Ashkenazi Jewish population) of PIEZO1, a gene responsible for the Mendelian red cell disorder hereditary xerocytosis (MIM: 194380), associated with higher mean corpuscular hemoglobin concentration (MCHC). In stepwise conditional analysis and in gene-based rare variant aggregated association analysis, we identified several of the variants in HBB, HBA1, TMPRSS6, and G6PD that represent the carrier state for known coding, promoter, or splice site loss-of-function variants that cause inherited RBC disorders. Finally, we applied base and nuclease editing to demonstrate that the sentinel variant rs112097551 (nearest gene RPN1) acts through a cis-regulatory element that exerts long-range control of the gene RUVBL1 which is essential for hematopoiesis. Together, these results demonstrate the utility of WGS in ethnically diverse population-based samples and gene editing for expanding knowledge of the genetic architecture of quantitative hematologic traits and suggest a continuum between complex trait and Mendelian red cell disorders.


Asunto(s)
Eritrocitos/metabolismo , Eritrocitos/patología , Estudio de Asociación del Genoma Completo , National Heart, Lung, and Blood Institute (U.S.)/organización & administración , Fenotipo , Adulto , Anciano , Cromosomas Humanos Par 16/genética , Conjuntos de Datos como Asunto , Femenino , Edición Génica , Variación Genética/genética , Células HEK293 , Humanos , Masculino , Persona de Mediana Edad , Control de Calidad , Reproducibilidad de los Resultados , Estados Unidos
5.
PLoS Genet ; 15(4): e1007739, 2019 04.
Artículo en Inglés | MEDLINE | ID: mdl-30990817

RESUMEN

Sleep disordered breathing (SDB)-related overnight hypoxemia is associated with cardiometabolic disease and other comorbidities. Understanding the genetic bases for variations in nocturnal hypoxemia may help understand mechanisms influencing oxygenation and SDB-related mortality. We conducted genome-wide association tests across 10 cohorts and 4 populations to identify genetic variants associated with three correlated measures of overnight oxyhemoglobin saturation: average and minimum oxyhemoglobin saturation during sleep and the percent of sleep with oxyhemoglobin saturation under 90%. The discovery sample consisted of 8,326 individuals. Variants with p < 1 × 10(-6) were analyzed in a replication group of 14,410 individuals. We identified 3 significantly associated regions, including 2 regions in multi-ethnic analyses (2q12, 10q22). SNPs in the 2q12 region associated with minimum SpO2 (rs78136548 p = 2.70 × 10(-10)). SNPs at 10q22 were associated with all three traits including average SpO2 (rs72805692 p = 4.58 × 10(-8)). SNPs in both regions were associated in over 20,000 individuals and are supported by prior associations or functional evidence. Four additional significant regions were detected in secondary sex-stratified and combined discovery and replication analyses, including a region overlapping Reelin, a known marker of respiratory complex neurons.These are the first genome-wide significant findings reported for oxyhemoglobin saturation during sleep, a phenotype of high clinical interest. Our replicated associations with HK1 and IL18R1 suggest that variants in inflammatory pathways, such as the biologically-plausible NLRP3 inflammasome, may contribute to nocturnal hypoxemia.


Asunto(s)
Hexoquinasa/genética , Subunidad alfa del Receptor de Interleucina-18/genética , Oxihemoglobinas/metabolismo , Sueño/genética , Adolescente , Adulto , Anciano , Anciano de 80 o más Años , Moléculas de Adhesión Celular Neuronal/genética , Biología Computacional , Proteínas de la Matriz Extracelular/genética , Femenino , Redes Reguladoras de Genes , Variación Genética , Estudio de Asociación del Genoma Completo , Humanos , Hipoxia/sangre , Hipoxia/genética , Masculino , Persona de Mediana Edad , Proteína con Dominio Pirina 3 de la Familia NLR/genética , Proteínas del Tejido Nervioso/genética , Oxígeno/sangre , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Proteína Reelina , Serina Endopeptidasas/genética , Síndromes de la Apnea del Sueño/sangre , Síndromes de la Apnea del Sueño/genética , Adulto Joven
6.
Am J Epidemiol ; 190(10): 1977-1992, 2021 10 01.
Artículo en Inglés | MEDLINE | ID: mdl-33861317

RESUMEN

Genotype-phenotype association studies often combine phenotype data from multiple studies to increase statistical power. Harmonization of the data usually requires substantial effort due to heterogeneity in phenotype definitions, study design, data collection procedures, and data-set organization. Here we describe a centralized system for phenotype harmonization that includes input from phenotype domain and study experts, quality control, documentation, reproducible results, and data-sharing mechanisms. This system was developed for the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program, which is generating genomic and other -omics data for more than 80 studies with extensive phenotype data. To date, 63 phenotypes have been harmonized across thousands of participants (recruited in 1948-2012) from up to 17 studies per phenotype. Here we discuss challenges in this undertaking and how they were addressed. The harmonized phenotype data and associated documentation have been submitted to National Institutes of Health data repositories for controlled access by the scientific community. We also provide materials to facilitate future harmonization efforts by the community, which include 1) the software code used to generate the 63 harmonized phenotypes, enabling others to reproduce, modify, or extend these harmonizations to additional studies, and 2) the results of labeling thousands of phenotype variables with controlled vocabulary terms.


Asunto(s)
Estudios de Asociación Genética/métodos , Fenómica/métodos , Medicina de Precisión/métodos , Agregación de Datos , Humanos , Difusión de la Información , National Heart, Lung, and Blood Institute (U.S.) , Fenotipo , Evaluación de Programas y Proyectos de Salud , Estados Unidos
7.
Hum Mol Genet ; 28(4): 675-687, 2019 02 15.
Artículo en Inglés | MEDLINE | ID: mdl-30403821

RESUMEN

Obstructive sleep apnea (OSA) is a common disorder associated with increased risk of cardiovascular disease and mortality. Its prevalence and severity vary across ancestral background. Although OSA traits are heritable, few genetic associations have been identified. To identify genetic regions associated with OSA and improve statistical power, we applied admixture mapping on three primary OSA traits [the apnea hypopnea index (AHI), overnight average oxyhemoglobin saturation (SaO2) and percentage time SaO2 < 90%] and a secondary trait (respiratory event duration) in a Hispanic/Latino American population study of 11 575 individuals with significant variation in ancestral background. Linear mixed models were performed using previously inferred African, European and Amerindian local genetic ancestry markers. Global African ancestry was associated with a lower AHI, higher SaO2 and shorter event duration. Admixture mapping analysis of the primary OSA traits identified local African ancestry at the chromosomal region 2q37 as genome-wide significantly associated with AHI (P < 5.7 × 10-5), and European and Amerindian ancestries at 18q21 suggestively associated with both AHI and percentage time SaO2 < 90% (P < 10-3). Follow-up joint ancestry-SNP association analyses identified novel variants in ferrochelatase (FECH), significantly associated with AHI and percentage time SaO2 < 90% after adjusting for multiple tests (P < 8 × 10-6). These signals contributed to the admixture mapping associations and were replicated in independent cohorts. In this first admixture mapping study of OSA, novel associations with variants in the iron/heme metabolism pathway suggest a role for iron in influencing respiratory traits underlying OSA.


Asunto(s)
Ferroquelatasa/genética , Estudio de Asociación del Genoma Completo , Apnea Obstructiva del Sueño/genética , Anciano , Mapeo Cromosómico , Femenino , Genotipo , Hispánicos o Latinos/genética , Humanos , Masculino , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple/genética , Polisomnografía , Apnea Obstructiva del Sueño/diagnóstico por imagen , Apnea Obstructiva del Sueño/fisiopatología , Población Blanca/genética
8.
Am J Hum Genet ; 98(4): 653-66, 2016 Apr 07.
Artículo en Inglés | MEDLINE | ID: mdl-27018471

RESUMEN

Linear mixed models (LMMs) are widely used in genome-wide association studies (GWASs) to account for population structure and relatedness, for both continuous and binary traits. Motivated by the failure of LMMs to control type I errors in a GWAS of asthma, a binary trait, we show that LMMs are generally inappropriate for analyzing binary traits when population stratification leads to violation of the LMM's constant-residual variance assumption. To overcome this problem, we develop a computationally efficient logistic mixed model approach for genome-wide analysis of binary traits, the generalized linear mixed model association test (GMMAT). This approach fits a logistic mixed model once per GWAS and performs score tests under the null hypothesis of no association between a binary trait and individual genetic variants. We show in simulation studies and real data analysis that GMMAT effectively controls for population structure and relatedness when analyzing binary traits in a wide variety of study designs.


Asunto(s)
Estudios de Asociación Genética/métodos , Genética de Población/métodos , Modelos Lineales , Fenotipo , Asma/genética , Estudios de Casos y Controles , América Central , Simulación por Computador , Técnicas de Genotipaje , Humanos , Modelos Logísticos , Modelos Genéticos , Filogeografía , Polimorfismo de Nucleótido Simple , América del Sur
9.
Am J Hum Genet ; 99(3): 636-646, 2016 Sep 01.
Artículo en Inglés | MEDLINE | ID: mdl-27588450

RESUMEN

We analyzed genome-wide association studies (GWASs), including data from 71,638 individuals from four ancestries, for estimated glomerular filtration rate (eGFR), a measure of kidney function used to define chronic kidney disease (CKD). We identified 20 loci attaining genome-wide-significant evidence of association (p < 5 × 10(-8)) with kidney function and highlighted that allelic effects on eGFR at lead SNPs are homogeneous across ancestries. We leveraged differences in the pattern of linkage disequilibrium between diverse populations to fine-map the 20 loci through construction of "credible sets" of variants driving eGFR association signals. Credible variants at the 20 eGFR loci were enriched for DNase I hypersensitivity sites (DHSs) in human kidney cells. DHS credible variants were expression quantitative trait loci for NFATC1 and RGS14 (at the SLC34A1 locus) in multiple tissues. Loss-of-function mutations in ancestral orthologs of both genes in Drosophila melanogaster were associated with altered sensitivity to salt stress. Renal mRNA expression of Nfatc1 and Rgs14 in a salt-sensitive mouse model was also reduced after exposure to a high-salt diet or induced CKD. Our study (1) demonstrates the utility of trans-ethnic fine mapping through integration of GWASs involving diverse populations with genomic annotation from relevant tissues to define molecular mechanisms by which association signals exert their effect and (2) suggests that salt sensitivity might be an important marker for biological processes that affect kidney function and CKD in humans.


Asunto(s)
Etnicidad/genética , Estudio de Asociación del Genoma Completo , Riñón/fisiopatología , Insuficiencia Renal Crónica/genética , Insuficiencia Renal Crónica/fisiopatología , Cloruro de Sodio/farmacología , Estrés Fisiológico/efectos de los fármacos , Estrés Fisiológico/genética , Alelos , Animales , Desoxirribonucleasa I/metabolismo , Diabetes Mellitus/genética , Modelos Animales de Enfermedad , Drosophila melanogaster/genética , Femenino , Tasa de Filtración Glomerular/genética , Humanos , Riñón/patología , Desequilibrio de Ligamiento , Masculino , Factores de Transcripción NFATC/genética , Polimorfismo de Nucleótido Simple/genética , Sitios de Carácter Cuantitativo , Proteínas RGS/genética , Grupos Raciales/genética , Tolerancia a la Sal/genética , Proteínas Cotransportadoras de Sodio-Fosfato de Tipo IIa/genética
10.
Am J Hum Genet ; 98(1): 165-84, 2016 Jan 07.
Artículo en Inglés | MEDLINE | ID: mdl-26748518

RESUMEN

US Hispanic/Latino individuals are diverse in genetic ancestry, culture, and environmental exposures. Here, we characterized and controlled for this diversity in genome-wide association studies (GWASs) for the Hispanic Community Health Study/Study of Latinos (HCHS/SOL). We simultaneously estimated population-structure principal components (PCs) robust to familial relatedness and pairwise kinship coefficients (KCs) robust to population structure, admixture, and Hardy-Weinberg departures. The PCs revealed substantial genetic differentiation within and among six self-identified background groups (Cuban, Dominican, Puerto Rican, Mexican, and Central and South American). To control for variation among groups, we developed a multi-dimensional clustering method to define a "genetic-analysis group" variable that retains many properties of self-identified background while achieving substantially greater genetic homogeneity within groups and including participants with non-specific self-identification. In GWASs of 22 biomedical traits, we used a linear mixed model (LMM) including pairwise empirical KCs to account for familial relatedness, PCs for ancestry, and genetic-analysis groups for additional group-associated effects. Including the genetic-analysis group as a covariate accounted for significant trait variation in 8 of 22 traits, even after we fit 20 PCs. Additionally, genetic-analysis groups had significant heterogeneity of residual variance for 20 of 22 traits, and modeling this heteroscedasticity within the LMM reduced genomic inflation for 19 traits. Furthermore, fitting an LMM that utilized a genetic-analysis group rather than a self-identified background group achieved higher power to detect previously reported associations. We expect that the methods applied here will be useful in other studies with multiple ethnic groups, admixture, and relatedness.


Asunto(s)
Variación Genética , Hispánicos o Latinos/genética , Estudio de Asociación del Genoma Completo , Humanos , Estados Unidos
11.
Am J Hum Genet ; 98(2): 229-42, 2016 Feb 04.
Artículo en Inglés | MEDLINE | ID: mdl-26805783

RESUMEN

Platelets play an essential role in hemostasis and thrombosis. We performed a genome-wide association study of platelet count in 12,491 participants of the Hispanic Community Health Study/Study of Latinos by using a mixed-model method that accounts for admixture and family relationships. We discovered and replicated associations with five genes (ACTN1, ETV7, GABBR1-MOG, MEF2C, and ZBTB9-BAK1). Our strongest association was with Amerindian-specific variant rs117672662 (p value = 1.16 × 10(-28)) in ACTN1, a gene implicated in congenital macrothrombocytopenia. rs117672662 exhibited allelic differences in transcriptional activity and protein binding in hematopoietic cells. Our results underscore the value of diverse populations to extend insights into the allelic architecture of complex traits.


Asunto(s)
Estudios de Asociación Genética/métodos , Sitios Genéticos , Hispánicos o Latinos/genética , Recuento de Plaquetas , Actinina/genética , Adolescente , Adulto , Anciano , Alelos , Frecuencia de los Genes , Genotipo , Técnicas de Genotipaje , Humanos , Factores de Transcripción MEF2/genética , Proteínas de la Membrana/genética , Persona de Mediana Edad , Fenotipo , Polimorfismo de Nucleótido Simple , Receptores de GABA-B/genética , Adulto Joven
13.
Am J Respir Crit Care Med ; 198(2): 208-219, 2018 07 15.
Artículo en Inglés | MEDLINE | ID: mdl-29394082

RESUMEN

RATIONALE: Lung function and chronic obstructive pulmonary disease (COPD) are heritable traits. Genome-wide association studies (GWAS) have identified numerous pulmonary function and COPD loci, primarily in cohorts of European ancestry. OBJECTIVES: Perform a GWAS of COPD phenotypes in Hispanic/Latino populations to identify loci not previously detected in European populations. METHODS: GWAS of lung function and COPD in Hispanic/Latino participants from a population-based cohort. We performed replication studies of novel loci in independent studies. MEASUREMENTS AND MAIN RESULTS: Among 11,822 Hispanic/Latino participants, we identified eight novel signals; three replicated in independent populations of European Ancestry. A novel locus for FEV1 in ZSWIM7 (rs4791658; P = 4.99 × 10-9) replicated. A rare variant (minor allele frequency = 0.002) in HAL (rs145174011) was associated with FEV1/FVC (P = 9.59 × 10-9) in a region previously identified for COPD-related phenotypes; it remained significant in conditional analyses but did not replicate. Admixture mapping identified a novel region, with a variant in AGMO (rs41331850), associated with Amerindian ancestry and FEV1, which replicated. A novel locus for FEV1 identified among ever smokers (rs291231; P = 1.92 × 10-8) approached statistical significance for replication in admixed populations of African ancestry, and a novel SNP for COPD in PDZD2 (rs7709630; P = 1.56 × 10-8) regionally replicated. In addition, loci previously identified for lung function in European samples were associated in Hispanic/Latino participants in the Hispanic Community Health Study/Study of Latinos at the genome-wide significance level. CONCLUSIONS: We identified novel signals for lung function and COPD in a Hispanic/Latino cohort. Including admixed populations when performing genetic studies may identify variants contributing to genetic etiologies of COPD.


Asunto(s)
Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Hispánicos o Latinos/genética , Enfermedad Pulmonar Obstructiva Crónica/genética , Población Blanca/genética , Adolescente , Adulto , Anciano , Estudios de Cohortes , Europa (Continente) , Femenino , Frecuencia de los Genes , Sitios Genéticos , Humanos , Masculino , Persona de Mediana Edad , Pruebas de Función Respiratoria , Estados Unidos , Adulto Joven
14.
Am J Respir Cell Mol Biol ; 58(3): 391-401, 2018 03.
Artículo en Inglés | MEDLINE | ID: mdl-29077507

RESUMEN

Obstructive sleep apnea (OSA) is a common heritable disorder displaying marked sexual dimorphism in disease prevalence and progression. Previous genetic association studies have identified a few genetic loci associated with OSA and related quantitative traits, but they have only focused on single ethnic groups, and a large proportion of the heritability remains unexplained. The apnea-hypopnea index (AHI) is a commonly used quantitative measure characterizing OSA severity. Because OSA differs by sex, and the pathophysiology of obstructive events differ in rapid eye movement (REM) and non-REM (NREM) sleep, we hypothesized that additional genetic association signals would be identified by analyzing the NREM/REM-specific AHI and by conducting sex-specific analyses in multiethnic samples. We performed genome-wide association tests for up to 19,733 participants of African, Asian, European, and Hispanic/Latino American ancestry in 7 studies. We identified rs12936587 on chromosome 17 as a possible quantitative trait locus for NREM AHI in men (N = 6,737; P = 1.7 × 10-8) but not in women (P = 0.77). The association with NREM AHI was replicated in a physiological research study (N = 67; P = 0.047). This locus overlapping the RAI1 gene and encompassing genes PEMT1, SREBF1, and RASD1 was previously reported to be associated with coronary artery disease, lipid metabolism, and implicated in Potocki-Lupski syndrome and Smith-Magenis syndrome, which are characterized by abnormal sleep phenotypes. We also identified gene-by-sex interactions in suggestive association regions, suggesting that genetic variants for AHI appear to vary by sex, consistent with the clinical observations of strong sexual dimorphism.


Asunto(s)
Estudio de Asociación del Genoma Completo , Sitios de Carácter Cuantitativo/genética , Apnea Obstructiva del Sueño/genética , Sueño REM/fisiología , Factores de Transcripción/genética , Adulto , Anciano , Femenino , Humanos , Masculino , Persona de Mediana Edad , Fosfatidiletanolamina N-Metiltransferasa/genética , Caracteres Sexuales , Proteína 1 de Unión a los Elementos Reguladores de Esteroles/genética , Transactivadores , Proteínas ras/genética
15.
Carcinogenesis ; 39(9): 1135-1140, 2018 09 21.
Artículo en Inglés | MEDLINE | ID: mdl-29924316

RESUMEN

To identify genetic variation associated with lung cancer risk, we performed a genome-wide association analysis of 685 lung cancer cases that had a family history of two or more first or second degree relatives compared with 744 controls without lung cancer that were genotyped on an Illumina Human OmniExpressExome-8v1 array. To ensure robust results, we further evaluated these findings using data from six additional studies that were assembled through the Transdisciplinary Research on Cancer of the Lung Consortium comprising 1993 familial cases and 33 690 controls. We performed a meta-analysis after imputation of all variants using the 1000 Genomes Project Phase 1 (version 3 release date September 2013). Analyses were conducted for 9 327 222 SNPs integrating data from the two sources. A novel variant on chromosome 4p15.31 near the LCORL gene and an imputed rare variant intergenic between CDKN2A and IFNA8 on chromosome 9p21.3 were identified at a genome-wide level of significance for squamous cell carcinomas. Additionally, associations of CHRNA3 and CHRNA5 on chromosome 15q25.1 in sporadic lung cancer were confirmed at a genome-wide level of significance in familial lung cancer. Previously identified variants in or near CHRNA2, BRCA2, CYP2A6 for overall lung cancer, TERT, SECISPB2L and RTEL1 for adenocarcinoma and RAD52 and MHC for squamous carcinoma were significantly associated with lung cancer.


Asunto(s)
Adenocarcinoma/genética , Carcinoma de Células Escamosas/genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Neoplasias Pulmonares/epidemiología , Neoplasias Pulmonares/genética , Estudios de Casos y Controles , Cromosomas Humanos Par 15/genética , Cromosomas Humanos Par 4 , Cromosomas Humanos Par 9/genética , Humanos , Pulmón/patología , Anamnesis , Polimorfismo de Nucleótido Simple/genética
16.
Hum Mol Genet ; 25(15): 3245-3254, 2016 08 01.
Artículo en Inglés | MEDLINE | ID: mdl-27346520

RESUMEN

Imputation is commonly used in genome-wide association studies to expand the set of genetic variants available for analysis. Larger and more diverse reference panels, such as the final Phase 3 of the 1000 Genomes Project, hold promise for improving imputation accuracy in genetically diverse populations such as Hispanics/Latinos in the USA. Here, we sought to empirically evaluate imputation accuracy when imputing to a 1000 Genomes Phase 3 versus a Phase 1 reference, using participants from the Hispanic Community Health Study/Study of Latinos. Our assessments included calculating the correlation between imputed and observed allelic dosage in a subset of samples genotyped on a supplemental array. We observed that the Phase 3 reference yielded higher accuracy at rare variants, but that the two reference panels were comparable at common variants. At a sample level, the Phase 3 reference improved imputation accuracy in Hispanic/Latino samples from the Caribbean more than for Mainland samples, which we attribute primarily to the additional reference panel samples available in Phase 3. We conclude that a 1000 Genomes Project Phase 3 reference panel can yield improved imputation accuracy compared with Phase 1, particularly for rare variants and for samples of certain genetic ancestry compositions. Our findings can inform imputation design for other genome-wide association studies of participants with diverse ancestries, especially as larger and more diverse reference panels continue to become available.


Asunto(s)
Estudio de Asociación del Genoma Completo , Hispánicos o Latinos/genética , Proyecto Genoma Humano , Femenino , Humanos , Masculino , Estados Unidos
17.
Bioinformatics ; 33(15): 2251-2257, 2017 Aug 01.
Artículo en Inglés | MEDLINE | ID: mdl-28334390

RESUMEN

MOTIVATION: Whole-genome sequencing (WGS) data are being generated at an unprecedented rate. Analysis of WGS data requires a flexible data format to store the different types of DNA variation. Variant call format (VCF) is a general text-based format developed to store variant genotypes and their annotations. However, VCF files are large and data retrieval is relatively slow. Here we introduce a new WGS variant data format implemented in the R/Bioconductor package 'SeqArray' for storing variant calls in an array-oriented manner which provides the same capabilities as VCF, but with multiple high compression options and data access using high-performance parallel computing. RESULTS: Benchmarks using 1000 Genomes Phase 3 data show file sizes are 14.0 Gb (VCF), 12.3 Gb (BCF, binary VCF), 3.5 Gb (BGT) and 2.6 Gb (SeqArray) respectively. Reading genotypes in the SeqArray package are two to three times faster compared with the htslib C library using BCF files. For the allele frequency calculation, the implementation in the SeqArray package is over 5 times faster than PLINK v1.9 with VCF and BCF files, and over 16 times faster than vcftools. When used in conjunction with R/Bioconductor packages, the SeqArray package provides users a flexible, feature-rich, high-performance programming environment for analysis of WGS variant data. AVAILABILITY AND IMPLEMENTATION: http://www.bioconductor.org/packages/SeqArray. CONTACT: zhengx@u.washington.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Compresión de Datos/métodos , Variación Genética , Programas Informáticos , Secuenciación Completa del Genoma/métodos , Genoma Humano , Genómica/métodos , Humanos
18.
Nicotine Tob Res ; 20(4): 448-457, 2018 03 06.
Artículo en Inglés | MEDLINE | ID: mdl-28520984

RESUMEN

Introduction: Genetic variants associated with nicotine dependence have previously been identified, primarily in European-ancestry populations. No genome-wide association studies (GWAS) have been reported for smoking behaviors in Hispanics/Latinos in the United States and Latin America, who are of mixed ancestry with European, African, and American Indigenous components. Methods: We examined genetic associations with smoking behaviors in the Hispanic Community Health Study/Study of Latinos (HCHS/SOL) (N = 12 741 with smoking data, 5119 ever-smokers), using ~2.3 million genotyped variants imputed to the 1000 Genomes Project phase 3. Mixed logistic regression models accounted for population structure, sampling, relatedness, sex, and age. Results: The known region of CHRNA5, which encodes the α5 cholinergic nicotinic receptor subunit, was associated with heavy smoking at genome-wide significance (p ≤ 5 × 10-8) in a comparison of 1929 ever-smokers reporting cigarettes per day (CPD) > 10 versus 3156 reporting CPD ≤ 10. The functional variant rs16969968 in CHRNA5 had a p value of 2.20 × 10-7 and odds ratio (OR) of 1.32 for the minor allele (A); its minor allele frequency was 0.22 overall and similar across Hispanic/Latino background groups (Central American = 0.17; South American = 0.19; Mexican = 0.18; Puerto Rican = 0.22; Cuban = 0.29; Dominican = 0.19). CHRNA4 on chromosome 20 attained p < 10-4, supporting prior findings in non-Hispanics. For nondaily smoking, which is prevalent in Hispanic/Latino smokers, compared to daily smoking, loci on chromosomes 2 and 4 achieved genome-wide significance; replication attempts were limited by small Hispanic/Latino sample sizes. Conclusions: Associations of nicotinic receptor gene variants with smoking, first reported in non-Hispanic European-ancestry populations, generalized to Hispanics/Latinos despite different patterns of smoking behavior. Implications: We conducted the first large-scale genome-wide association study (GWAS) of smoking behavior in a US Hispanic/Latino cohort, and the first GWAS of daily/nondaily smoking in any population. Results show that the region of the nicotinic receptor subunit gene CHRNA5, which in non-Hispanic European-ancestry smokers has been associated with heavy smoking as well as cessation and treatment efficacy, is also significantly associated with heavy smoking in this Hispanic/Latino cohort. The results are an important addition to understanding the impact of genetic variants in understudied Hispanic/Latino smokers.


Asunto(s)
Estudio de Asociación del Genoma Completo/métodos , Hispánicos o Latinos/genética , Proteínas del Tejido Nervioso/genética , Salud Pública/métodos , Receptores Nicotínicos/genética , Fumar/epidemiología , Fumar/genética , Adulto , Femenino , Frecuencia de los Genes , Genotipo , Humanos , Masculino , Persona de Mediana Edad , Estados Unidos/epidemiología
19.
J Am Soc Nephrol ; 28(7): 2211-2220, 2017 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-28137830

RESUMEN

Increased urine albumin excretion is highly prevalent in Hispanics/Latinos. Previous studies have found an association between urine albumin excretion and Amerindian ancestry in Hispanic/Latino populations. Admixture between racial/ethnic groups creates long-range linkage disequilibrium between variants with different allelic frequencies in the founding populations and it can be used to localize genes. Hispanic/Latino genomes are an admixture of European, African, and Amerindian ancestries. We leveraged this admixture to identify associations between urine albumin excretion (urine albumin-to-creatinine ratio [UACR]) and genomic regions harboring variants with highly differentiated allele frequencies among the ancestral populations. Admixture mapping analysis of 12,212 Hispanic Community Health Study/Study of Latinos participants, using a linear mixed model, identified three novel genome-wide significant signals on chromosomes 2, 11, and 16. The admixture mapping signal identified on chromosome 2, spanning q11.2-14.1 and not previously reported for UACR, is driven by a difference between Amerindian ancestry and the other two ancestries (P<5.7 × 10-5). Within this locus, two common variants located at the proapoptotic BCL2L11 gene associated with UACR: rs116907128 (allele frequency =0.14; P=1.5 × 10-7) and rs586283 (C allele frequency =0.35; P=4.2 × 10-7). In a secondary analysis, rs116907128 accounted for most of the admixture mapping signal observed in the region. The rs116907128 variant is common among full-heritage Pima Indians (A allele frequency =0.54) but is monomorphic in the 1000 Genomes European and African populations. In a replication analysis using a sample of full-heritage Pima Indians, rs116907128 significantly associated with UACR (P=0.01; n=1568). Our findings provide evidence for the presence of Amerindian-specific variants influencing the variation of urine albumin excretion in Hispanics/Latinos.


Asunto(s)
Albuminuria/genética , Mapeo Cromosómico , Grupos Raciales/genética , Población Negra/genética , Femenino , Frecuencia de los Genes , Hispánicos o Latinos/genética , Humanos , Indígenas Norteamericanos/genética , Masculino , Persona de Mediana Edad , Estados Unidos , Población Blanca
20.
J Am Soc Nephrol ; 28(3): 915-922, 2017 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-27650483

RESUMEN

African ancestry alleles may contribute to CKD among Hispanics/Latinos, but whether associations differ by Hispanic/Latino background remains unknown. We examined the association of CKD measures with African ancestry-specific APOL1 alleles that were directly genotyped and sickle cell trait (hemoglobin subunit ß gene [HBB] variant) on the basis of imputation in 12,226 adult Hispanics/Latinos grouped according to Caribbean or Mainland background. We also performed an unbiased genome-wide association scan of urine albumin-to-creatinine ratios. Overall, 41.4% of participants were male, 44.6% of participants had a Caribbean background, and the mean age of all participants was 46.1 years. The Caribbean background group, compared with the Mainland background group, had a higher frequency of two APOL1 alleles (1.0% versus 0.1%) and the HBB variant (2.0% versus 0.7%). In the Caribbean background group, presence of APOL1 alleles (2 versus 0/1 copies) or the HBB variant (1 versus 0 copies) were significantly associated with albuminuria (odds ratio [OR], 3.2; 95% confidence interval [95% CI], 1.7 to 6.1; and OR, 2.6; 95% CI, 1.8 to 3.8, respectively) and albuminuria and/or eGFR<60 ml/min per 1.73 m2 (OR, 2.9; 95% CI, 1.5 to 5.4; and OR, 2.4; 95% CI, 1.7 to 3.5, respectively). The urine albumin-to-creatinine ratio genome-wide association scan identified associations with the HBB variant among all participants, with the strongest association in the Caribbean background group (P=3.1×10-10 versus P=9.3×10-3 for the Mainland background group). In conclusion, African-specific alleles associate with CKD in Hispanics/Latinos, but allele frequency varies by Hispanic/Latino background/ancestry.


Asunto(s)
Alelos , Población Negra/genética , Hispánicos o Latinos/genética , Insuficiencia Renal Crónica/epidemiología , Insuficiencia Renal Crónica/genética , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Persona de Mediana Edad , Factores de Riesgo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA