1.
Hum Mol Genet ; 2024 May 15.
Article in English | MEDLINE | ID: mdl-38747556

ABSTRACT

Inflammation biomarkers can provide valuable insight into the role of inflammatory processes in many diseases and conditions. Sequencing-based analyses of such biomarkers can also serve as an exemplar of the genetic architecture of quantitative traits. To evaluate the biological insight that a multi-ancestry, whole-genome-based association study can provide, we performed a comprehensive analysis of 21 inflammation biomarkers from up to 38 465 individuals with whole-genome sequencing from the Trans-Omics for Precision Medicine (TOPMed) program (with varying sample size by trait, where the minimum sample size was n = 737 for MMP-1). We identified 22 distinct single-variant associations across 6 traits (E-selectin, intercellular adhesion molecule 1, interleukin-6, lipoprotein-associated phospholipase A2 activity and mass, and P-selectin) that remained significant after conditioning on previously identified associations for these inflammatory biomarkers. We further expanded upon known biomarker associations by pairing the single-variant analysis with a rare variant set-based analysis that further identified 19 significant rare variant set-based associations with 5 traits. These signals were distinct from both significant single variant association signals within TOPMed and genetic signals observed in prior studies, demonstrating the complementary value of performing both single and rare variant analyses when analyzing quantitative traits. We also confirm several previously reported signals from semi-quantitative proteomics platforms. Many of these signals demonstrate the extensive allelic heterogeneity and ancestry-differentiated variant-trait associations common for inflammation biomarkers, a characteristic we hypothesize will be increasingly observed with well-powered, large-scale analyses of complex traits.

2.
Am J Hum Genet ; 109(10): 1894-1908, 2022 10 06.
Article in English | MEDLINE | ID: mdl-36206743

ABSTRACT

Individuals with cystic fibrosis (CF) develop complications of the gastrointestinal tract influenced by genetic variants outside of CFTR. Cystic fibrosis-related diabetes (CFRD) is a distinct form of diabetes with a variable age of onset that occurs frequently in individuals with CF, while meconium ileus (MI) is a severe neonatal intestinal obstruction affecting ∼20% of newborns with CF. CFRD and MI are slightly correlated traits with previous evidence of overlap in their genetic architectures. To better understand the genetic commonality between CFRD and MI, we used whole-genome-sequencing data from the CF Genome Project to perform genome-wide association. These analyses revealed variants at 11 loci (6 not previously identified) that associated with MI and at 12 loci (5 not previously identified) that associated with CFRD. Of these, variants at SLC26A9, CEBPB, and PRSS1 associated with both traits; variants at SLC26A9 and CEBPB increased risk for both traits, while at PRSS1 the alleles conferring higher risk for CFRD conferred lower risk for MI. Furthermore, common and rare variants within the SLC26A9 locus associated with MI only or CFRD only. As expected, different loci modify risk of CFRD and MI; however, a subset exhibit pleiotropic effects indicating etiologic and mechanistic overlap between these two otherwise distinct complications of CF.


Subjects
Cystic Fibrosis; Diabetes Mellitus; Infant, Newborn, Diseases; Intestinal Obstruction; Cystic Fibrosis/complications; Cystic Fibrosis/genetics; Cystic Fibrosis Transmembrane Conductance Regulator/genetics; Diabetes Mellitus/genetics; Genome-Wide Association Study; Humans; Infant, Newborn; Intestinal Obstruction/complications; Intestinal Obstruction/genetics
3.
Am J Hum Genet ; 109(6): 1175-1181, 2022 06 02.
Article in English | MEDLINE | ID: mdl-35504290

ABSTRACT

Current publicly available tools that allow rapid exploration of linkage disequilibrium (LD) between markers (e.g., HaploReg and LDlink) are based on whole-genome sequence (WGS) data from 2,504 individuals in the 1000 Genomes Project. Here, we present TOP-LD, an online tool to explore LD inferred with high-coverage (∼30×) WGS data from 15,578 individuals in the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. TOP-LD provides a significant upgrade compared to current LD tools, as the TOPMed WGS data provide a more comprehensive representation of genetic variation than the 1000 Genomes data, particularly for rare variants and in the specific populations that we analyzed. For example, TOP-LD encompasses LD information for 150.3, 62.2, and 36.7 million variants for European, African, and East Asian ancestral samples, respectively, offering 2.6- to 9.1-fold increase in variant coverage compared to HaploReg 4.0 or LDlink. In addition, TOP-LD includes tens of thousands of structural variants (SVs). We demonstrate the value of TOP-LD in fine-mapping at the GGT1 locus associated with gamma glutamyltransferase in the African ancestry participants in UK Biobank. Beyond fine-mapping, TOP-LD can facilitate a wide range of applications that are based on summary statistics and estimates of LD. TOP-LD is freely available online.


Subjects
Genome-Wide Association Study; Precision Medicine; Asian People; Humans; Linkage Disequilibrium/genetics; Polymorphism, Single Nucleotide/genetics; Whole Genome Sequencing
4.
Am J Hum Genet ; 108(5): 874-893, 2021 05 06.
Article in English | MEDLINE | ID: mdl-33887194

ABSTRACT

Whole-genome sequencing (WGS), a powerful tool for detecting novel coding and non-coding disease-causing variants, has largely been applied to clinical diagnosis of inherited disorders. Here we leveraged WGS data in up to 62,653 ethnically diverse participants from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program and assessed statistical association of variants with seven red blood cell (RBC) quantitative traits. We discovered 14 single variant-RBC trait associations at 12 genomic loci, which have not been reported previously. Several of the RBC trait-variant associations (RPN1, ELL2, MIDN, HBB, HBA1, PIEZO1, and G6PD) were replicated in independent GWAS datasets imputed to the TOPMed reference panel. Most of these discovered variants are rare/low frequency, and several are observed disproportionately among non-European Ancestry (African, Hispanic/Latino, or East Asian) populations. We identified a 3 bp indel p.Lys2169del (g.88717175_88717177TCT[4]) (common only in the Ashkenazi Jewish population) of PIEZO1, a gene responsible for the Mendelian red cell disorder hereditary xerocytosis (MIM: 194380), associated with higher mean corpuscular hemoglobin concentration (MCHC). In stepwise conditional analysis and in gene-based rare variant aggregated association analysis, we identified several of the variants in HBB, HBA1, TMPRSS6, and G6PD that represent the carrier state for known coding, promoter, or splice site loss-of-function variants that cause inherited RBC disorders. Finally, we applied base and nuclease editing to demonstrate that the sentinel variant rs112097551 (nearest gene RPN1) acts through a cis-regulatory element that exerts long-range control of the gene RUVBL1 which is essential for hematopoiesis. Together, these results demonstrate the utility of WGS in ethnically diverse population-based samples and gene editing for expanding knowledge of the genetic architecture of quantitative hematologic traits and suggest a continuum between complex trait and Mendelian red cell disorders.


Subjects
Erythrocytes/metabolism; Erythrocytes/pathology; Genome-Wide Association Study; National Heart, Lung, and Blood Institute (U.S.)/organization & administration; Phenotype; Adult; Aged; Chromosomes, Human, Pair 16/genetics; Datasets as Topic; Female; Gene Editing; Genetic Variation/genetics; HEK293 Cells; Humans; Male; Middle Aged; Quality Control; Reproducibility of Results; United States
5.
PLoS Genet ; 15(4): e1007739, 2019 04.
Article in English | MEDLINE | ID: mdl-30990817

ABSTRACT

Sleep disordered breathing (SDB)-related overnight hypoxemia is associated with cardiometabolic disease and other comorbidities. Understanding the genetic bases for variations in nocturnal hypoxemia may help understand mechanisms influencing oxygenation and SDB-related mortality. We conducted genome-wide association tests across 10 cohorts and 4 populations to identify genetic variants associated with three correlated measures of overnight oxyhemoglobin saturation: average and minimum oxyhemoglobin saturation during sleep and the percent of sleep with oxyhemoglobin saturation under 90%. The discovery sample consisted of 8,326 individuals. Variants with p < 1 × 10⁻⁶ were analyzed in a replication group of 14,410 individuals. We identified 3 significantly associated regions, including 2 regions in multi-ethnic analyses (2q12, 10q22). SNPs in the 2q12 region associated with minimum SpO2 (rs78136548 p = 2.70 × 10⁻¹⁰). SNPs at 10q22 were associated with all three traits including average SpO2 (rs72805692 p = 4.58 × 10⁻⁸). SNPs in both regions were associated in over 20,000 individuals and are supported by prior associations or functional evidence. Four additional significant regions were detected in secondary sex-stratified and combined discovery and replication analyses, including a region overlapping Reelin, a known marker of respiratory complex neurons. These are the first genome-wide significant findings reported for oxyhemoglobin saturation during sleep, a phenotype of high clinical interest. Our replicated associations with HK1 and IL18R1 suggest that variants in inflammatory pathways, such as the biologically-plausible NLRP3 inflammasome, may contribute to nocturnal hypoxemia.


Subjects
Hexokinase/genetics; Interleukin-18 Receptor alpha Subunit/genetics; Oxyhemoglobins/metabolism; Sleep/genetics; Adolescent; Adult; Aged; Aged, 80 and over; Cell Adhesion Molecules, Neuronal/genetics; Computational Biology; Extracellular Matrix Proteins/genetics; Female; Gene Regulatory Networks; Genetic Variation; Genome-Wide Association Study; Humans; Hypoxia/blood; Hypoxia/genetics; Male; Middle Aged; NLR Family, Pyrin Domain-Containing 3 Protein/genetics; Nerve Tissue Proteins/genetics; Oxygen/blood; Polymorphism, Single Nucleotide; Quantitative Trait Loci; Reelin Protein; Serine Endopeptidases/genetics; Sleep Apnea Syndromes/blood; Sleep Apnea Syndromes/genetics; Young Adult
6.
Am J Epidemiol ; 190(10): 1977-1992, 2021 10 01.
Article in English | MEDLINE | ID: mdl-33861317

ABSTRACT

Genotype-phenotype association studies often combine phenotype data from multiple studies to increase statistical power. Harmonization of the data usually requires substantial effort due to heterogeneity in phenotype definitions, study design, data collection procedures, and data-set organization. Here we describe a centralized system for phenotype harmonization that includes input from phenotype domain and study experts, quality control, documentation, reproducible results, and data-sharing mechanisms. This system was developed for the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program, which is generating genomic and other -omics data for more than 80 studies with extensive phenotype data. To date, 63 phenotypes have been harmonized across thousands of participants (recruited in 1948-2012) from up to 17 studies per phenotype. Here we discuss challenges in this undertaking and how they were addressed. The harmonized phenotype data and associated documentation have been submitted to National Institutes of Health data repositories for controlled access by the scientific community. We also provide materials to facilitate future harmonization efforts by the community, which include 1) the software code used to generate the 63 harmonized phenotypes, enabling others to reproduce, modify, or extend these harmonizations to additional studies, and 2) the results of labeling thousands of phenotype variables with controlled vocabulary terms.


Subjects
Genetic Association Studies/methods; Phenomics/methods; Precision Medicine/methods; Data Aggregation; Humans; Information Dissemination; National Heart, Lung, and Blood Institute (U.S.); Phenotype; Program Evaluation; United States
7.
Hum Mol Genet ; 28(4): 675-687, 2019 02 15.
Article in English | MEDLINE | ID: mdl-30403821

ABSTRACT

Obstructive sleep apnea (OSA) is a common disorder associated with increased risk of cardiovascular disease and mortality. Its prevalence and severity vary across ancestral background. Although OSA traits are heritable, few genetic associations have been identified. To identify genetic regions associated with OSA and improve statistical power, we applied admixture mapping on three primary OSA traits [the apnea hypopnea index (AHI), overnight average oxyhemoglobin saturation (SaO2) and percentage time SaO2 < 90%] and a secondary trait (respiratory event duration) in a Hispanic/Latino American population study of 11 575 individuals with significant variation in ancestral background. Linear mixed models were performed using previously inferred African, European and Amerindian local genetic ancestry markers. Global African ancestry was associated with a lower AHI, higher SaO2 and shorter event duration. Admixture mapping analysis of the primary OSA traits identified local African ancestry at the chromosomal region 2q37 as genome-wide significantly associated with AHI (P < 5.7 × 10⁻⁵), and European and Amerindian ancestries at 18q21 suggestively associated with both AHI and percentage time SaO2 < 90% (P < 10⁻³). Follow-up joint ancestry-SNP association analyses identified novel variants in ferrochelatase (FECH), significantly associated with AHI and percentage time SaO2 < 90% after adjusting for multiple tests (P < 8 × 10⁻⁶). These signals contributed to the admixture mapping associations and were replicated in independent cohorts. In this first admixture mapping study of OSA, novel associations with variants in the iron/heme metabolism pathway suggest a role for iron in influencing respiratory traits underlying OSA.


Subjects
Ferrochelatase/genetics; Genome-Wide Association Study; Sleep Apnea, Obstructive/genetics; Aged; Chromosome Mapping; Female; Genotype; Hispanic or Latino/genetics; Humans; Male; Middle Aged; Polymorphism, Single Nucleotide/genetics; Polysomnography; Sleep Apnea, Obstructive/diagnostic imaging; Sleep Apnea, Obstructive/physiopathology; White People/genetics
8.
Am J Hum Genet ; 98(4): 653-66, 2016 Apr 07.
Article in English | MEDLINE | ID: mdl-27018471

ABSTRACT

Linear mixed models (LMMs) are widely used in genome-wide association studies (GWASs) to account for population structure and relatedness, for both continuous and binary traits. Motivated by the failure of LMMs to control type I errors in a GWAS of asthma, a binary trait, we show that LMMs are generally inappropriate for analyzing binary traits when population stratification leads to violation of the LMM's constant-residual variance assumption. To overcome this problem, we develop a computationally efficient logistic mixed model approach for genome-wide analysis of binary traits, the generalized linear mixed model association test (GMMAT). This approach fits a logistic mixed model once per GWAS and performs score tests under the null hypothesis of no association between a binary trait and individual genetic variants. We show in simulation studies and real data analysis that GMMAT effectively controls for population structure and relatedness when analyzing binary traits in a wide variety of study designs.


Subjects
Genetic Association Studies/methods; Genetics, Population/methods; Linear Models; Phenotype; Asthma/genetics; Case-Control Studies; Central America; Computer Simulation; Genotyping Techniques; Humans; Logistic Models; Models, Genetic; Phylogeography; Polymorphism, Single Nucleotide; South America
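
The abstract of entry 8 describes a two-step pattern: fit the null model once per GWAS, then run a score test for each variant. The Python sketch below illustrates only that pattern and is not the GMMAT implementation. It assumes an intercept-only logistic null model with no covariates and, importantly, no random effect for relatedness, which is the part GMMAT adds. The function names `fit_null_intercept_only` and `score_test` are hypothetical.

```python
import math


def fit_null_intercept_only(phenotypes):
    """Fit the null model once (assumption: intercept only, no random effect).

    With no covariates, the fitted case probability is simply the case fraction.
    Returns the fitted probability and the per-sample residuals y - p_hat.
    """
    n = len(phenotypes)
    p_hat = sum(phenotypes) / n
    if not 0.0 < p_hat < 1.0:
        raise ValueError("need at least one case and one control")
    residuals = [y - p_hat for y in phenotypes]
    return p_hat, residuals


def score_test(null_fit, genotypes):
    """Score test for one variant, reusing the residuals from the null fit."""
    p_hat, residuals = null_fit
    if len(genotypes) != len(residuals):
        raise ValueError("genotypes and phenotypes differ in length")
    g_mean = sum(genotypes) / len(genotypes)
    # Score: genotype-weighted sum of null-model residuals.
    score = sum(g * r for g, r in zip(genotypes, residuals))
    # Variance of the score after adjusting the genotype for the intercept.
    variance = p_hat * (1.0 - p_hat) * sum((g - g_mean) ** 2 for g in genotypes)
    if variance == 0.0:
        raise ValueError("monomorphic variant")
    statistic = score * score / variance
    # Upper tail of a chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return statistic, p_value


if __name__ == "__main__":
    null_fit = fit_null_intercept_only([0, 0, 1, 1])  # fitted once
    for name, genotypes in {"variant_a": [0, 0, 2, 2], "variant_b": [0, 2, 0, 2]}.items():
        print(name, score_test(null_fit, genotypes))
```

Because the null fit does not depend on the variant, the per-variant cost is a few sums over samples, which is why the score-test design scales to genome-wide analysis.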
9.
Am J Hum Genet ; 99(3): 636-646, 2016 Sep 01.
Article in English | MEDLINE | ID: mdl-27588450

ABSTRACT

We analyzed genome-wide association studies (GWASs), including data from 71,638 individuals from four ancestries, for estimated glomerular filtration rate (eGFR), a measure of kidney function used to define chronic kidney disease (CKD). We identified 20 loci attaining genome-wide-significant evidence of association (p < 5 × 10⁻⁸) with kidney function and highlighted that allelic effects on eGFR at lead SNPs are homogeneous across ancestries. We leveraged differences in the pattern of linkage disequilibrium between diverse populations to fine-map the 20 loci through construction of "credible sets" of variants driving eGFR association signals. Credible variants at the 20 eGFR loci were enriched for DNase I hypersensitivity sites (DHSs) in human kidney cells. DHS credible variants were expression quantitative trait loci for NFATC1 and RGS14 (at the SLC34A1 locus) in multiple tissues. Loss-of-function mutations in ancestral orthologs of both genes in Drosophila melanogaster were associated with altered sensitivity to salt stress. Renal mRNA expression of Nfatc1 and Rgs14 in a salt-sensitive mouse model was also reduced after exposure to a high-salt diet or induced CKD. Our study (1) demonstrates the utility of trans-ethnic fine mapping through integration of GWASs involving diverse populations with genomic annotation from relevant tissues to define molecular mechanisms by which association signals exert their effect and (2) suggests that salt sensitivity might be an important marker for biological processes that affect kidney function and CKD in humans.


Subjects
Ethnicity/genetics; Genome-Wide Association Study; Kidney/physiopathology; Renal Insufficiency, Chronic/genetics; Renal Insufficiency, Chronic/physiopathology; Sodium Chloride/pharmacology; Stress, Physiological/drug effects; Stress, Physiological/genetics; Alleles; Animals; Deoxyribonuclease I/metabolism; Diabetes Mellitus/genetics; Disease Models, Animal; Drosophila melanogaster/genetics; Female; Glomerular Filtration Rate/genetics; Humans; Kidney/pathology; Linkage Disequilibrium; Male; NFATC Transcription Factors/genetics; Polymorphism, Single Nucleotide/genetics; Quantitative Trait Loci; RGS Proteins/genetics; Racial Groups/genetics; Salt Tolerance/genetics; Sodium-Phosphate Cotransporter Proteins, Type IIa/genetics
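
The abstract of entry 9 mentions fine-mapping through "credible sets" but does not say how they were built. One common construction, shown here as an assumption rather than as the authors' method, ranks variants by posterior probability of driving the signal and keeps the smallest top-ranked group whose cumulative probability reaches a target coverage. The function name `credible_set`, the 0.99 default, and the variant identifiers are hypothetical.

```python
def credible_set(posteriors, coverage=0.99):
    """Return the smallest top-ranked set of variants reaching the coverage.

    posteriors: dict mapping variant ID -> non-negative weight (posterior
    probability or Bayes factor); weights are normalized to sum to 1 here.
    """
    total = sum(posteriors.values())
    if total <= 0:
        raise ValueError("posterior weights must sum to a positive value")
    # Rank variants from most to least likely to drive the signal.
    ranked = sorted(posteriors.items(), key=lambda item: item[1], reverse=True)
    selected = []
    cumulative = 0.0
    for variant_id, weight in ranked:
        selected.append(variant_id)
        cumulative += weight / total
        if cumulative >= coverage:
            break
    return selected


if __name__ == "__main__":
    # Hypothetical posterior probabilities for four variants at one locus.
    example = {"v1": 0.90, "v2": 0.08, "v3": 0.015, "v4": 0.005}
    print(credible_set(example, coverage=0.99))
```

A smaller credible set means the association signal has been localized to fewer candidate variants, which is the benefit the abstract attributes to combining populations with different linkage disequilibrium patterns.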
10.
Am J Hum Genet ; 98(1): 165-84, 2016 Jan 07.
Article in English | MEDLINE | ID: mdl-26748518

ABSTRACT

US Hispanic/Latino individuals are diverse in genetic ancestry, culture, and environmental exposures. Here, we characterized and controlled for this diversity in genome-wide association studies (GWASs) for the Hispanic Community Health Study/Study of Latinos (HCHS/SOL). We simultaneously estimated population-structure principal components (PCs) robust to familial relatedness and pairwise kinship coefficients (KCs) robust to population structure, admixture, and Hardy-Weinberg departures. The PCs revealed substantial genetic differentiation within and among six self-identified background groups (Cuban, Dominican, Puerto Rican, Mexican, and Central and South American). To control for variation among groups, we developed a multi-dimensional clustering method to define a "genetic-analysis group" variable that retains many properties of self-identified background while achieving substantially greater genetic homogeneity within groups and including participants with non-specific self-identification. In GWASs of 22 biomedical traits, we used a linear mixed model (LMM) including pairwise empirical KCs to account for familial relatedness, PCs for ancestry, and genetic-analysis groups for additional group-associated effects. Including the genetic-analysis group as a covariate accounted for significant trait variation in 8 of 22 traits, even after we fit 20 PCs. Additionally, genetic-analysis groups had significant heterogeneity of residual variance for 20 of 22 traits, and modeling this heteroscedasticity within the LMM reduced genomic inflation for 19 traits. Furthermore, fitting an LMM that utilized a genetic-analysis group rather than a self-identified background group achieved higher power to detect previously reported associations. We expect that the methods applied here will be useful in other studies with multiple ethnic groups, admixture, and relatedness.


Subjects
Genetic Variation; Hispanic or Latino/genetics; Genome-Wide Association Study; Humans; United States
11.
Am J Hum Genet ; 98(2): 229-42, 2016 Feb 04.
Article in English | MEDLINE | ID: mdl-26805783

ABSTRACT

Platelets play an essential role in hemostasis and thrombosis. We performed a genome-wide association study of platelet count in 12,491 participants of the Hispanic Community Health Study/Study of Latinos by using a mixed-model method that accounts for admixture and family relationships. We discovered and replicated associations with five genes (ACTN1, ETV7, GABBR1-MOG, MEF2C, and ZBTB9-BAK1). Our strongest association was with Amerindian-specific variant rs117672662 (p value = 1.16 × 10⁻²⁸) in ACTN1, a gene implicated in congenital macrothrombocytopenia. rs117672662 exhibited allelic differences in transcriptional activity and protein binding in hematopoietic cells. Our results underscore the value of diverse populations to extend insights into the allelic architecture of complex traits.


Subjects
Genetic Association Studies/methods; Genetic Loci; Hispanic or Latino/genetics; Platelet Count; Actinin/genetics; Adolescent; Adult; Aged; Alleles; Gene Frequency; Genotype; Genotyping Techniques; Humans; MEF2 Transcription Factors/genetics; Membrane Proteins/genetics; Middle Aged; Phenotype; Polymorphism, Single Nucleotide; Receptors, GABA-B/genetics; Young Adult
13.
Am J Respir Crit Care Med ; 198(2): 208-219, 2018 07 15.
Article in English | MEDLINE | ID: mdl-29394082

ABSTRACT

RATIONALE: Lung function and chronic obstructive pulmonary disease (COPD) are heritable traits. Genome-wide association studies (GWAS) have identified numerous pulmonary function and COPD loci, primarily in cohorts of European ancestry. OBJECTIVES: Perform a GWAS of COPD phenotypes in Hispanic/Latino populations to identify loci not previously detected in European populations. METHODS: GWAS of lung function and COPD in Hispanic/Latino participants from a population-based cohort. We performed replication studies of novel loci in independent studies. MEASUREMENTS AND MAIN RESULTS: Among 11,822 Hispanic/Latino participants, we identified eight novel signals; three replicated in independent populations of European Ancestry. A novel locus for FEV1 in ZSWIM7 (rs4791658; P = 4.99 × 10⁻⁹) replicated. A rare variant (minor allele frequency = 0.002) in HAL (rs145174011) was associated with FEV1/FVC (P = 9.59 × 10⁻⁹) in a region previously identified for COPD-related phenotypes; it remained significant in conditional analyses but did not replicate. Admixture mapping identified a novel region, with a variant in AGMO (rs41331850), associated with Amerindian ancestry and FEV1, which replicated. A novel locus for FEV1 identified among ever smokers (rs291231; P = 1.92 × 10⁻⁸) approached statistical significance for replication in admixed populations of African ancestry, and a novel SNP for COPD in PDZD2 (rs7709630; P = 1.56 × 10⁻⁸) regionally replicated. In addition, loci previously identified for lung function in European samples were associated in Hispanic/Latino participants in the Hispanic Community Health Study/Study of Latinos at the genome-wide significance level. CONCLUSIONS: We identified novel signals for lung function and COPD in a Hispanic/Latino cohort. Including admixed populations when performing genetic studies may identify variants contributing to genetic etiologies of COPD.


Subjects
Genetic Predisposition to Disease; Genome-Wide Association Study; Hispanic or Latino/genetics; Pulmonary Disease, Chronic Obstructive/genetics; White People/genetics; Adolescent; Adult; Aged; Cohort Studies; Europe; Female; Gene Frequency; Genetic Loci; Humans; Male; Middle Aged; Respiratory Function Tests; United States; Young Adult
14.
Am J Respir Cell Mol Biol ; 58(3): 391-401, 2018 03.
Article in English | MEDLINE | ID: mdl-29077507

ABSTRACT

Obstructive sleep apnea (OSA) is a common heritable disorder displaying marked sexual dimorphism in disease prevalence and progression. Previous genetic association studies have identified a few genetic loci associated with OSA and related quantitative traits, but they have only focused on single ethnic groups, and a large proportion of the heritability remains unexplained. The apnea-hypopnea index (AHI) is a commonly used quantitative measure characterizing OSA severity. Because OSA differs by sex, and the pathophysiology of obstructive events differ in rapid eye movement (REM) and non-REM (NREM) sleep, we hypothesized that additional genetic association signals would be identified by analyzing the NREM/REM-specific AHI and by conducting sex-specific analyses in multiethnic samples. We performed genome-wide association tests for up to 19,733 participants of African, Asian, European, and Hispanic/Latino American ancestry in 7 studies. We identified rs12936587 on chromosome 17 as a possible quantitative trait locus for NREM AHI in men (N = 6,737; P = 1.7 × 10⁻⁸) but not in women (P = 0.77). The association with NREM AHI was replicated in a physiological research study (N = 67; P = 0.047). This locus overlapping the RAI1 gene and encompassing genes PEMT1, SREBF1, and RASD1 was previously reported to be associated with coronary artery disease, lipid metabolism, and implicated in Potocki-Lupski syndrome and Smith-Magenis syndrome, which are characterized by abnormal sleep phenotypes. We also identified gene-by-sex interactions in suggestive association regions, suggesting that genetic variants for AHI appear to vary by sex, consistent with the clinical observations of strong sexual dimorphism.


Subjects
Genome-Wide Association Study; Quantitative Trait Loci/genetics; Sleep Apnea, Obstructive/genetics; Sleep, REM/physiology; Transcription Factors/genetics; Adult; Aged; Female; Humans; Male; Middle Aged; Phosphatidylethanolamine N-Methyltransferase/genetics; Sex Characteristics; Sterol Regulatory Element Binding Protein 1/genetics; Trans-Activators; ras Proteins/genetics
15.
Carcinogenesis ; 39(9): 1135-1140, 2018 09 21.
Article in English | MEDLINE | ID: mdl-29924316

ABSTRACT

To identify genetic variation associated with lung cancer risk, we performed a genome-wide association analysis of 685 lung cancer cases that had a family history of two or more first or second degree relatives compared with 744 controls without lung cancer that were genotyped on an Illumina Human OmniExpressExome-8v1 array. To ensure robust results, we further evaluated these findings using data from six additional studies that were assembled through the Transdisciplinary Research on Cancer of the Lung Consortium comprising 1993 familial cases and 33 690 controls. We performed a meta-analysis after imputation of all variants using the 1000 Genomes Project Phase 1 (version 3 release date September 2013). Analyses were conducted for 9 327 222 SNPs integrating data from the two sources. A novel variant on chromosome 4p15.31 near the LCORL gene and an imputed rare variant intergenic between CDKN2A and IFNA8 on chromosome 9p21.3 were identified at a genome-wide level of significance for squamous cell carcinomas. Additionally, associations of CHRNA3 and CHRNA5 on chromosome 15q25.1 in sporadic lung cancer were confirmed at a genome-wide level of significance in familial lung cancer. Previously identified variants in or near CHRNA2, BRCA2, CYP2A6 for overall lung cancer, TERT, SECISPB2L and RTEL1 for adenocarcinoma and RAD52 and MHC for squamous carcinoma were significantly associated with lung cancer.


Subjects
Adenocarcinoma/genetics; Carcinoma, Squamous Cell/genetics; Genetic Predisposition to Disease; Genome-Wide Association Study; Lung Neoplasms/epidemiology; Lung Neoplasms/genetics; Case-Control Studies; Chromosomes, Human, Pair 15/genetics; Chromosomes, Human, Pair 4; Chromosomes, Human, Pair 9/genetics; Humans; Lung/pathology; Medical History Taking; Polymorphism, Single Nucleotide/genetics
16.
Hum Mol Genet ; 25(15): 3245-3254, 2016 08 01.
Article in English | MEDLINE | ID: mdl-27346520

ABSTRACT

Imputation is commonly used in genome-wide association studies to expand the set of genetic variants available for analysis. Larger and more diverse reference panels, such as the final Phase 3 of the 1000 Genomes Project, hold promise for improving imputation accuracy in genetically diverse populations such as Hispanics/Latinos in the USA. Here, we sought to empirically evaluate imputation accuracy when imputing to a 1000 Genomes Phase 3 versus a Phase 1 reference, using participants from the Hispanic Community Health Study/Study of Latinos. Our assessments included calculating the correlation between imputed and observed allelic dosage in a subset of samples genotyped on a supplemental array. We observed that the Phase 3 reference yielded higher accuracy at rare variants, but that the two reference panels were comparable at common variants. At a sample level, the Phase 3 reference improved imputation accuracy in Hispanic/Latino samples from the Caribbean more than for Mainland samples, which we attribute primarily to the additional reference panel samples available in Phase 3. We conclude that a 1000 Genomes Project Phase 3 reference panel can yield improved imputation accuracy compared with Phase 1, particularly for rare variants and for samples of certain genetic ancestry compositions. Our findings can inform imputation design for other genome-wide association studies of participants with diverse ancestries, especially as larger and more diverse reference panels continue to become available.


Subjects
Genome-Wide Association Study; Hispanic or Latino/genetics; Human Genome Project; Female; Humans; Male; United States
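
The abstract of entry 16 assesses imputation accuracy as the correlation between imputed and observed allelic dosage, and contrasts rare with common variants. A minimal sketch of that kind of summary follows. It assumes the squared Pearson correlation as the accuracy metric and a minor allele frequency cutoff of 0.01 for "rare"; neither the metric's exact form nor the cutoff is stated in the abstract, and the function names are hypothetical.

```python
def dosage_r2(imputed, observed):
    """Squared Pearson correlation between imputed and observed dosages.

    Returns None when either vector has no variation, since the correlation
    is undefined in that case.
    """
    n = len(imputed)
    if n != len(observed) or n < 2:
        raise ValueError("need two equal-length vectors with at least 2 samples")
    mean_imputed = sum(imputed) / n
    mean_observed = sum(observed) / n
    covariance = sum((x - mean_imputed) * (y - mean_observed)
                     for x, y in zip(imputed, observed))
    var_imputed = sum((x - mean_imputed) ** 2 for x in imputed)
    var_observed = sum((y - mean_observed) ** 2 for y in observed)
    if var_imputed == 0 or var_observed == 0:
        return None
    return covariance * covariance / (var_imputed * var_observed)


def mean_r2_by_frequency(variants, rare_maf=0.01):
    """Average per-variant r2 within 'rare' and 'common' frequency bins.

    variants: iterable of (minor_allele_frequency, imputed, observed) tuples.
    rare_maf: assumed cutoff separating rare from common variants.
    """
    bins = {"rare": [], "common": []}
    for maf, imputed, observed in variants:
        r2 = dosage_r2(imputed, observed)
        if r2 is not None:
            bins["rare" if maf < rare_maf else "common"].append(r2)
    return {label: (sum(values) / len(values) if values else None)
            for label, values in bins.items()}


if __name__ == "__main__":
    # Hypothetical data: one well-imputed rare variant, one poorly imputed common variant.
    toy = [
        (0.005, [0.0, 0.0, 1.0, 0.0], [0, 0, 1, 0]),
        (0.300, [0.0, 2.0, 0.0, 2.0], [0, 0, 2, 2]),
    ]
    print(mean_r2_by_frequency(toy))
```

Running the same summary against two reference panels on the same held-out genotypes gives a direct comparison of the kind the abstract reports for Phase 1 versus Phase 3.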
17.
Bioinformatics ; 33(15): 2251-2257, 2017 Aug 01.
Article in English | MEDLINE | ID: mdl-28334390

ABSTRACT

MOTIVATION: Whole-genome sequencing (WGS) data are being generated at an unprecedented rate. Analysis of WGS data requires a flexible data format to store the different types of DNA variation. Variant call format (VCF) is a general text-based format developed to store variant genotypes and their annotations. However, VCF files are large and data retrieval is relatively slow. Here we introduce a new WGS variant data format implemented in the R/Bioconductor package 'SeqArray' for storing variant calls in an array-oriented manner which provides the same capabilities as VCF, but with multiple high compression options and data access using high-performance parallel computing. RESULTS: Benchmarks using 1000 Genomes Phase 3 data show file sizes are 14.0 Gb (VCF), 12.3 Gb (BCF, binary VCF), 3.5 Gb (BGT) and 2.6 Gb (SeqArray) respectively. Reading genotypes in the SeqArray package is two to three times faster compared with the htslib C library using BCF files. For the allele frequency calculation, the implementation in the SeqArray package is over 5 times faster than PLINK v1.9 with VCF and BCF files, and over 16 times faster than vcftools. When used in conjunction with R/Bioconductor packages, the SeqArray package provides users a flexible, feature-rich, high-performance programming environment for analysis of WGS variant data. AVAILABILITY AND IMPLEMENTATION: http://www.bioconductor.org/packages/SeqArray. CONTACT: zhengx@u.washington.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subjects
Data Compression/methods , Genetic Variation , Software , Whole Genome Sequencing/methods , Genome, Human , Genomics/methods , Humans
18.
Nicotine Tob Res ; 20(4): 448-457, 2018 03 06.
Article in English | MEDLINE | ID: mdl-28520984

ABSTRACT

Introduction: Genetic variants associated with nicotine dependence have previously been identified, primarily in European-ancestry populations. No genome-wide association studies (GWAS) have been reported for smoking behaviors in Hispanics/Latinos in the United States and Latin America, who are of mixed ancestry with European, African, and American Indigenous components. Methods: We examined genetic associations with smoking behaviors in the Hispanic Community Health Study/Study of Latinos (HCHS/SOL) (N = 12 741 with smoking data, 5119 ever-smokers), using ~2.3 million genotyped variants imputed to the 1000 Genomes Project phase 3. Mixed logistic regression models accounted for population structure, sampling, relatedness, sex, and age. Results: The known region of CHRNA5, which encodes the α5 cholinergic nicotinic receptor subunit, was associated with heavy smoking at genome-wide significance (p ≤ 5 × 10⁻⁸) in a comparison of 1929 ever-smokers reporting cigarettes per day (CPD) > 10 versus 3156 reporting CPD ≤ 10. The functional variant rs16969968 in CHRNA5 had a p value of 2.20 × 10⁻⁷ and odds ratio (OR) of 1.32 for the minor allele (A); its minor allele frequency was 0.22 overall and similar across Hispanic/Latino background groups (Central American = 0.17; South American = 0.19; Mexican = 0.18; Puerto Rican = 0.22; Cuban = 0.29; Dominican = 0.19). CHRNA4 on chromosome 20 attained p < 10⁻⁴, supporting prior findings in non-Hispanics. For nondaily smoking, which is prevalent in Hispanic/Latino smokers, compared to daily smoking, loci on chromosomes 2 and 4 achieved genome-wide significance; replication attempts were limited by small Hispanic/Latino sample sizes. Conclusions: Associations of nicotinic receptor gene variants with smoking, first reported in non-Hispanic European-ancestry populations, generalized to Hispanics/Latinos despite different patterns of smoking behavior.
Implications: We conducted the first large-scale genome-wide association study (GWAS) of smoking behavior in a US Hispanic/Latino cohort, and the first GWAS of daily/nondaily smoking in any population. Results show that the region of the nicotinic receptor subunit gene CHRNA5, which in non-Hispanic European-ancestry smokers has been associated with heavy smoking as well as cessation and treatment efficacy, is also significantly associated with heavy smoking in this Hispanic/Latino cohort. The results are an important addition to understanding the impact of genetic variants in understudied Hispanic/Latino smokers.


Subjects
Genome-Wide Association Study/methods , Hispanic or Latino/genetics , Nerve Tissue Proteins/genetics , Public Health/methods , Receptors, Nicotinic/genetics , Smoking/epidemiology , Smoking/genetics , Adult , Female , Gene Frequency , Genotype , Humans , Male , Middle Aged , United States/epidemiology
19.
J Am Soc Nephrol ; 28(7): 2211-2220, 2017 Jul.
Article in English | MEDLINE | ID: mdl-28137830

ABSTRACT

Increased urine albumin excretion is highly prevalent in Hispanics/Latinos. Previous studies have found an association between urine albumin excretion and Amerindian ancestry in Hispanic/Latino populations. Admixture between racial/ethnic groups creates long-range linkage disequilibrium between variants with different allelic frequencies in the founding populations and it can be used to localize genes. Hispanic/Latino genomes are an admixture of European, African, and Amerindian ancestries. We leveraged this admixture to identify associations between urine albumin excretion (urine albumin-to-creatinine ratio [UACR]) and genomic regions harboring variants with highly differentiated allele frequencies among the ancestral populations. Admixture mapping analysis of 12,212 Hispanic Community Health Study/Study of Latinos participants, using a linear mixed model, identified three novel genome-wide significant signals on chromosomes 2, 11, and 16. The admixture mapping signal identified on chromosome 2, spanning q11.2-14.1 and not previously reported for UACR, is driven by a difference between Amerindian ancestry and the other two ancestries (P < 5.7 × 10⁻⁵). Within this locus, two common variants located at the proapoptotic BCL2L11 gene associated with UACR: rs116907128 (allele frequency = 0.14; P = 1.5 × 10⁻⁷) and rs586283 (C allele frequency = 0.35; P = 4.2 × 10⁻⁷). In a secondary analysis, rs116907128 accounted for most of the admixture mapping signal observed in the region. The rs116907128 variant is common among full-heritage Pima Indians (A allele frequency = 0.54) but is monomorphic in the 1000 Genomes European and African populations. In a replication analysis using a sample of full-heritage Pima Indians, rs116907128 significantly associated with UACR (P = 0.01; n = 1568). Our findings provide evidence for the presence of Amerindian-specific variants influencing the variation of urine albumin excretion in Hispanics/Latinos.


Subjects
Albuminuria/genetics , Chromosome Mapping , Racial Groups/genetics , Black People/genetics , Female , Gene Frequency , Hispanic or Latino/genetics , Humans , Indians, North American/genetics , Male , Middle Aged , United States , White People
20.
J Am Soc Nephrol ; 28(3): 915-922, 2017 Mar.
Article in English | MEDLINE | ID: mdl-27650483

ABSTRACT

African ancestry alleles may contribute to CKD among Hispanics/Latinos, but whether associations differ by Hispanic/Latino background remains unknown. We examined the association of CKD measures with African ancestry-specific APOL1 alleles that were directly genotyped and sickle cell trait (hemoglobin subunit β gene [HBB] variant) on the basis of imputation in 12,226 adult Hispanics/Latinos grouped according to Caribbean or Mainland background. We also performed an unbiased genome-wide association scan of urine albumin-to-creatinine ratios. Overall, 41.4% of participants were male, 44.6% of participants had a Caribbean background, and the mean age of all participants was 46.1 years. The Caribbean background group, compared with the Mainland background group, had a higher frequency of two APOL1 alleles (1.0% versus 0.1%) and the HBB variant (2.0% versus 0.7%). In the Caribbean background group, presence of APOL1 alleles (2 versus 0/1 copies) or the HBB variant (1 versus 0 copies) was significantly associated with albuminuria (odds ratio [OR], 3.2; 95% confidence interval [95% CI], 1.7 to 6.1; and OR, 2.6; 95% CI, 1.8 to 3.8, respectively) and albuminuria and/or eGFR < 60 ml/min per 1.73 m² (OR, 2.9; 95% CI, 1.5 to 5.4; and OR, 2.4; 95% CI, 1.7 to 3.5, respectively). The urine albumin-to-creatinine ratio genome-wide association scan identified associations with the HBB variant among all participants, with the strongest association in the Caribbean background group (P = 3.1 × 10⁻¹⁰ versus P = 9.3 × 10⁻³ for the Mainland background group). In conclusion, African-specific alleles associate with CKD in Hispanics/Latinos, but allele frequency varies by Hispanic/Latino background/ancestry.


Subjects
Alleles , Black People/genetics , Hispanic or Latino/genetics , Renal Insufficiency, Chronic/epidemiology , Renal Insufficiency, Chronic/genetics , Female , Genome-Wide Association Study , Humans , Male , Middle Aged , Risk Factors