Results 1 - 20 of 335
1.
Nature ; 616(7958): 755-763, 2023 04.
Article in English | MEDLINE | ID: mdl-37046083

ABSTRACT

Mutations in a diverse set of driver genes increase the fitness of haematopoietic stem cells (HSCs), leading to clonal haematopoiesis [1]. These lesions are precursors for blood cancers [2-6], but the basis of their fitness advantage remains largely unknown, partly owing to a paucity of large cohorts in which the clonal expansion rate has been assessed by longitudinal sampling. Here, to circumvent this limitation, we developed a method to infer the expansion rate from data collected at a single time point. We applied this method to 5,071 people with clonal haematopoiesis. A genome-wide association study revealed that a common inherited polymorphism in the TCL1A promoter was associated with a slower expansion rate in clonal haematopoiesis overall, but the effect varied by driver gene. Those carrying this protective allele exhibited markedly reduced growth rates or prevalence of clones with driver mutations in TET2, ASXL1, SF3B1 and SRSF2, but this effect was not seen in clones with driver mutations in DNMT3A. TCL1A was not expressed in normal or DNMT3A-mutated HSCs, but the introduction of mutations in TET2 or ASXL1 led to the expression of TCL1A protein and the expansion of HSCs in vitro. The protective allele restricted TCL1A expression and expansion of mutant HSCs, as did experimental knockdown of TCL1A expression. Forced expression of TCL1A promoted the expansion of human HSCs in vitro and mouse HSCs in vivo. Our results indicate that the fitness advantage of several commonly mutated driver genes in clonal haematopoiesis may be mediated by TCL1A activation.


Subjects
Clonal Hematopoiesis , Hematopoietic Stem Cells , Animals , Humans , Mice , Alleles , Clonal Hematopoiesis/genetics , Genome-Wide Association Study , Hematopoiesis/genetics , Hematopoietic Stem Cells/cytology , Hematopoietic Stem Cells/metabolism , Mutation , Promoter Regions, Genetic
2.
Am J Hum Genet ; 110(10): 1704-1717, 2023 10 05.
Article in English | MEDLINE | ID: mdl-37802043

ABSTRACT

Long non-coding RNAs (lncRNAs) are known to perform important regulatory functions in lipid metabolism. Large-scale whole-genome sequencing (WGS) studies and new statistical methods for variant set tests now provide an opportunity to assess more associations between rare variants in lncRNA genes and complex traits across the genome. In this study, we used high-coverage WGS from 66,329 participants of diverse ancestries with measurement of blood lipids and lipoproteins (LDL-C, HDL-C, TC, and TG) in the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) program to investigate the role of lncRNAs in lipid variability. We aggregated rare variants for 165,375 lncRNA genes based on their genomic locations and conducted rare-variant aggregate association tests using the STAAR (variant-set test for association using annotation information) framework. We performed STAAR conditional analysis adjusting for common variants in known lipid GWAS loci and rare-coding variants in nearby protein-coding genes. Our analyses revealed 83 rare lncRNA variant sets significantly associated with blood lipid levels, all of which were located in known lipid GWAS loci (in a ±500-kb window of a Global Lipids Genetics Consortium index variant). Notably, 61 out of 83 signals (73%) were conditionally independent of common regulatory variation and rare protein-coding variation at the same loci. We replicated 34 out of 61 (56%) conditionally independent associations using the independent UK Biobank WGS data. Our results expand the genetic architecture of blood lipids to rare variants in lncRNAs.


Subjects
RNA, Long Noncoding , Humans , RNA, Long Noncoding/genetics , Genome-Wide Association Study , Precision Medicine , Whole Genome Sequencing/methods , Lipids/genetics , Polymorphism, Single Nucleotide/genetics
3.
Nature ; 586(7831): 763-768, 2020 10.
Article in English | MEDLINE | ID: mdl-33057201

ABSTRACT

Age is the dominant risk factor for most chronic human diseases, but the mechanisms through which ageing confers this risk are largely unknown [1]. The age-related acquisition of somatic mutations that lead to clonal expansion in regenerating haematopoietic stem cell populations has recently been associated with both haematological cancer [2-4] and coronary heart disease [5]; this phenomenon is termed clonal haematopoiesis of indeterminate potential (CHIP) [6]. Simultaneous analyses of germline and somatic whole-genome sequences provide the opportunity to identify root causes of CHIP. Here we analyse high-coverage whole-genome sequences from 97,691 participants of diverse ancestries in the National Heart, Lung, and Blood Institute Trans-omics for Precision Medicine (TOPMed) programme, and identify 4,229 individuals with CHIP. We identify associations with blood cell, lipid and inflammatory traits that are specific to different CHIP driver genes. Association of a genome-wide set of germline genetic variants enabled the identification of three genetic loci associated with CHIP status, including one locus at TET2 that was specific to individuals of African ancestry. In silico-informed in vitro evaluation of the TET2 germline locus enabled the identification of a causal variant that disrupts a TET2 distal enhancer, resulting in increased self-renewal of haematopoietic stem cells. Overall, we observe that germline genetic variation shapes haematopoietic stem cell function, leading to CHIP through mechanisms that are specific to clonal haematopoiesis as well as shared mechanisms that lead to somatic mutations across tissues.


Subjects
Clonal Hematopoiesis/genetics , Genetic Predisposition to Disease , Genome, Human/genetics , Whole Genome Sequencing , Adult , Africa/ethnology , Aged , Aged, 80 and over , Black People/genetics , Cell Self Renewal/genetics , DNA-Binding Proteins/genetics , Dioxygenases , Female , Germ-Line Mutation/genetics , Hematopoietic Stem Cells/cytology , Hematopoietic Stem Cells/metabolism , Humans , Intracellular Signaling Peptides and Proteins/genetics , Male , Middle Aged , National Heart, Lung, and Blood Institute (U.S.) , Phenotype , Precision Medicine , Proto-Oncogene Proteins/genetics , Tripartite Motif Proteins/genetics , United States , alpha Karyopherins/genetics
4.
Hum Mol Genet ; 32(6): 1048-1060, 2023 03 06.
Article in English | MEDLINE | ID: mdl-36444934

ABSTRACT

Diabetic kidney disease (DKD) is recognized as an important public health challenge. However, its genomic mechanisms are poorly understood. To identify rare variants for DKD, we conducted a whole-exome sequencing (WES) study leveraging large cohorts well-phenotyped for chronic kidney disease and diabetes. Our two-stage WES study included 4372 European and African ancestry participants from the Chronic Renal Insufficiency Cohort and Atherosclerosis Risk in Communities studies (stage 1) and 11 487 multi-ancestry Trans-Omics for Precision Medicine participants (stage 2). Generalized linear mixed models, which accounted for genetic relatedness and adjusted for age, sex and ancestry, were used to test associations between single variants and DKD. Gene-based aggregate rare variant analyses were conducted using an optimized sequence kernel association test implemented within our mixed model framework. We identified four novel exome-wide significant DKD-related loci through initiating diabetes. In single-variant analyses, participants carrying a rare, in-frame insertion in the DIS3L2 gene (rs141560952) exhibited a 193-fold increased odds [95% confidence interval (CI): 33.6, 1105] of DKD compared with noncarriers (P = 3.59 × 10^-9). Likewise, each copy of a low-frequency KRT6B splice-site variant (rs425827) conferred a 5.31-fold higher odds (95% CI: 3.06, 9.21) of DKD (P = 2.72 × 10^-9). Aggregate gene-based analyses further identified ERAP2 (P = 4.03 × 10^-8) and NPEPPS (P = 1.51 × 10^-7), which are both expressed in the kidney and implicated in renin-angiotensin-aldosterone system modulated immune response. In the largest WES study of DKD, we identified novel rare variant loci attaining exome-wide significance. These findings provide new insights into the molecular mechanisms underlying DKD.


Subjects
Diabetes Mellitus , Diabetic Nephropathies , Renal Insufficiency, Chronic , Humans , Aminopeptidases , Diabetic Nephropathies/genetics , Exome Sequencing , Kidney , Renal Insufficiency, Chronic/genetics
5.
Am J Hum Genet ; 109(5): 783-801, 2022 05 05.
Article in English | MEDLINE | ID: mdl-35334221

ABSTRACT

Integrative analysis of genome-wide association studies (GWASs) and gene expression studies in the form of a transcriptome-wide association study (TWAS) has the potential to better elucidate the molecular mechanisms underlying disease etiology. Here we present a method, METRO, that can leverage gene expression data collected from multiple genetic ancestries to enhance TWASs. METRO incorporates expression prediction models constructed in different genetic ancestries through a likelihood-based inference framework, producing calibrated p values with substantially improved TWAS power. We illustrate the benefits of METRO in both simulations and applications to seven complex traits and diseases obtained from four GWASs. These GWASs include two of primarily European ancestry (n = 188,577 and 339,226) and two of primarily African ancestry (n = 42,752 and 23,827). In the real data applications, we leverage gene expression data measured on 1,032 African Americans and 801 European Americans from the Genetic Epidemiology Network of Arteriopathy (GENOA) study to identify a substantially larger number of gene-trait associations as compared to existing TWAS approaches. The benefits of METRO are most prominent in applications to GWASs of African ancestry where the sample size is much smaller than GWASs of European ancestry and where a more powerful TWAS method is crucial. Among the identified associations are high-density lipoprotein-associated genes including PLTP and PPARG that are critical for maintaining lipid homeostasis and the type II diabetes-associated gene MAPT that supports microtubule-associated protein tau as a key component underlying impaired insulin secretion.


Subjects
Diabetes Mellitus, Type 2 , Genome-Wide Association Study , Diabetes Mellitus, Type 2/genetics , Genetic Predisposition to Disease , Genome-Wide Association Study/methods , Humans , Likelihood Functions , Polymorphism, Single Nucleotide/genetics , Quantitative Trait Loci/genetics , Transcriptome/genetics
6.
Brain ; 146(2): 492-506, 2023 02 13.
Article in English | MEDLINE | ID: mdl-35943854

ABSTRACT

Cerebral white matter hyperintensities on MRI are markers of cerebral small vessel disease, a major risk factor for dementia and stroke. Despite the successful identification of multiple genetic variants associated with this highly heritable condition, its genetic architecture remains incompletely understood. More specifically, the role of DNA methylation has received little attention. We investigated the association between white matter hyperintensity burden and DNA methylation in blood at ∼450 000 cytosine-phosphate-guanine (CpG) sites in 9732 middle-aged to older adults from 14 community-based studies. Single CpG and region-based association analyses were carried out. Functional annotation and integrative cross-omics analyses were performed to identify novel genes underlying the relationship between DNA methylation and white matter hyperintensities. We identified 12 single CpG and 46 region-based DNA methylation associations with white matter hyperintensity burden. Our top discovery single CpG, cg24202936 (P = 7.6 × 10^-8), was associated with F2 expression in blood (P = 6.4 × 10^-5) and co-localized with FOLH1 expression in brain (posterior probability = 0.75). Our top differentially methylated regions were in PRMT1 and in CCDC144NL-AS1, which were also represented in single CpG associations (cg17417856 and cg06809326, respectively). Through Mendelian randomization analyses cg06809326 was putatively associated with white matter hyperintensity burden (P = 0.03) and expression of CCDC144NL-AS1 possibly mediated this association. Differentially methylated region analysis, joint epigenetic association analysis and multi-omics co-localization analysis consistently identified a role of DNA methylation near SH3PXD2A, a locus previously identified in genome-wide association studies of white matter hyperintensities. Gene set enrichment analyses revealed functions of the identified DNA methylation loci in the blood-brain barrier and in the immune response. Integrative cross-omics analysis identified 19 key regulatory genes in two networks related to extracellular matrix organization, and lipid and lipoprotein metabolism. A drug-repositioning analysis indicated antihyperlipidaemic agents, more specifically peroxisome proliferator-activated receptor-alpha, as possible target drugs for white matter hyperintensities. Our epigenome-wide association study and integrative cross-omics analyses implicate novel genes influencing white matter hyperintensity burden, which converged on pathways related to the immune response and to a compromised blood-brain barrier possibly due to disrupted cell-cell and cell-extracellular matrix interactions. The results also suggest that antihyperlipidaemic therapy may contribute to lowering risk for white matter hyperintensities possibly through protection against blood-brain barrier disruption.


Subjects
White Matter , Middle Aged , Humans , Aged , White Matter/diagnostic imaging , Genome-Wide Association Study/methods , Brain/diagnostic imaging , DNA Methylation/genetics , Magnetic Resonance Imaging , Epigenesis, Genetic , Protein-Arginine N-Methyltransferases , Repressor Proteins
7.
Alzheimers Dement ; 2024 Jun 18.
Article in English | MEDLINE | ID: mdl-38889280

ABSTRACT

BACKGROUND: We investigated the effects of apolipoprotein E (APOE) ε4 and its interactions with sociodemographic characteristics on cognitive measures in South Asians from the Diagnostic Assessment of Dementia for the Longitudinal Aging Study of India (LASI-DAD). METHODS: Linear regression was used to assess the association between APOE ε4 and global- and domain-specific cognitive function in 2563 participants (mean age 69.6 ± 7.3 years; 53% female). Effect modification by age, sex, and education was explored using interaction terms and subgroup analyses. RESULTS: APOE ε4 was inversely associated with most cognitive measures (p < 0.05). This association was stronger with advancing age for the Hindi Mental State Examination (HMSE) score (βε4×age = -0.44, p = 0.03), orientation (βε4×age = -0.07, p = 0.01), and language/fluency (βε4×age = -0.07, p = 0.01), as well as in females for memory (βε4×male = 0.17, p = 0.02) and language/fluency (βε4×male = 0.12, p = 0.03). DISCUSSION: APOE ε4 is associated with lower cognitive function in South Asians from India, with a more pronounced impact observed in females and older individuals. HIGHLIGHTS: APOE ε4 carriers had lower global and domain-specific cognitive performance. Females and older individuals may be more susceptible to ε4 effects. For most cognitive measures, there was no interaction between ε4 and education.

8.
Hum Mol Genet ; 30(15): 1443-1456, 2021 07 09.
Article in English | MEDLINE | ID: mdl-33856023

ABSTRACT

Nonalcoholic fatty liver disease (NAFLD) is a leading cause of chronic liver disease and is highly correlated with metabolic disease. NAFLD results from environmental exposures acting on a susceptible polygenic background. This study performed the largest multiethnic investigation of exonic variation associated with NAFLD and correlated metabolic traits and diseases. An exome array meta-analysis was carried out among eight multiethnic population-based cohorts (n = 16 492) with computed tomography (CT) measured hepatic steatosis. A fixed effects meta-analysis identified five exome-wide significant loci (P < 5.30 × 10^-7), including a novel signal near TOMM40/APOE. Joint analysis of TOMM40/APOE variants revealed the TOMM40 signal was attributed to APOE rs429358-T; APOE rs7412 was not associated with liver attenuation. Moreover, rs429358-T was associated with higher serum alanine aminotransferase, liver steatosis, cirrhosis, triglycerides and obesity, as well as lower cholesterol and decreased risk of myocardial infarction and Alzheimer's disease (AD) in phenome-wide association analyses in the Michigan Genomics Initiative, United Kingdom Biobank and/or public datasets. These results implicate APOE in imaging-based identification of NAFLD. This association may or may not translate to nonalcoholic steatohepatitis; however, these results indicate a significant association with advanced liver disease and hepatic cirrhosis. These findings highlight allelic heterogeneity at the APOE locus and demonstrate an inverse link between NAFLD and AD at the exome level in the largest analysis to date.


Subjects
Apolipoproteins E/genetics , Non-alcoholic Fatty Liver Disease/genetics , Obesity/genetics , Alanine Transaminase , Alleles , Alzheimer Disease/genetics , Apolipoproteins E/metabolism , Databases, Genetic , Exome/genetics , Gene Frequency/genetics , Genome-Wide Association Study/methods , Humans , Liver , Liver Cirrhosis/genetics , Myocardial Infarction/genetics , Non-alcoholic Fatty Liver Disease/metabolism , Obesity/metabolism , Phenotype , Polymorphism, Single Nucleotide/genetics , Prognosis , Risk Factors , Triglycerides
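
Note on the fixed effects meta-analysis mentioned in the abstract above: the abstract does not say which estimator was used, so the following is only a minimal sketch of the standard inverse-variance-weighted form. The function name `fixed_effects_meta` and the per-cohort effect sizes and standard errors are hypothetical illustrations, not values from the study.

```python
import math


def fixed_effects_meta(betas, ses):
    """Pool per-cohort effect estimates with inverse-variance weights.

    betas: per-cohort effect estimates (hypothetical inputs)
    ses:   matching standard errors
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    weights = [1.0 / (se ** 2) for se in ses]      # w_i = 1 / se_i^2
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled_beta / pooled_se
    # Two-sided p-value under a standard normal null distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p


# Hypothetical effects for one variant in three cohorts (not study data).
cohort_betas = [0.10, 0.14, 0.08]
cohort_ses = [0.05, 0.04, 0.06]
pooled_beta, pooled_se, z, p = fixed_effects_meta(cohort_betas, cohort_ses)
print(pooled_beta, pooled_se, z, p)
```

A fixed effects model of this kind assumes one shared true effect across cohorts; a variant would be called exome-wide significant when its pooled p-value falls below the study's threshold.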
9.
Am J Hum Genet ; 106(4): 496-512, 2020 04 02.
Article in English | MEDLINE | ID: mdl-32220292

ABSTRACT

Most existing expression quantitative trait locus (eQTL) mapping studies have been focused on individuals of European ancestry and are underrepresented in other populations including populations with African ancestry. Lack of large-scale well-powered eQTL mapping studies in populations with African ancestry can both impede the dissemination of eQTL mapping results that would otherwise benefit individuals with African ancestry and hinder the comparable analysis for understanding how gene regulation is shaped through evolution. We fill this critical knowledge gap by performing a large-scale in-depth eQTL mapping study on 1,032 African Americans (AA) and 801 European Americans (EA) in the GENOA cohort. We identified a total of 354,931 eSNPs in AA and 371,309 eSNPs in EA, with 112,316 eSNPs overlapped between the two. We found that eQTL harboring genes (eGenes) are enriched in metabolic pathways and tend to have higher SNP heritability compared to non-eGenes. We found that eGenes that are common in the two populations tend to be less conserved than eGenes that are unique to one population, which are less conserved than non-eGenes. Through conditional analysis, we found that eGenes in AA tend to harbor more independent eQTLs than eGenes in EA, suggesting potentially diverse genetic architecture underlying expression variation in the two populations. Finally, the large sample sizes in GENOA allow us to construct accurate expression prediction models in both AA and EA, facilitating powerful transcriptome-wide association studies. Overall, our results represent an important step toward revealing the genetic architecture underlying expression variation in African Americans.


Subjects
Black or African American/genetics , Gene Expression Regulation/genetics , Quantitative Trait Loci/genetics , White People/genetics , Chromosome Mapping/methods , Cohort Studies , Female , Genetic Predisposition to Disease/genetics , Genome-Wide Association Study/methods , Humans , Male , Polymorphism, Single Nucleotide/genetics , Transcriptome/genetics
10.
Genet Med ; 25(1): 115-124, 2023 01.
Article in English | MEDLINE | ID: mdl-36371759

ABSTRACT

PURPOSE: Genetic researchers' selection of a database can have scientific, regulatory, and ethical implications. It is important to understand what is driving database selection such that database stewards can be responsive to user needs while balancing the interests of communities in equitably benefiting from advances. METHODS: We conducted 23 semistructured interviews with US academic genetic researchers working with private, government, and collaboratory data stewards to explore factors that they consider when selecting a genetic database. RESULTS: Interviewees used existing databases to avoid burdens of primary data collection, which was described as expensive and time-consuming. They highlighted ease of access as the most important selection factor, integrating concepts of familiarity and efficiency. Data features, such as size and available phenotype, were also important. Demographic diversity was not originally cited by any interviewee as a pivotal factor; when probed, most stated that the option to consider diversity in database selection was limited. Database features, including integrity, harmonization, and storage were also described as key components of efficient use. CONCLUSION: There is a growing market and competition between genetic data stewards. Data need to be accessible, harmonized, and administratively supported for their existence to be translated into use and, in turn, result in scientific advancements across diverse communities.


Subjects
Information Dissemination , Research Personnel , Humans
11.
Value Health ; 26(9): 1301-1307, 2023 09.
Article in English | MEDLINE | ID: mdl-36736697

ABSTRACT

OBJECTIVES: The aim of this study was to assess preferences for sharing of electronic health record (EHR) and genetic information separately and to examine whether there are different preferences for sharing these 2 types of information. METHODS: Using a population-based, nationally representative survey of the United States, we conducted a discrete choice experiment in which half of the subjects (N = 790) responded to questions about sharing of genetic information and the other half (N = 751) to questions about sharing of EHR information. Conditional logistic regression models assessed relative preferences across attribute levels of where patients learn about health information sharing, whether shared data are deidentified, whether data are commercialized, how long biospecimens are kept, and what the purpose of sharing the information is. RESULTS: Individuals had strong preferences to share deidentified (vs identified) data (odds ratio [OR] 3.26, 95% confidence interval 2.68-3.96) and to be able to opt out of sharing information with commercial companies (OR 4.26, 95% confidence interval 3.42-5.30). There were no significant differences regarding how long biospecimens are kept or why the data are being shared. Individuals had a stronger preference for opting out of sharing genetic (OR 4.26) versus EHR information (OR 2.64) (P = .002). CONCLUSIONS: Hospital systems and regulatory bodies should consider patient preferences for sharing of personal medical records or genetic information. For both genetic and EHR information, patients strongly prefer their data to be deidentified and to have the choice to opt out of sharing information with commercial companies.


Subjects
Confidentiality , Electronic Health Records , Humans , United States , Information Dissemination , Logistic Models , Data Collection
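
Note on the odds ratios reported in the abstract above: a logistic regression coefficient and its standard error map to an odds ratio and confidence interval through exponentiation. The sketch below assumes a Wald-type interval that is symmetric on the log-odds scale, which the abstract does not confirm; the helper names `odds_ratio_ci` and `se_from_ci` are hypothetical.

```python
import math

Z_95 = 1.959963984540054  # standard normal quantile for a two-sided 95% interval


def odds_ratio_ci(beta, se, z=Z_95):
    """Convert a log-odds coefficient and its standard error
    into (odds_ratio, lower, upper)."""
    return math.exp(beta), math.exp(beta - z * se), math.exp(beta + z * se)


def se_from_ci(lower, upper, z=Z_95):
    """Recover the log-scale standard error implied by a reported interval,
    assuming the interval is symmetric on the log scale."""
    return (math.log(upper) - math.log(lower)) / (2.0 * z)


# Reported in the abstract: OR 3.26, 95% confidence interval 2.68-3.96.
implied_se = se_from_ci(2.68, 3.96)
odds_ratio, lower, upper = odds_ratio_ci(math.log(3.26), implied_se)
print(implied_se, odds_ratio, lower, upper)
```

Rebuilding the interval from the implied standard error and comparing it with the printed bounds is a quick consistency check on a reported odds ratio; small differences are expected from rounding in the abstract.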
12.
Am J Hum Genet ; 104(2): 260-274, 2019 02 07.
Article in English | MEDLINE | ID: mdl-30639324

ABSTRACT

With advances in whole-genome sequencing (WGS) technology, more advanced statistical methods for testing genetic association with rare variants are being developed. Methods in which variants are grouped for analysis are also known as variant-set, gene-based, and aggregate unit tests. The burden test and sequence kernel association test (SKAT) are two widely used variant-set tests, which were originally developed for samples of unrelated individuals and later have been extended to family data with known pedigree structures. However, computationally efficient and powerful variant-set tests are needed to make analyses tractable in large-scale WGS studies with complex study samples. In this paper, we propose the variant-set mixed model association tests (SMMAT) for continuous and binary traits using the generalized linear mixed model framework. These tests can be applied to large-scale WGS studies involving samples with population structure and relatedness, such as in the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program. SMMATs share the same null model for different variant sets, and a virtue of this null model, which includes covariates only, is that it needs to be fit only once for all tests in each genome-wide analysis. Simulation studies show that all the proposed SMMATs correctly control type I error rates for both continuous and binary traits in the presence of population structure and relatedness. We also illustrate our tests in a real data example of analysis of plasma fibrinogen levels in the TOPMed program (n = 23,763), using the Analysis Commons, a cloud-based computing platform.


Subjects
Genetic Association Studies , Models, Genetic , Whole Genome Sequencing , Chromosomes, Human, Pair 4/genetics , Cloud Computing , Female , Fibrinogen/analysis , Fibrinogen/genetics , Genetics, Population , Humans , Male , National Heart, Lung, and Blood Institute (U.S.) , Precision Medicine , Research Design , Time Factors , United States
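
Note on the burden test mentioned in the abstract above: the core idea of a burden test is to collapse the rare variants in a set into one score per individual. The sketch below shows only that aggregation step for an unweighted burden score. It is not SMMAT and omits the mixed model that handles relatedness and population structure. The function names, the 0/1/2 dosage coding (assumed to count the minor allele), and the frequency threshold are all illustrative assumptions.

```python
def allele_frequency(dosages):
    """Allele frequency from 0/1/2 dosages, assumed to count the minor allele."""
    return sum(dosages) / (2.0 * len(dosages))


def burden_scores(genotypes, maf_threshold=0.01):
    """Collapse rare variants in one variant set into a per-individual score.

    genotypes: one list per individual, each holding a dosage per variant
    Returns (scores, rare_variant_indices).
    """
    n_variants = len(genotypes[0])
    # Transpose to per-variant columns so each frequency can be computed.
    columns = [[row[j] for row in genotypes] for j in range(n_variants)]
    # Keep polymorphic variants below the (hypothetical) frequency threshold.
    rare = [j for j, col in enumerate(columns)
            if 0.0 < allele_frequency(col) < maf_threshold]
    scores = [sum(row[j] for j in rare) for row in genotypes]
    return scores, rare


# Toy data: 4 individuals x 3 variants (not study data); the threshold is
# raised to 0.2 only so this tiny example contains a "rare" variant.
toy_genotypes = [
    [1, 1, 0],
    [0, 1, 0],
    [0, 2, 0],
    [0, 0, 0],
]
scores, rare = burden_scores(toy_genotypes, maf_threshold=0.2)
print(scores, rare)
```

In an association analysis the resulting score would enter a regression model as a single predictor for the trait; kernel tests such as SKAT instead test the variants jointly without collapsing them.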
14.
PLoS Genet ; 15(12): e1008500, 2019 12.
Article in English | MEDLINE | ID: mdl-31869403

ABSTRACT

Most genome-wide association and fine-mapping studies to date have been conducted in individuals of European descent, and genetic studies of populations of Hispanic/Latino and African ancestry are limited. In addition, these populations have more complex linkage disequilibrium structure. In order to better define the genetic architecture of these understudied populations, we leveraged >100,000 phased sequences available from deep-coverage whole genome sequencing through the multi-ethnic NHLBI Trans-Omics for Precision Medicine (TOPMed) program to impute genotypes into admixed African and Hispanic/Latino samples with genome-wide genotyping array data. We demonstrated that using TOPMed sequencing data as the imputation reference panel improves genotype imputation quality in these populations, which subsequently enhanced gene-mapping power for complex traits. For rare variants with minor allele frequency (MAF) < 0.5%, we observed a 2.3- to 6.1-fold increase in the number of well-imputed variants, with 11-34% improvement in average imputation quality, compared to the state-of-the-art 1000 Genomes Project Phase 3 and Haplotype Reference Consortium reference panels. Impressively, even for extremely rare variants with minor allele count <10 (including singletons) in the imputation target samples, average information content rescued was >86%. Subsequent association analyses of TOPMed reference panel-imputed genotype data with hematological traits (hemoglobin (HGB), hematocrit (HCT), and white blood cell count (WBC)) in ~21,600 African-ancestry and ~21,700 Hispanic/Latino individuals identified associations with two rare variants in the HBB gene (rs33930165 with higher WBC [p = 8.8 × 10^-15] in African populations, rs11549407 with lower HGB [p = 1.5 × 10^-12] and HCT [p = 8.8 × 10^-10] in Hispanics/Latinos). By comparison, neither variant would have been genome-wide significant if either 1000 Genomes Project Phase 3 or Haplotype Reference Consortium reference panels had been used for imputation. Our findings highlight the utility of the TOPMed imputation reference panel for identification of novel rare variant associations not previously detected in similarly sized genome-wide studies of under-represented African and Hispanic/Latino populations.


Subjects
Black or African American/genetics , Hispanic or Latino/genetics , Precision Medicine/methods , Whole Genome Sequencing/methods , beta-Globins/genetics , Adult , Aged , Aged, 80 and over , Computational Biology/methods , Databases, Genetic , Female , Gene Frequency , Genetic Predisposition to Disease , Genetics, Population , Genome-Wide Association Study , Genotyping Techniques , Humans , Linkage Disequilibrium , Male , Middle Aged , United States
15.
Am J Epidemiol ; 190(10): 1977-1992, 2021 10 01.
Article in English | MEDLINE | ID: mdl-33861317

ABSTRACT

Genotype-phenotype association studies often combine phenotype data from multiple studies to increase statistical power. Harmonization of the data usually requires substantial effort due to heterogeneity in phenotype definitions, study design, data collection procedures, and data-set organization. Here we describe a centralized system for phenotype harmonization that includes input from phenotype domain and study experts, quality control, documentation, reproducible results, and data-sharing mechanisms. This system was developed for the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program, which is generating genomic and other -omics data for more than 80 studies with extensive phenotype data. To date, 63 phenotypes have been harmonized across thousands of participants (recruited in 1948-2012) from up to 17 studies per phenotype. Here we discuss challenges in this undertaking and how they were addressed. The harmonized phenotype data and associated documentation have been submitted to National Institutes of Health data repositories for controlled access by the scientific community. We also provide materials to facilitate future harmonization efforts by the community, which include 1) the software code used to generate the 63 harmonized phenotypes, enabling others to reproduce, modify, or extend these harmonizations to additional studies, and 2) the results of labeling thousands of phenotype variables with controlled vocabulary terms.


Subjects
Genetic Association Studies/methods , Phenomics/methods , Precision Medicine/methods , Data Aggregation , Humans , Information Dissemination , National Heart, Lung, and Blood Institute (U.S.) , Phenotype , Program Evaluation , United States
16.
Stat Med ; 40(27): 6038-6056, 2021 11 30.
Article in English | MEDLINE | ID: mdl-34404112

ABSTRACT

We consider Bayesian high-dimensional mediation analysis to identify among a large set of correlated potential mediators the active ones that mediate the effect from an exposure variable to an outcome of interest. Correlations among mediators are commonly observed in modern data analysis; examples include the activated voxels within connected regions in brain image data, regulatory signals driven by gene networks in genome data, and correlated exposure data from the same source. When correlations are present among active mediators, mediation analysis that fails to account for such correlation can be suboptimal and may lead to a loss of power in identifying active mediators. Building upon a recent high-dimensional mediation analysis framework, we propose two Bayesian hierarchical models, one with a Gaussian mixture prior that enables correlated mediator selection and the other with a Potts mixture prior that accounts for the correlation among active mediators in mediation analysis. We develop efficient sampling algorithms for both methods. Various simulations demonstrate that our methods enable effective identification of correlated active mediators, which could be missed by using existing methods that assume prior independence among active mediators. The proposed methods are applied to the LIFECODES birth cohort and the Multi-Ethnic Study of Atherosclerosis (MESA) and identified new active mediators with important biological implications.


Subjects
Algorithms , Mediation Analysis , Bayes Theorem , Humans
17.
BMC Genomics ; 21(1): 476, 2020 Jul 11.
Article in English | MEDLINE | ID: mdl-32652930

ABSTRACT

BACKGROUND: Fitness epistasis, the interaction effect of genes at different loci on fitness, makes an important contribution to adaptive evolution. Although fitness interaction evidence has been observed in model organisms, it is more difficult to detect and remains poorly understood in human populations as a result of limited statistical power and experimental constraints. Fitness epistasis is inferred from non-independence between unlinked loci. We previously observed ancestral block correlation between chromosomes 4 and 6 in African Americans. The same approach fails when examining ancestral blocks on the same chromosome due to the strong confounding effect observed in a recently admixed population. RESULTS: We developed a novel approach to eliminate the bias caused by admixture linkage disequilibrium when searching for fitness epistasis on the same chromosome. We applied this approach in 16,252 unrelated African Americans and identified significant ancestral correlations in two pairs of genomic regions (P-value < 8.11 × 10⁻⁷) on chromosomes 1 and 10. The ancestral correlations were not explained by population admixture. Historical African-European crossover events are reduced between pairs of epistatic regions. We observed multiple pairs of co-expressed genes shared by the two regions on each chromosome, including ADAR being co-expressed with IFI44 in almost all tissues and DARC being co-expressed with VCAM1, S1PR1 and ELTD1 in multiple tissues in the Genotype-Tissue Expression (GTEx) data. Moreover, the co-expressed gene pairs are associated with the same diseases/traits in the GWAS Catalog, such as white blood cell count, blood pressure, lung function, inflammatory bowel disease and educational attainment. CONCLUSIONS: Our analyses revealed two instances of fitness epistasis on chromosomes 1 and 10, and the findings suggest a potential approach to improving our understanding of adaptive evolution.


Subjects
Epistasis, Genetic , Genetic Fitness , Genome-Wide Association Study/methods , Black or African American/genetics , Chromosomes, Human, Pair 1/genetics , Chromosomes, Human, Pair 10/genetics , Computer Simulation , Humans , Linkage Disequilibrium , Polymorphism, Single Nucleotide , Receptors, G-Protein-Coupled/genetics
18.
Am J Hum Genet ; 101(6): 888-902, 2017 Dec 07.
Article in English | MEDLINE | ID: mdl-29198723

ABSTRACT

Genome-wide association studies have identified hundreds of genetic variants associated with blood pressure (BP), but sequence variation accounts for a small fraction of the phenotypic variance. Epigenetic changes may alter the expression of genes involved in BP regulation and explain part of the missing heritability. We therefore conducted a two-stage meta-analysis of the cross-sectional associations of systolic and diastolic BP with blood-derived genome-wide DNA methylation measured on the Infinium HumanMethylation450 BeadChip in 17,010 individuals of European, African American, and Hispanic ancestry. Of 31 discovery-stage cytosine-phosphate-guanine (CpG) dinucleotides, 13 replicated after Bonferroni correction (discovery: N = 9,828, p < 1.0 × 10⁻⁷; replication: N = 7,182, p < 1.6 × 10⁻³). The replicated methylation sites are heritable (h² > 30%) and independent of known BP genetic variants, explaining an additional 1.4% and 2.0% of the interindividual variation in systolic and diastolic BP, respectively. Bidirectional Mendelian randomization among up to 4,513 individuals of European ancestry from 4 cohorts suggested that methylation at cg08035323 (TAF1B-YWHAQ) influences BP, while BP influences methylation at cg00533891 (ZMIZ1), cg00574958 (CPT1A), and cg02711608 (SLC1A5). Gene expression analyses further identified six genes (TSPAN2, SLC7A11, UNC93B1, CPT1A, PTMS, and LPCAT3) with evidence of triangular associations between methylation, gene expression, and BP. Additional integrative Mendelian randomization analyses of gene expression and DNA methylation suggested that the expression of TSPAN2 is a putative mediator of association between DNA methylation at cg23999170 and BP. These findings suggest that heritable DNA methylation plays a role in regulating BP independently of previously known genetic variants.
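
The sample sizes and the replication threshold quoted above can be checked with simple arithmetic. The sketch below assumes a family-wise error rate of 0.05, which the abstract does not state; the constant and function names are illustrative only.

```python
# Figures quoted in the abstract.
N_DISCOVERY = 9_828
N_REPLICATION = 7_182
N_CANDIDATE_CPGS = 31  # CpGs carried forward from the discovery stage

# Assumption: a conventional family-wise error rate (not stated in the abstract).
ALPHA = 0.05


def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Return the per-test significance threshold for n_tests comparisons."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


total_n = N_DISCOVERY + N_REPLICATION
replication_threshold = bonferroni_threshold(ALPHA, N_CANDIDATE_CPGS)

print(total_n)                         # combined sample size
print(f"{replication_threshold:.1e}")  # per-CpG replication threshold
```

Under that assumption, the two stages sum to the 17,010 individuals reported, and 0.05 divided by 31 tests rounds to the quoted replication threshold of 1.6 × 10⁻³.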


Subjects
Blood Pressure/genetics , DNA Methylation/genetics , Nerve Tissue Proteins/genetics , Tetraspanins/genetics , Aged , CpG Islands/genetics , Cross-Sectional Studies , Epigenesis, Genetic/genetics , Genetic Variation/genetics , Genome-Wide Association Study , Humans , Mendelian Randomization Analysis , Middle Aged , Quantitative Trait Loci/genetics
19.
Blood ; 132(17): 1842-1850, 2018 Oct 25.
Article in English | MEDLINE | ID: mdl-30042098

ABSTRACT

Many hemostatic factors are associated with age and age-related diseases; however, much remains unknown about the biological mechanisms linking aging and hemostatic factors. DNA methylation is a novel means by which to assess epigenetic aging, which is a measure of age and the aging processes as determined by altered epigenetic states. We used a meta-analysis approach to examine the association between measures of epigenetic aging and hemostatic factors, as well as a clotting time measure. For fibrinogen, we performed European and African ancestry-specific meta-analyses which were then combined via a random effects meta-analysis. For all other measures we could not estimate ancestry-specific effects and used a single fixed effects meta-analysis. We found that 1-year higher extrinsic epigenetic age as compared with chronological age was associated with higher fibrinogen (0.004 g/L/y; 95% confidence interval, 0.001-0.007; P = .01) and plasminogen activator inhibitor 1 (PAI-1; 0.13 U/mL/y; 95% confidence interval, 0.07-0.20; P = 6.6 × 10⁻⁵) concentrations, as well as lower activated partial thromboplastin time, a measure of clotting time. We replicated PAI-1 associations using an independent cohort. To further elucidate potential functional mechanisms, we associated epigenetic aging with expression levels of the PAI-1 protein encoding gene (SERPINE1) and the 3 fibrinogen subunit-encoding genes (FGA, FGG, and FGB) in both peripheral blood and aorta intima-media samples. We observed associations between accelerated epigenetic aging and transcription of FGG in both tissues. Collectively, our results indicate that accelerated epigenetic aging is associated with a procoagulation hemostatic profile, and that epigenetic aging may regulate hemostasis in part via gene transcription.
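
For readers unfamiliar with the fixed effects meta-analysis mentioned above, a minimal sketch of the usual inverse-variance weighted estimator follows. The abstract does not say which estimator or software the authors used, so this is an assumption; the function name and the per-cohort numbers are hypothetical and are not study data.

```python
import math


def fixed_effect_meta(estimates, std_errors):
    """Pool per-cohort effect estimates with inverse-variance weights.

    Returns the pooled estimate, its standard error, and a 95% confidence
    interval based on the normal approximation.
    """
    if not estimates or len(estimates) != len(std_errors):
        raise ValueError("need matching, non-empty estimates and std_errors")
    weights = [1.0 / se ** 2 for se in std_errors]
    total_weight = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, estimates)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    half_width = 1.96 * pooled_se
    return pooled, pooled_se, (pooled - half_width, pooled + half_width)


# Hypothetical per-cohort slopes and standard errors (illustration only).
betas = [0.10, 0.16, 0.13]
ses = [0.05, 0.05, 0.05]

pooled, pooled_se, ci = fixed_effect_meta(betas, ses)
print(pooled, pooled_se, ci)
```

A fixed effects model assumes every cohort estimates the same underlying effect, so cohorts are weighted only by their precision. A random effects model, as used for the fibrinogen analysis, additionally allows the true effect to vary between groups.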


Subjects
Aging/pathology , Aging/physiology , DNA Methylation , Hemostasis/physiology , Epigenesis, Genetic/physiology , Humans
20.
J Nutr ; 150(10): 2635-2645, 2020 Oct 12.
Article in English | MEDLINE | ID: mdl-32840624

ABSTRACT

BACKGROUND: Excess sodium intake and insufficient potassium intake are risk factors for hypertension, but there is limited knowledge regarding genetic factors that influence intake. Twenty-four-hour or half-day urine samples provide robust estimates of sodium and potassium intake, outperforming other measures such as spot urine samples and dietary self-reporting. OBJECTIVE: The aim of this study was to investigate genomic regions associated with sodium intake, potassium intake, and sodium-to-potassium ratio measured from 24-h or half-day urine samples. METHODS: Using samples of European ancestry (mean age: 54.2 y; 52.3% women), we conducted a meta-analysis of genome-wide association studies in 4 cohorts with 24-h or half-day urine samples (n = 6,519), followed by gene-based analysis. Suggestive loci (P < 10⁻⁶) were examined in additional European (n = 844), African (n = 1,246), and Asian (n = 2,475) ancestry samples. RESULTS: We found suggestive loci (P < 10⁻⁶) for all 3 traits, including 7 for 24-h sodium excretion, 4 for 24-h potassium excretion, and 4 for sodium-to-potassium ratio. The most significant locus was rs77958157 near cocaine- and amphetamine-regulated transcript prepropeptide (CARTPT), a gene involved in eating behavior and appetite regulation (P = 2.3 × 10⁻⁸ with sodium-to-potassium ratio). Two suggestive loci were replicated in additional samples: for sodium excretion, rs12094702 near zinc finger SWIM-type containing 5 (ZSWIM5) was replicated in the Asian ancestry sample reaching Bonferroni-corrected significance (P = 0.007), and for potassium excretion rs34473523 near sodium leak channel (NALCN) was associated at a nominal P value with potassium excretion both in European (P = 0.043) and African (P = 0.043) ancestry cohorts. Gene-based tests identified 1 significant gene for sodium excretion, CDC42 small effector 1 (CDC42SE1), which is associated with blood pressure regulation. CONCLUSIONS: We identified multiple suggestive loci for sodium and potassium intake near genes associated with eating behavior, nervous system development and function, and blood pressure regulation in individuals of European ancestry. Further research is needed to replicate these findings and to provide insight into the underlying genetic mechanisms by which these genomic regions influence sodium and potassium intake.


Subjects
Feeding Behavior , Genome-Wide Association Study , Potassium, Dietary/administration & dosage , Sodium, Dietary/administration & dosage , White People/genetics , Adult , Aged , Diet , Female , Genotype , Humans , Male , Middle Aged , Potassium/metabolism , Potassium/urine , Sodium/metabolism , Sodium/urine