Results 1 - 20 of 29
1.
Nature ; 616(7958): 755-763, 2023 04.
Article in English | MEDLINE | ID: mdl-37046083

ABSTRACT

Mutations in a diverse set of driver genes increase the fitness of haematopoietic stem cells (HSCs), leading to clonal haematopoiesis [1]. These lesions are precursors for blood cancers [2-6], but the basis of their fitness advantage remains largely unknown, partly owing to a paucity of large cohorts in which the clonal expansion rate has been assessed by longitudinal sampling. Here, to circumvent this limitation, we developed a method to infer the expansion rate from data from a single time point. We applied this method to 5,071 people with clonal haematopoiesis. A genome-wide association study revealed that a common inherited polymorphism in the TCL1A promoter was associated with a slower expansion rate in clonal haematopoiesis overall, but the effect varied by driver gene. Those carrying this protective allele exhibited markedly reduced growth rates or prevalence of clones with driver mutations in TET2, ASXL1, SF3B1 and SRSF2, but this effect was not seen in clones with driver mutations in DNMT3A. TCL1A was not expressed in normal or DNMT3A-mutated HSCs, but the introduction of mutations in TET2 or ASXL1 led to the expression of TCL1A protein and the expansion of HSCs in vitro. The protective allele restricted TCL1A expression and expansion of mutant HSCs, as did experimental knockdown of TCL1A expression. Forced expression of TCL1A promoted the expansion of human HSCs in vitro and mouse HSCs in vivo. Our results indicate that the fitness advantage of several commonly mutated driver genes in clonal haematopoiesis may be mediated by TCL1A activation.


Subjects
Clonal Hematopoiesis , Hematopoietic Stem Cells , Animals , Humans , Mice , Alleles , Clonal Hematopoiesis/genetics , Genome-Wide Association Study , Hematopoiesis/genetics , Hematopoietic Stem Cells/cytology , Hematopoietic Stem Cells/metabolism , Mutation , Promoter Regions, Genetic
2.
Am J Hum Genet ; 109(5): 857-870, 2022 05 05.
Article in English | MEDLINE | ID: mdl-35385699

ABSTRACT

While polygenic risk scores (PRSs) enable early identification of genetic risk for chronic obstructive pulmonary disease (COPD), predictive performance is limited when the discovery and target populations are not well matched. Hypothesizing that the biological mechanisms of disease are shared across ancestry groups, we introduce a PrediXcan-derived polygenic transcriptome risk score (PTRS) to improve cross-ethnic portability of risk prediction. We constructed the PTRS using summary statistics from application of PrediXcan on large-scale GWASs of lung function (forced expiratory volume in 1 s [FEV1] and its ratio to forced vital capacity [FEV1/FVC]) in the UK Biobank. We examined prediction performance and cross-ethnic portability of PTRS through smoking-stratified analyses both on 29,381 multi-ethnic participants from TOPMed population/family-based cohorts and on 11,771 multi-ethnic participants from TOPMed COPD-enriched studies. Analyses were carried out for two dichotomous COPD traits (moderate-to-severe and severe COPD) and two quantitative lung function traits (FEV1 and FEV1/FVC). While the proposed PTRS showed weaker associations with disease than PRS for European ancestry, the PTRS showed stronger association with COPD than PRS for African Americans (e.g., odds ratio [OR] = 1.24 [95% confidence interval [CI]: 1.08-1.43] for PTRS versus 1.10 [0.96-1.26] for PRS among heavy smokers with ≥ 40 pack-years of smoking) for moderate-to-severe COPD. Cross-ethnic portability of the PTRS was significantly higher than the PRS (paired t test p < 2.2 × 10^-16 with portability gains ranging from 5% to 28%) for both dichotomous COPD traits and across all smoking strata. Our study demonstrates the value of PTRS for improved cross-ethnic portability compared to PRS in predicting COPD risk.
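
As a rough illustration of the kind of score this abstract describes, the minimal sketch below assumes that a transcriptome risk score is a weighted sum of genetically predicted transcript levels, where each predicted level is itself a weighted sum of genotype dosages. The function names, SNP identifiers, gene names, and every weight are hypothetical placeholders, not code or values from the study.

```python
def predicted_expression(dosages, eqtl_weights):
    """Predict one gene's expression as a weighted sum of SNP dosages.

    dosages:      {snp_id: allele dosage in [0, 2]} for one person
    eqtl_weights: {snp_id: weight} from a (hypothetical) expression model
    SNPs missing from the person's genotypes contribute 0.
    """
    return sum(w * dosages.get(snp, 0.0) for snp, w in eqtl_weights.items())


def transcriptome_risk_score(dosages, gene_models, gene_effects):
    """Sum predicted expression over genes, weighted by gene-level effects.

    gene_models:  {gene: {snp_id: weight}}
    gene_effects: {gene: assumed effect of predicted expression on the trait}
    """
    return sum(
        gene_effects[gene] * predicted_expression(dosages, weights)
        for gene, weights in gene_models.items()
    )


# Hypothetical toy inputs, for illustration only.
person = {"rsA": 2.0, "rsB": 1.0}
models = {"GENE1": {"rsA": 0.5}, "GENE2": {"rsB": -1.0, "rsC": 0.3}}
effects = {"GENE1": 0.2, "GENE2": 0.1}
score = transcriptome_risk_score(person, models, effects)
```

Because the score is built on predicted expression rather than directly on SNP effects, the gene-level weights can in principle be reused with a different expression model for another ancestry group, which is the portability idea the abstract puts forward.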


Subjects
Pulmonary Disease, Chronic Obstructive , Transcriptome , Humans , Lung , National Heart, Lung, and Blood Institute (U.S.) , Pulmonary Disease, Chronic Obstructive/genetics , Risk Factors , United States/epidemiology
3.
Am J Epidemiol ; 190(10): 1977-1992, 2021 10 01.
Article in English | MEDLINE | ID: mdl-33861317

ABSTRACT

Genotype-phenotype association studies often combine phenotype data from multiple studies to increase statistical power. Harmonization of the data usually requires substantial effort due to heterogeneity in phenotype definitions, study design, data collection procedures, and data-set organization. Here we describe a centralized system for phenotype harmonization that includes input from phenotype domain and study experts, quality control, documentation, reproducible results, and data-sharing mechanisms. This system was developed for the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program, which is generating genomic and other -omics data for more than 80 studies with extensive phenotype data. To date, 63 phenotypes have been harmonized across thousands of participants (recruited in 1948-2012) from up to 17 studies per phenotype. Here we discuss challenges in this undertaking and how they were addressed. The harmonized phenotype data and associated documentation have been submitted to National Institutes of Health data repositories for controlled access by the scientific community. We also provide materials to facilitate future harmonization efforts by the community, which include 1) the software code used to generate the 63 harmonized phenotypes, enabling others to reproduce, modify, or extend these harmonizations to additional studies, and 2) the results of labeling thousands of phenotype variables with controlled vocabulary terms.


Subjects
Genetic Association Studies/methods , Phenomics/methods , Precision Medicine/methods , Data Aggregation , Humans , Information Dissemination , National Heart, Lung, and Blood Institute (U.S.) , Phenotype , Program Evaluation , United States
4.
Hum Mol Genet ; 28(6): 1038-1051, 2019 03 15.
Article in English | MEDLINE | ID: mdl-30452639

ABSTRACT

Orofacial clefts are common developmental disorders that pose significant clinical, economical and psychological problems. We conducted genome-wide association analyses for cleft palate only (CPO) and cleft lip with or without palate (CL/P) with ~17 million markers in sub-Saharan Africans. After replication and combined analyses, we identified novel loci for CPO at or near genome-wide significance on chromosomes 2 (near CTNNA2) and 19 (near SULT2A1). In situ hybridization of Sult2a1 in mice showed expression of SULT2A1 in mesenchymal cells in palate, palatal rugae and palatal epithelium in the fused palate. The previously reported 8q24 was the most significant locus for CL/P in our study, and we replicated several previously reported loci including PAX7 and VAX1.


Subjects
Black People/genetics , Cleft Palate/genetics , Genetics, Population , Genome, Human , Genomics , Quantitative Trait Loci , Alleles , Animals , Chromosome Mapping , Disease Models, Animal , Enhancer Elements, Genetic , Female , Gene Expression , Gene Frequency , Genetic Predisposition to Disease , Genome-Wide Association Study , Genomics/methods , Genotype , Humans , Male , Mice , Odds Ratio , Polymorphism, Single Nucleotide
5.
Genet Epidemiol ; 43(3): 263-275, 2019 04.
Article in English | MEDLINE | ID: mdl-30653739

ABSTRACT

When testing genotype-phenotype associations using linear regression, departure of the trait distribution from normality can impact both Type I error rate control and statistical power, with worse consequences for rarer variants. Because genotypes are expected to have small effects (if any), investigators now routinely use a two-stage method, in which they first regress the trait on covariates, obtain residuals, rank-normalize them, and then use the rank-normalized residuals in association analysis with the genotypes. Potential confounding signals are assumed to be removed at the first stage, so in practice, no further adjustment is done in the second stage. Here, we show that this widely used approach can lead to tests with undesirable statistical properties, due to the combination of a mis-specified mean-variance relationship and remaining covariate associations between the rank-normalized residuals and genotypes. We demonstrate these properties theoretically, and also in applications to genome-wide and whole-genome sequencing association studies. We further propose and evaluate an alternative fully adjusted two-stage approach that adjusts for covariates both when residuals are obtained and in the subsequent association test. This method can reduce excess Type I errors and improve statistical power.
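
The fully adjusted two-stage procedure outlined in this abstract can be sketched in a few lines. This is only an illustration under stated assumptions: the rank-normalization uses the offset (rank - 0.5) / n, ties are not handled, and the helper names `ols`, `rank_normalize`, and `fully_adjusted_two_stage` are hypothetical rather than taken from the authors' software.

```python
from statistics import NormalDist


def ols(y, columns):
    """Least-squares fit of y on an intercept plus the given columns.

    Solves the normal equations by Gauss-Jordan elimination with partial
    pivoting and returns (coefficients, residuals).
    """
    n = len(y)
    X = [[1.0] + [col[i] for col in columns] for i in range(n)]
    p = len(X[0])
    # Augmented matrix [X'X | X'y]
    A = [
        [sum(X[i][a] * X[i][b] for i in range(n)) for b in range(p)]
        + [sum(X[i][a] * y[i] for i in range(n))]
        for a in range(p)
    ]
    for c in range(p):
        piv = max(range(c, p), key=lambda r: abs(A[r][c]))
        A[c], A[piv] = A[piv], A[c]
        for r in range(p):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [u - f * v for u, v in zip(A[r], A[c])]
    beta = [A[a][p] / A[a][a] for a in range(p)]
    resid = [y[i] - sum(b * x for b, x in zip(beta, X[i])) for i in range(n)]
    return beta, resid


def rank_normalize(values):
    """Rank-based inverse normal transform (assumed offset 0.5, no ties)."""
    n = len(values)
    order = sorted(range(n), key=values.__getitem__)
    nd = NormalDist()
    out = [0.0] * n
    for rank, idx in enumerate(order, start=1):
        out[idx] = nd.inv_cdf((rank - 0.5) / n)
    return out


def fully_adjusted_two_stage(trait, covariates, genotype):
    """Return the genotype coefficient from the fully adjusted approach."""
    # Stage 1: regress the trait on covariates and rank-normalize residuals.
    _, resid = ols(trait, covariates)
    z = rank_normalize(resid)
    # Stage 2: regress the transformed residuals on genotype AND covariates.
    beta, _ = ols(z, [genotype] + covariates)
    return beta[1]
```

The only difference from the conventional two-stage method is the second call to `ols`, which passes `[genotype] + covariates` instead of `[genotype]` alone.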


Subjects
Genetic Association Studies , Models, Genetic , Computer Simulation , Genome-Wide Association Study , Genotype , Hemoglobins/metabolism , Hispanic or Latino , Humans , Linear Models , Phenotype
6.
Hum Mol Genet ; 26(6): 1193-1204, 2017 03 15.
Article in English | MEDLINE | ID: mdl-28158719

ABSTRACT

Circulating white blood cell (WBC) counts (neutrophils, monocytes, lymphocytes, eosinophils, basophils) differ by ethnicity. The genetic factors underlying basal WBC traits in Hispanics/Latinos are unknown. We performed a genome-wide association study of total WBC and differential counts in a large, ethnically diverse US population sample of Hispanics/Latinos ascertained by the Hispanic Community Health Study and Study of Latinos (HCHS/SOL). We demonstrate that several previously known WBC-associated genetic loci (e.g. the African Duffy antigen receptor for chemokines null variant for neutrophil count) are generalizable to WBC traits in Hispanics/Latinos. We identified and replicated common and rare germ-line variants at FLT3 (a gene often somatically mutated in leukemia) associated with monocyte count. The common FLT3 variant rs76428106 has a large allele frequency differential between African and non-African populations. We also identified several novel genetic loci involving or regulating hematopoietic transcription factors (CEBPE-SLC7A7, CEBPA and CRBN-TRNT1) associated with basophil count. The minor allele of the CEBPE variant associated with lower basophil count has been previously associated with Amerindian ancestry and higher risk of acute lymphoblastic leukemia in Hispanics. Together, these data suggest that germline genetic variation affecting transcriptional and signaling pathways that underlie WBC development and lineage specification can contribute to inter-individual as well as ethnic differences in peripheral blood cell counts (normal hematopoiesis) in addition to susceptibility to leukemia (malignant hematopoiesis).


Subjects
CCAAT-Enhancer-Binding Proteins/genetics , Genome-Wide Association Study , Leukocyte Count , fms-Like Tyrosine Kinase 3/genetics , Black or African American/genetics , Basophils/cytology , Female , Gene Frequency , Hispanic or Latino/genetics , Humans , Lymphocytes/cytology , Male , Monocytes/cytology , Neutrophils/cytology , United States/epidemiology , White People/genetics
7.
Am J Hum Genet ; 98(1): 165-84, 2016 Jan 07.
Article in English | MEDLINE | ID: mdl-26748518

ABSTRACT

US Hispanic/Latino individuals are diverse in genetic ancestry, culture, and environmental exposures. Here, we characterized and controlled for this diversity in genome-wide association studies (GWASs) for the Hispanic Community Health Study/Study of Latinos (HCHS/SOL). We simultaneously estimated population-structure principal components (PCs) robust to familial relatedness and pairwise kinship coefficients (KCs) robust to population structure, admixture, and Hardy-Weinberg departures. The PCs revealed substantial genetic differentiation within and among six self-identified background groups (Cuban, Dominican, Puerto Rican, Mexican, and Central and South American). To control for variation among groups, we developed a multi-dimensional clustering method to define a "genetic-analysis group" variable that retains many properties of self-identified background while achieving substantially greater genetic homogeneity within groups and including participants with non-specific self-identification. In GWASs of 22 biomedical traits, we used a linear mixed model (LMM) including pairwise empirical KCs to account for familial relatedness, PCs for ancestry, and genetic-analysis groups for additional group-associated effects. Including the genetic-analysis group as a covariate accounted for significant trait variation in 8 of 22 traits, even after we fit 20 PCs. Additionally, genetic-analysis groups had significant heterogeneity of residual variance for 20 of 22 traits, and modeling this heteroscedasticity within the LMM reduced genomic inflation for 19 traits. Furthermore, fitting an LMM that utilized a genetic-analysis group rather than a self-identified background group achieved higher power to detect previously reported associations. We expect that the methods applied here will be useful in other studies with multiple ethnic groups, admixture, and relatedness.


Subjects
Genetic Variation , Hispanic or Latino/genetics , Genome-Wide Association Study , Humans , United States
8.
Am J Hum Genet ; 98(2): 229-42, 2016 Feb 04.
Article in English | MEDLINE | ID: mdl-26805783

ABSTRACT

Platelets play an essential role in hemostasis and thrombosis. We performed a genome-wide association study of platelet count in 12,491 participants of the Hispanic Community Health Study/Study of Latinos by using a mixed-model method that accounts for admixture and family relationships. We discovered and replicated associations with five genes (ACTN1, ETV7, GABBR1-MOG, MEF2C, and ZBTB9-BAK1). Our strongest association was with Amerindian-specific variant rs117672662 (p value = 1.16 × 10^-28) in ACTN1, a gene implicated in congenital macrothrombocytopenia. rs117672662 exhibited allelic differences in transcriptional activity and protein binding in hematopoietic cells. Our results underscore the value of diverse populations to extend insights into the allelic architecture of complex traits.


Subjects
Genetic Association Studies/methods , Genetic Loci , Hispanic or Latino/genetics , Platelet Count , Actinin/genetics , Adolescent , Adult , Aged , Alleles , Gene Frequency , Genotype , Genotyping Techniques , Humans , MEF2 Transcription Factors/genetics , Membrane Proteins/genetics , Middle Aged , Phenotype , Polymorphism, Single Nucleotide , Receptors, GABA-B/genetics , Young Adult
9.
Am J Hum Genet ; 98(4): 744-54, 2016 Apr 07.
Article in English | MEDLINE | ID: mdl-27018472

ABSTRACT

Cleft palate (CP) is a common birth defect occurring in 1 in 2,500 live births. Approximately half of infants with CP have a syndromic form, exhibiting other physical and cognitive disabilities. The other half have nonsyndromic CP, and to date, few genes associated with risk for nonsyndromic CP have been characterized. To identify such risk factors, we performed a genome-wide association study of this disorder. We discovered a genome-wide significant association with a missense variant in GRHL3 (p.Thr454Met [c.1361C>T]; rs41268753; p = 4.08 × 10^-9) and replicated the result in an independent sample of case and control subjects. In both the discovery and replication samples, rs41268753 conferred increased risk for CP (OR = 8.3, 95% CI 4.1-16.8; OR = 2.16, 95% CI 1.43-3.27, respectively). In luciferase transactivation assays, p.Thr454Met had about one-third of the activity of wild-type GRHL3, and in zebrafish embryos, perturbed periderm development. We conclude that this mutation is an etiologic variant for nonsyndromic CP and is one of few functional variants identified to date for nonsyndromic orofacial clefting. This finding advances our understanding of the genetic basis of craniofacial development and might ultimately lead to improvements in recurrence risk prediction, treatment, and prognosis.


Subjects
Cleft Palate/genetics , DNA-Binding Proteins/genetics , Polymorphism, Single Nucleotide , Transcription Factors/genetics , Animals , Case-Control Studies , Cleft Palate/diagnosis , Disease Models, Animal , Ethnicity/genetics , Genetic Loci , Genome-Wide Association Study , Genotyping Techniques , Humans , Mutation, Missense , Risk Factors , Zebrafish/embryology , Zebrafish/genetics
10.
PLoS Genet ; 12(8): e1006149, 2016 Aug.
Article in English | MEDLINE | ID: mdl-27560520

ABSTRACT

Numerous lines of evidence point to a genetic basis for facial morphology in humans, yet little is known about how specific genetic variants relate to the phenotypic expression of many common facial features. We conducted genome-wide association meta-analyses of 20 quantitative facial measurements derived from the 3D surface images of 3118 healthy individuals of European ancestry belonging to two US cohorts. Analyses were performed on just under one million genotyped SNPs (Illumina OmniExpress+Exome v1.2 array) imputed to the 1000 Genomes reference panel (Phase 3). We observed genome-wide significant associations (p < 5 × 10^-8) for cranial base width at 14q21.1 and 20q12, intercanthal width at 1p13.3 and Xq13.2, nasal width at 20p11.22, nasal ala length at 14q11.2, and upper facial depth at 11q22.1. Several genes in the associated regions are known to play roles in craniofacial development or in syndromes affecting the face: MAFB, PAX9, MIPOL1, ALX3, HDAC8, and PAX1. We also tested genotype-phenotype associations reported in two previous genome-wide studies and found evidence of replication for nasal ala length and SNPs in CACNA2D3 and PRDM16. These results provide further evidence that common variants in regions harboring genes of known craniofacial function contribute to normal variation in human facial features. Improved understanding of the genes associated with facial morphology in healthy individuals can provide insights into the pathways and mechanisms controlling normal and abnormal facial morphogenesis.


Subjects
Face/anatomy & histology , Genetic Association Studies , Genome-Wide Association Study , Maxillofacial Development/genetics , Genetic Variation , Genotype , Humans , Phenotype , Polymorphism, Single Nucleotide , Transcription Factors/genetics , White People
11.
Hum Mol Genet ; 25(4): 807-16, 2016 Feb 15.
Article in English | MEDLINE | ID: mdl-26662797

ABSTRACT

Dental caries is the most common chronic disease worldwide, and exhibits profound disparities in the USA with racial and ethnic minorities experiencing disproportionate disease burden. Though heritable, the specific genes influencing risk of dental caries remain largely unknown. Therefore, we performed genome-wide association scans (GWASs) for dental caries in a population-based cohort of 12,000 Hispanic/Latino participants aged 18-74 years from the HCHS/SOL. Intra-oral examinations were used to generate two common indices of dental caries experience which were tested for association with 27.7 M genotyped or imputed single-nucleotide polymorphisms separately in the six ancestry groups. A mixed-models approach was used, which adjusted for age, sex, recruitment site, five principal components of ancestry and additional features of the sampling design. Meta-analyses were used to combine GWAS results across ancestry groups. Heritability estimates ranged from 20-53% in the six ancestry groups. The most significant association observed via meta-analysis for both phenotypes was in the region of the NAMPT gene (rs190395159; P-value = 6 × 10^-10), which is involved in many biological processes including periodontal healing. Another significant association was observed for rs72626594 (P-value = 3 × 10^-8) downstream of BMP7, a tooth development gene. Other associations were observed in genes lacking known or plausible roles in dental caries. In conclusion, this was the largest GWAS of dental caries to date and was the first to target Hispanic/Latino populations. Understanding the factors influencing dental caries susceptibility may lead to improvements in prediction, prevention and disease management, which may ultimately reduce the disparities in oral health across racial, ethnic and socioeconomic strata.


Subjects
Dental Caries/ethnology , Dental Caries/genetics , Hispanic or Latino/genetics , Adult , Aged , Community Health Centers , Female , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Male , Middle Aged , Polymorphism, Single Nucleotide
12.
Hum Mol Genet ; 25(13): 2862-2872, 2016 07 01.
Article in English | MEDLINE | ID: mdl-27033726

ABSTRACT

Orofacial clefts (OFCs), which include non-syndromic cleft lip with or without cleft palate (CL/P), are among the most common birth defects in humans, affecting approximately 1 in 700 newborns. CL/P is phenotypically heterogeneous and has a complex etiology caused by genetic and environmental factors. Previous genome-wide association studies (GWASs) have identified at least 15 risk loci for CL/P. As these loci do not account for all of the genetic variance of CL/P, we hypothesized the existence of additional risk loci. We conducted a multiethnic GWAS in 6480 participants (823 unrelated cases, 1700 unrelated controls and 1319 case-parent trios) with European, Asian, African and Central and South American ancestry. Our GWAS revealed novel associations on 2p24 near FAM49A, a gene of unknown function (P = 4.22 × 10^-8), and 19q13 near RHPN2, a gene involved in organizing the actin cytoskeleton (P = 4.17 × 10^-8). Other regions reaching genome-wide significance were 1p36 (PAX7), 1p22 (ARHGAP29), 1q32 (IRF6), 8q24 and 17p13 (NTN1), all reported in previous GWASs. Stratification by ancestry group revealed a novel association with a region on 17q23 (P = 2.92 × 10^-8) among individuals with European ancestry. This region included several promising candidates including TANC2, an oncogene required for development, and DCAF7, a scaffolding protein required for craniofacial development. In the Central and South American ancestry group, significant associations with loci previously identified in Asian or European ancestry groups reflected their admixed ancestry. In summary, we have identified novel CL/P risk loci and suggest new genes involved in craniofacial development, confirming the highly heterogeneous etiology of OFCs.


Subjects
Cleft Lip/genetics , Cleft Palate/genetics , Asian People/genetics , Black People/genetics , Chromosomes, Human, Pair 17/genetics , Chromosomes, Human, Pair 19/genetics , Chromosomes, Human, Pair 2/genetics , Ethnicity , Female , Genetic Loci , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Male , Polymorphism, Single Nucleotide/genetics , Risk Factors , White People/genetics
13.
Gastroenterology ; 144(4): 799-807.e24, 2013 Apr.
Article in English | MEDLINE | ID: mdl-23266556

ABSTRACT

BACKGROUND & AIMS: Heritable factors contribute to the development of colorectal cancer. Identifying the genetic loci associated with colorectal tumor formation could elucidate the mechanisms of pathogenesis. METHODS: We conducted a genome-wide association study that included 14 studies, 12,696 cases of colorectal tumors (11,870 cancer, 826 adenoma), and 15,113 controls of European descent. The 10 most statistically significant, previously unreported findings were followed up in 6 studies; these included 3056 colorectal tumor cases (2098 cancer, 958 adenoma) and 6658 controls of European and Asian descent. RESULTS: Based on the combined analysis, we identified a locus that reached the conventional genome-wide significance level at less than 5.0 × 10^-8: an intergenic region on chromosome 2q32.3, close to nucleic acid binding protein 1 (most significant single nucleotide polymorphism: rs11903757; odds ratio [OR], 1.15 per risk allele; P = 3.7 × 10^-8). We also found evidence for 3 additional loci with P values less than 5.0 × 10^-7: a locus within the laminin gamma 1 gene on chromosome 1q25.3 (rs10911251; OR, 1.10 per risk allele; P = 9.5 × 10^-8), a locus within the cyclin D2 gene on chromosome 12p13.32 (rs3217810; OR, 0.84 per risk allele; P = 5.9 × 10^-8), and a locus in the T-box 3 gene on chromosome 12q24.21 (rs59336; OR, 0.91 per risk allele; P = 3.7 × 10^-7). CONCLUSIONS: In a large genome-wide association study, we associated polymorphisms close to nucleic acid binding protein 1 (which encodes a DNA-binding protein involved in DNA repair) with colorectal tumor risk. We also provided evidence for an association between colorectal tumor risk and polymorphisms in laminin gamma 1 (this is the second gene in the laminin family to be associated with colorectal cancers), cyclin D2 (which encodes cyclin D2), and T-box 3 (which encodes a T-box transcription factor and is a target of Wnt signaling to β-catenin). The roles of these genes and their products in cancer pathogenesis warrant further investigation.


Subjects
Colorectal Neoplasms/genetics , Cyclin D2/genetics , Genetic Loci/genetics , Genetic Predisposition to Disease/epidemiology , Age Distribution , Aged , Aged, 80 and over , Colorectal Neoplasms/epidemiology , DNA-Binding Proteins/genetics , Female , Genome-Wide Association Study , Humans , Incidence , Laminin/genetics , Male , Middle Aged , Polymorphism, Single Nucleotide , Prognosis , Risk Assessment , Sex Distribution , T-Box Domain Proteins/genetics
14.
Bioinformatics ; 28(24): 3329-31, 2012 Dec 15.
Article in English | MEDLINE | ID: mdl-23052040

ABSTRACT

GWASTools is an R/Bioconductor package for quality control and analysis of genome-wide association studies (GWAS). GWASTools brings the interactive capability and extensive statistical libraries of R to GWAS. Data are stored in NetCDF format to accommodate extremely large datasets that cannot fit within R's memory limits. The documentation includes instructions for converting data from multiple formats, including variants called from sequencing. GWASTools provides a convenient interface for linking genotypes and intensity data with sample and single nucleotide polymorphism annotation.


Subjects
Genome-Wide Association Study/standards , Polymorphism, Single Nucleotide , Software , Genotype , Humans , Quality Control
15.
Circ Genom Precis Med ; 16(2): e003532, 2023 04.
Article in English | MEDLINE | ID: mdl-36960714

ABSTRACT

BACKGROUND: Risk for venous thromboembolism has a strong genetic component. Whole genome sequencing from the TOPMed program (Trans-Omics for Precision Medicine) allowed us to look for new associations, particularly rare variants missed by standard genome-wide association studies. METHODS: The 3793 cases and 7834 controls (11.6% of cases were individuals of African, Hispanic/Latino, or Asian ancestry) were analyzed using a single variant approach and an aggregate gene-based approach using our primary filter (included only loss-of-function and missense variants predicted to be deleterious) and our secondary filter (included all missense variants). RESULTS: Single variant analyses identified associations at 5 known loci. Aggregate gene-based analyses identified only PROC (odds ratio, 6.2 for carriers of rare variants; P=7.4×10^-14) when using our primary filter. Employing our secondary variant filter led to a smaller effect size at PROC (odds ratio, 3.8; P=1.6×10^-14), while excluding variants found only in rare isoforms led to a larger one (odds ratio, 7.5). Different filtering strategies improved the signal for 2 other known genes: PROS1 became significant (minimum P=1.8×10^-6 with the secondary filter), while SERPINC1 did not (minimum P=4.4×10^-5 with minor allele frequency <0.0005). Results were largely the same when restricting the analyses to include only unprovoked cases; however, one novel gene, MS4A1, became significant (P=4.4×10^-7 using all missense variants with minor allele frequency <0.0005). CONCLUSIONS: Here, we have demonstrated the importance of using multiple variant filtering strategies, as we detected additional genes when filtering variants based on their predicted deleteriousness, frequency, and presence on the most expressed isoforms. Our primary analyses did not identify new candidate loci; thus larger follow-up studies are needed to replicate the novel MS4A1 locus and to identify additional rare variation associated with venous thromboembolism.


Subjects
Genome-Wide Association Study , Venous Thromboembolism , Humans , Venous Thromboembolism/genetics , Precision Medicine , Genetic Predisposition to Disease , Gene Frequency
16.
Sci Adv ; 9(17): eabm4945, 2023 04 28.
Article in English | MEDLINE | ID: mdl-37126548

ABSTRACT

Nononcogenic somatic mutations are thought to be uncommon and inconsequential. To test this, we analyzed 43,693 National Heart, Lung and Blood Institute Trans-Omics for Precision Medicine blood whole genomes from 37 cohorts and identified 7131 non-missense somatic mutations that are recurrently mutated in at least 50 individuals. These recurrent non-missense somatic mutations (RNMSMs) are not clearly explained by other clonal phenomena such as clonal hematopoiesis. RNMSM prevalence increased with age, with an average 50-year-old having 27 RNMSMs. Inherited germline variation associated with RNMSM acquisition. These variants were found in genes involved in adaptive immune function, proinflammatory cytokine production, and lymphoid lineage commitment. In addition, the presence of eight specific RNMSMs associated with blood cell traits at effect sizes comparable to Mendelian genetic mutations. Overall, we found that somatic mutations in blood are an unexpectedly common phenomenon with ancestry-specific determinants and human health consequences.


Subjects
Germ-Line Mutation , Hematopoiesis , Humans , Middle Aged , Mutation , Mutation, Missense , Phenotype
17.
Cell Genom ; 2(1)2022 Jan 12.
Article in English | MEDLINE | ID: mdl-35530816

ABSTRACT

Genetic studies on telomere length are important for understanding age-related diseases. Prior GWAS for leukocyte TL have been limited to European and Asian populations. Here, we report the first sequencing-based association study for TL across ancestrally-diverse individuals (European, African, Asian and Hispanic/Latino) from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. We used whole genome sequencing (WGS) of whole blood for variant genotype calling and the bioinformatic estimation of telomere length in n=109,122 individuals. We identified 59 sentinel variants (p-value <5×10^-9) in 36 loci associated with telomere length, including 20 newly associated loci (13 were replicated in external datasets). There was little evidence of effect size heterogeneity across populations. Fine-mapping at OBFC1 indicated the independent signals colocalized with cell-type specific eQTLs for OBFC1 (STN1). Using a multi-variant gene-based approach, we identified two genes newly implicated in telomere length, DCLRE1B (SNM1B) and PARN. In PheWAS, we demonstrated our TL polygenic trait scores (PTS) were associated with increased risk of cancer-related phenotypes.

18.
Nat Genet ; 54(3): 263-273, 2022 03.
Article in English | MEDLINE | ID: mdl-35256806

ABSTRACT

Analyses of data from genome-wide association studies on unrelated individuals have shown that, for human traits and diseases, approximately one-third to two-thirds of heritability is captured by common SNPs. However, it is not known whether the remaining heritability is due to the imperfect tagging of causal variants by common SNPs, in particular whether the causal variants are rare, or whether it is overestimated due to bias in inference from pedigree data. Here we estimated heritability for height and body mass index (BMI) from whole-genome sequence data on 25,465 unrelated individuals of European ancestry. The estimated heritability was 0.68 (standard error 0.10) for height and 0.30 (standard error 0.10) for body mass index. Low minor allele frequency variants in low linkage disequilibrium (LD) with neighboring variants were enriched for heritability, to a greater extent for protein-altering variants, consistent with negative selection. Our results imply that rare variants, in particular those in regions of low linkage disequilibrium, are a major source of the still missing heritability of complex traits and disease.


Subjects
Genome-Wide Association Study , Multifactorial Inheritance , Alleles , Genome-Wide Association Study/methods , Humans , Linkage Disequilibrium , Multifactorial Inheritance/genetics , Polymorphism, Single Nucleotide/genetics
19.
Sci Adv ; 8(14): eabl6579, 2022 Apr 08.
Article in English | MEDLINE | ID: mdl-35385311

ABSTRACT

Human genetic studies support an inverse causal relationship between leukocyte telomere length (LTL) and coronary artery disease (CAD), but directionally mixed effects for LTL and diverse malignancies. Clonal hematopoiesis of indeterminate potential (CHIP), characterized by expansion of hematopoietic cells bearing leukemogenic mutations, predisposes both hematologic malignancy and CAD. TERT (which encodes telomerase reverse transcriptase) is the most significantly associated germline locus for CHIP in genome-wide association studies. Here, we investigated the relationship between CHIP, LTL, and CAD in the Trans-Omics for Precision Medicine (TOPMed) program (n = 63,302) and UK Biobank (n = 47,080). Bidirectional Mendelian randomization studies were consistent with longer genetically imputed LTL increasing propensity to develop CHIP, but CHIP then, in turn, hastens to shorten measured LTL (mLTL). We also demonstrated evidence of modest mediation between CHIP and CAD by mLTL. Our data promote an understanding of potential causal relationships across CHIP and LTL toward prevention of CAD.

20.
Nat Commun ; 12(1): 3506, 2021 06 09.
Article in English | MEDLINE | ID: mdl-34108454

ABSTRACT

In modern Whole Genome Sequencing (WGS) epidemiological studies, participant-level data from multiple studies are often pooled and results are obtained from a single analysis. We consider the impact of differential phenotype variances by study, which we term 'variance stratification'. Unaccounted for, variance stratification can lead to both decreased statistical power and increased false-positive rates, depending on how allele frequencies, sample sizes, and phenotypic variances vary across the studies that are pooled. We develop a procedure to compute variant-specific inflation factors, and show how it can be used for diagnosis of genetic association analyses on pooled individual level data from multiple studies. We describe a WGS-appropriate analysis approach, implemented in freely-available software, which allows study-specific variances and thereby improves performance in practice. We illustrate the variance stratification problem, its solutions, and the proposed diagnostic procedure, in simulations and in data from the Trans-Omics for Precision Medicine Whole Genome Sequencing Program (TOPMed), used in association tests for hemoglobin concentrations and BMI.
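
To make the idea of study-specific variances concrete, here is a deliberately simplified sketch. It assumes each study gets its own intercept and that each observation is weighted by the inverse of its study's phenotype variance, which is only a stand-in for a model with study-specific residual variances. It does not reproduce the authors' software or their variant-specific inflation factors, and all function names are hypothetical.

```python
from statistics import fmean, pvariance


def _group(values, study):
    """Collect values by study label."""
    groups = {}
    for v, s in zip(values, study):
        groups.setdefault(s, []).append(v)
    return groups


def center_within_study(values, study):
    """Subtract each study's mean (equivalent to a per-study intercept)."""
    means = {s: fmean(vs) for s, vs in _group(values, study).items()}
    return [v - means[s] for v, s in zip(values, study)]


def pooled_effect(genotype, phenotype, study, study_specific=True):
    """Pooled genotype effect, optionally weighting by 1 / study variance."""
    g = center_within_study(genotype, study)
    y = center_within_study(phenotype, study)
    if study_specific:
        variances = {s: pvariance(vs) for s, vs in _group(phenotype, study).items()}
        w = [1.0 / variances[s] for s in study]
    else:
        w = [1.0] * len(study)  # a single common variance for all studies
    num = sum(wi * gi * yi for wi, gi, yi in zip(w, g, y))
    den = sum(wi * gi * gi for wi, gi in zip(w, g))
    return num / den


# Toy data: study "B" has a far more variable phenotype than study "A".
genotype = [0.0, 1.0, 2.0, 0.0, 1.0, 2.0]
phenotype = [0.0, 2.0, 4.0, 0.0, 20.0, 40.0]
study = ["A", "A", "A", "B", "B", "B"]
unweighted = pooled_effect(genotype, phenotype, study, study_specific=False)
weighted = pooled_effect(genotype, phenotype, study)
```

In this toy input the high-variance study dominates the unweighted estimate, while the weighted estimate stays close to the value seen in the low-variance study.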


Subjects
Genetic Variation , Genome-Wide Association Study/methods , Algorithms , Computer Simulation , Gene Frequency , Genome-Wide Association Study/standards , Genome-Wide Association Study/statistics & numerical data , Humans , Phenotype , Sample Size