Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Resultados 1 - 20 de 125
Filtrar
1.
Cell ; 179(4): 984-1002.e36, 2019 10 31.
Artículo en Inglés | MEDLINE | ID: mdl-31675503

RESUMEN

Genomic studies in African populations provide unique opportunities to understand disease etiology, human diversity, and population history. In the largest study of its kind, comprising genome-wide data from 6,400 individuals and whole-genome sequences from 1,978 individuals from rural Uganda, we find evidence of geographically correlated fine-scale population substructure. Historically, the ancestry of modern Ugandans was best represented by a mixture of ancient East African pastoralists. We demonstrate the value of the largest sequence panel from Africa to date as an imputation resource. Examining 34 cardiometabolic traits, we show systematic differences in trait heritability between European and African populations, probably reflecting the differential impact of genes and environment. In a multi-trait pan-African GWAS of up to 14,126 individuals, we identify novel loci associated with anthropometric, hematological, lipid, and glycemic traits. We find that several functionally important signals are driven by Africa-specific variants, highlighting the value of studying diverse populations across the region.


Asunto(s)
Población Negra/genética , Predisposición Genética a la Enfermedad , Genoma Humano/genética , Genómica , Femenino , Frecuencia de los Genes/genética , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Polimorfismo de Nucleótido Simple/genética , Uganda/epidemiología , Secuenciación Completa del Genoma
2.
Nature ; 590(7845): 290-299, 2021 02.
Artículo en Inglés | MEDLINE | ID: mdl-33568819

RESUMEN

The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes)1. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.


Asunto(s)
Variación Genética/genética , Genoma Humano/genética , Genómica , National Heart, Lung, and Blood Institute (U.S.) , Medicina de Precisión , Citocromo P-450 CYP2D6/genética , Haplotipos/genética , Heterocigoto , Humanos , Mutación INDEL , Mutación con Pérdida de Función , Mutagénesis , Fenotipo , Polimorfismo de Nucleótido Simple , Densidad de Población , Medicina de Precisión/normas , Control de Calidad , Tamaño de la Muestra , Estados Unidos , Secuenciación Completa del Genoma/normas
3.
Hum Mol Genet ; 2024 May 15.
Artículo en Inglés | MEDLINE | ID: mdl-38747556

RESUMEN

Inflammation biomarkers can provide valuable insight into the role of inflammatory processes in many diseases and conditions. Sequencing based analyses of such biomarkers can also serve as an exemplar of the genetic architecture of quantitative traits. To evaluate the biological insight, which can be provided by a multi-ancestry, whole-genome based association study, we performed a comprehensive analysis of 21 inflammation biomarkers from up to 38 465 individuals with whole-genome sequencing from the Trans-Omics for Precision Medicine (TOPMed) program (with varying sample size by trait, where the minimum sample size was n = 737 for MMP-1). We identified 22 distinct single-variant associations across 6 traits-E-selectin, intercellular adhesion molecule 1, interleukin-6, lipoprotein-associated phospholipase A2 activity and mass, and P-selectin-that remained significant after conditioning on previously identified associations for these inflammatory biomarkers. We further expanded upon known biomarker associations by pairing the single-variant analysis with a rare variant set-based analysis that further identified 19 significant rare variant set-based associations with 5 traits. These signals were distinct from both significant single variant association signals within TOPMed and genetic signals observed in prior studies, demonstrating the complementary value of performing both single and rare variant analyses when analyzing quantitative traits. We also confirm several previously reported signals from semi-quantitative proteomics platforms. Many of these signals demonstrate the extensive allelic heterogeneity and ancestry-differentiated variant-trait associations common for inflammation biomarkers, a characteristic we hypothesize will be increasingly observed with well-powered, large-scale analyses of complex traits.

4.
Am J Hum Genet ; 109(6): 1175-1181, 2022 06 02.
Artículo en Inglés | MEDLINE | ID: mdl-35504290

RESUMEN

Current publicly available tools that allow rapid exploration of linkage disequilibrium (LD) between markers (e.g., HaploReg and LDlink) are based on whole-genome sequence (WGS) data from 2,504 individuals in the 1000 Genomes Project. Here, we present TOP-LD, an online tool to explore LD inferred with high-coverage (∼30×) WGS data from 15,578 individuals in the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. TOP-LD provides a significant upgrade compared to current LD tools, as the TOPMed WGS data provide a more comprehensive representation of genetic variation than the 1000 Genomes data, particularly for rare variants and in the specific populations that we analyzed. For example, TOP-LD encompasses LD information for 150.3, 62.2, and 36.7 million variants for European, African, and East Asian ancestral samples, respectively, offering 2.6- to 9.1-fold increase in variant coverage compared to HaploReg 4.0 or LDlink. In addition, TOP-LD includes tens of thousands of structural variants (SVs). We demonstrate the value of TOP-LD in fine-mapping at the GGT1 locus associated with gamma glutamyltransferase in the African ancestry participants in UK Biobank. Beyond fine-mapping, TOP-LD can facilitate a wide range of applications that are based on summary statistics and estimates of LD. TOP-LD is freely available online.


Asunto(s)
Estudio de Asociación del Genoma Completo , Medicina de Precisión , Pueblo Asiatico , Humanos , Desequilibrio de Ligamiento/genética , Polimorfismo de Nucleótido Simple/genética , Secuenciación Completa del Genoma
5.
Nat Methods ; 19(12): 1599-1611, 2022 12.
Artículo en Inglés | MEDLINE | ID: mdl-36303018

RESUMEN

Large-scale whole-genome sequencing studies have enabled analysis of noncoding rare-variant (RV) associations with complex human diseases and traits. Variant-set analysis is a powerful approach to study RV association. However, existing methods have limited ability in analyzing the noncoding genome. We propose a computationally efficient and robust noncoding RV association detection framework, STAARpipeline, to automatically annotate a whole-genome sequencing study and perform flexible noncoding RV association analysis, including gene-centric analysis and fixed window-based and dynamic window-based non-gene-centric analysis by incorporating variant functional annotations. In gene-centric analysis, STAARpipeline uses STAAR to group noncoding variants based on functional categories of genes and incorporate multiple functional annotations. In non-gene-centric analysis, STAARpipeline uses SCANG-STAAR to incorporate dynamic window sizes and multiple functional annotations. We apply STAARpipeline to identify noncoding RV sets associated with four lipid traits in 21,015 discovery samples from the Trans-Omics for Precision Medicine (TOPMed) program and replicate several of them in an additional 9,123 TOPMed samples. We also analyze five non-lipid TOPMed traits.


Asunto(s)
Estudio de Asociación del Genoma Completo , Genoma , Humanos , Estudio de Asociación del Genoma Completo/métodos , Secuenciación Completa del Genoma/métodos , Fenotipo , Variación Genética
6.
Hum Mol Genet ; 31(18): 3120-3132, 2022 09 10.
Artículo en Inglés | MEDLINE | ID: mdl-35552711

RESUMEN

Plasma levels of fibrinogen, coagulation factors VII and VIII and von Willebrand factor (vWF) are four intermediate phenotypes that are heritable and have been associated with the risk of clinical thrombotic events. To identify rare and low-frequency variants associated with these hemostatic factors, we conducted whole-exome sequencing in 10 860 individuals of European ancestry (EA) and 3529 African Americans (AAs) from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium and the National Heart, Lung and Blood Institute's Exome Sequencing Project. Gene-based tests demonstrated significant associations with rare variation (minor allele frequency < 5%) in fibrinogen gamma chain (FGG) (with fibrinogen, P = 9.1 × 10-13), coagulation factor VII (F7) (with factor VII, P = 1.3 × 10-72; seven novel variants) and VWF (with factor VIII and vWF; P = 3.2 × 10-14; one novel variant). These eight novel rare variant associations were independent of the known common variants at these loci and tended to have much larger effect sizes. In addition, one of the rare novel variants in F7 was significantly associated with an increased risk of venous thromboembolism in AAs (Ile200Ser; rs141219108; P = 4.2 × 10-5). After restricting gene-based analyses to only loss-of-function variants, a novel significant association was detected and replicated between factor VIII levels and a stop-gain mutation exclusive to AAs (rs3211938) in CD36 molecule (CD36). This variant has previously been linked to dyslipidemia but not with the levels of a hemostatic factor. These efforts represent the largest integration of whole-exome sequence data from two national projects to identify genetic variation associated with plasma hemostatic factors.


Asunto(s)
Factor VIII , Hemostáticos , Factor VII/genética , Factor VIII/genética , Fibrinógeno/genética , Humanos , Polimorfismo de Nucleótido Simple/genética , Secuenciación del Exoma , Factor de von Willebrand/análisis , Factor de von Willebrand/genética
8.
BMC Genomics ; 24(1): 303, 2023 Jun 06.
Artículo en Inglés | MEDLINE | ID: mdl-37277705

RESUMEN

BACKGROUND: Analysis of imputed genotypes is an important and routine component of genome-wide association studies and the increasing size of imputation reference panels has facilitated the ability to impute and test low-frequency variants for associations. In the context of genotype imputation, the true genotype is unknown and genotypes are inferred with uncertainty using statistical models. Here, we present a novel method for integrating imputation uncertainty into statistical association tests using a fully conditional multiple imputation (MI) approach which is implemented using the Substantive Model Compatible Fully Conditional Specification (SMCFCS). We compared the performance of this method to an unconditional MI and two additional approaches that have been shown to demonstrate excellent performance: regression with dosages and a mixture of regression models (MRM). RESULTS: Our simulations considered a range of allele frequencies and imputation qualities based on data from the UK Biobank. We found that the unconditional MI was computationally costly and overly conservative across a wide range of settings. Analyzing data with Dosage, MRM, or MI SMCFCS resulted in greater power, including for low frequency variants, compared to unconditional MI while effectively controlling type I error rates. MRM andl MI SMCFCS are both more computationally intensive then using Dosage. CONCLUSIONS: The unconditional MI approach for association testing is overly conservative and we do not recommend its use in the context of imputed genotypes. Given its performance, speed, and ease of implementation, we recommend using Dosage for imputed genotypes with MAF [Formula: see text] 0.001 and Rsq [Formula: see text] 0.3.


Asunto(s)
Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Estudio de Asociación del Genoma Completo/métodos , Genotipo , Frecuencia de los Genes , Modelos Estadísticos
9.
Breast Cancer Res ; 25(1): 93, 2023 08 09.
Artículo en Inglés | MEDLINE | ID: mdl-37559094

RESUMEN

BACKGROUND: Genome-wide studies of gene-environment interactions (G×E) may identify variants associated with disease risk in conjunction with lifestyle/environmental exposures. We conducted a genome-wide G×E analysis of ~ 7.6 million common variants and seven lifestyle/environmental risk factors for breast cancer risk overall and for estrogen receptor positive (ER +) breast cancer. METHODS: Analyses were conducted using 72,285 breast cancer cases and 80,354 controls of European ancestry from the Breast Cancer Association Consortium. Gene-environment interactions were evaluated using standard unconditional logistic regression models and likelihood ratio tests for breast cancer risk overall and for ER + breast cancer. Bayesian False Discovery Probability was employed to assess the noteworthiness of each SNP-risk factor pairs. RESULTS: Assuming a 1 × 10-5 prior probability of a true association for each SNP-risk factor pairs and a Bayesian False Discovery Probability < 15%, we identified two independent SNP-risk factor pairs: rs80018847(9p13)-LINGO2 and adult height in association with overall breast cancer risk (ORint = 0.94, 95% CI 0.92-0.96), and rs4770552(13q12)-SPATA13 and age at menarche for ER + breast cancer risk (ORint = 0.91, 95% CI 0.88-0.94). CONCLUSIONS: Overall, the contribution of G×E interactions to the heritability of breast cancer is very small. At the population level, multiplicative G×E interactions do not make an important contribution to risk prediction in breast cancer.


Asunto(s)
Neoplasias de la Mama , Interacción Gen-Ambiente , Adulto , Femenino , Humanos , Predisposición Genética a la Enfermedad , Neoplasias de la Mama/etiología , Neoplasias de la Mama/genética , Teorema de Bayes , Estudio de Asociación del Genoma Completo , Factores de Riesgo , Polimorfismo de Nucleótido Simple , Estudios de Casos y Controles
10.
Am J Hum Genet ; 106(1): 112-120, 2020 01 02.
Artículo en Inglés | MEDLINE | ID: mdl-31883642

RESUMEN

Whole-genome sequencing (WGS) can improve assessment of low-frequency and rare variants, particularly in non-European populations that have been underrepresented in existing genomic studies. The genetic determinants of C-reactive protein (CRP), a biomarker of chronic inflammation, have been extensively studied, with existing genome-wide association studies (GWASs) conducted in >200,000 individuals of European ancestry. In order to discover novel loci associated with CRP levels, we examined a multi-ancestry population (n = 23,279) with WGS (∼38× coverage) from the Trans-Omics for Precision Medicine (TOPMed) program. We found evidence for eight distinct associations at the CRP locus, including two variants that have not been identified previously (rs11265259 and rs181704186), both of which are non-coding and more common in individuals of African ancestry (∼10% and ∼1% minor allele frequency, respectively, and rare or monomorphic in 1000 Genomes populations of East Asian, South Asian, and European ancestry). We show that the minor (G) allele of rs181704186 is associated with lower CRP levels and decreased transcriptional activity and protein binding in vitro, providing a plausible molecular mechanism for this African ancestry-specific signal. The individuals homozygous for rs181704186-G have a mean CRP level of 0.23 mg/L, in contrast to individuals heterozygous for rs181704186 with mean CRP of 2.97 mg/L and major allele homozygotes with mean CRP of 4.11 mg/L. This study demonstrates the utility of WGS in multi-ethnic populations to drive discovery of complex trait associations of large effect and to identify functional alleles in noncoding regulatory regions.


Asunto(s)
Pueblo Asiatico/genética , Población Negra/genética , Proteína C-Reactiva/genética , Predisposición Genética a la Enfermedad , Polimorfismo de Nucleótido Simple , Población Blanca/genética , Secuenciación Completa del Genoma/métodos , Estudios de Cohortes , Frecuencia de los Genes , Estudio de Asociación del Genoma Completo , Humanos , Desequilibrio de Ligamiento
11.
Biostatistics ; 23(2): 362-379, 2022 04 13.
Artículo en Inglés | MEDLINE | ID: mdl-32766691

RESUMEN

Malignant progression of normal tissue is typically driven by complex networks of somatic changes, including genetic mutations, copy number aberrations, epigenetic changes, and transcriptional reprogramming. To delineate aberrant multi-omic tumor features that correlate with clinical outcomes, we present a novel pathway-centric tool based on the multiple factor analysis framework called padma. Using a multi-omic consensus representation, padma quantifies and characterizes individualized pathway-specific multi-omic deviations and their underlying drivers, with respect to the sampled population. We demonstrate the utility of padma to correlate patient outcomes with complex genetic, epigenetic, and transcriptomic perturbations in clinically actionable pathways in breast and lung cancer.


Asunto(s)
Neoplasias , Análisis Factorial , Humanos , Neoplasias/genética , Transcriptoma
12.
Stat Med ; 42(17): 2962-2981, 2023 07 30.
Artículo en Inglés | MEDLINE | ID: mdl-37345498

RESUMEN

In this study, the asymptotic distributions of the likelihood ratio test (LRT), the restricted likelihood ratio test (RLRT), the F and the sequence kernel association test (SKAT) statistics for testing an additive effect of the expected familial relatedness (FR) in a linear mixed model are examined based on an eigenvalue approach. First, the covariance structure for modeling the FR effect in a LMM is presented. Then, the multiplicity of eigenvalues for the log-likelihood and restricted log-likelihood is established under a replicate family setting and extended to a more general replicate family setting (GRFS) as well. After that, the asymptotic null distributions of LRT, RLRT, F and SKAT statistics under GRFS are derived. The asymptotic null distribution of SKAT for testing genetic rare variants is also constructed. In addition, a simple formula for sample size calculation is provided based on the restricted maximum likelihood estimate of the effect size for the expected FR. Finally, a power comparison of these test statistics on hypothesis test of the expected FR effect is made via simulation. The four test statistics are also applied to a data set from the UK Biobank.


Asunto(s)
Modelos Genéticos , Humanos , Funciones de Verosimilitud , Simulación por Computador , Modelos Lineales
13.
Nature ; 551(7678): 92-94, 2017 11 02.
Artículo en Inglés | MEDLINE | ID: mdl-29059683

RESUMEN

Breast cancer risk is influenced by rare coding variants in susceptibility genes, such as BRCA1, and many common, mostly non-coding variants. However, much of the genetic contribution to breast cancer risk remains unknown. Here we report the results of a genome-wide association study of breast cancer in 122,977 cases and 105,974 controls of European ancestry and 14,068 cases and 13,104 controls of East Asian ancestry. We identified 65 new loci that are associated with overall breast cancer risk at P < 5 × 10-8. The majority of credible risk single-nucleotide polymorphisms in these loci fall in distal regulatory elements, and by integrating in silico data to predict target genes in breast cells at each locus, we demonstrate a strong overlap between candidate target genes and somatic driver genes in breast tumours. We also find that heritability of breast cancer due to all single-nucleotide polymorphisms in regulatory features was 2-5-fold enriched relative to the genome-wide average, with strong enrichment for particular transcription factor binding sites. These results provide further insight into genetic susceptibility to breast cancer and will improve the use of genetic risk scores for individualized screening and prevention.


Asunto(s)
Neoplasias de la Mama/genética , Sitios Genéticos , Predisposición Genética a la Enfermedad/genética , Estudio de Asociación del Genoma Completo , Asia/etnología , Pueblo Asiatico/genética , Sitios de Unión/genética , Neoplasias de la Mama/diagnóstico , Simulación por Computador , Europa (Continente)/etnología , Femenino , Humanos , Herencia Multifactorial/genética , Polimorfismo de Nucleótido Simple/genética , Secuencias Reguladoras de Ácidos Nucleicos , Medición de Riesgo , Factores de Transcripción/metabolismo , Población Blanca/genética
14.
Nature ; 542(7640): 186-190, 2017 02 09.
Artículo en Inglés | MEDLINE | ID: mdl-28146470

RESUMEN

Height is a highly heritable, classic polygenic trait with approximately 700 common associated variants identified through genome-wide association studies so far. Here, we report 83 height-associated coding variants with lower minor-allele frequencies (in the range of 0.1-4.8%) and effects of up to 2 centimetres per allele (such as those in IHH, STC2, AR and CRISPLD2), greater than ten times the average effect of common variants. In functional follow-up studies, rare height-increasing alleles of STC2 (giving an increase of 1-2 centimetres per allele) compromised proteolytic inhibition of PAPP-A and increased cleavage of IGFBP-4 in vitro, resulting in higher bioavailability of insulin-like growth factors. These 83 height-associated variants overlap genes that are mutated in monogenic growth disorders and highlight new biological candidates (such as ADAMTS3, IL11RA and NOX4) and pathways (such as proteoglycan and glycosaminoglycan synthesis) involved in growth. Our results demonstrate that sufficiently large sample sizes can uncover rare and low-frequency variants of moderate-to-large effect associated with polygenic human phenotypes, and that these variants implicate relevant genes and pathways.


Asunto(s)
Estatura/genética , Frecuencia de los Genes/genética , Variación Genética/genética , Proteínas ADAMTS/genética , Adulto , Alelos , Moléculas de Adhesión Celular/genética , Femenino , Genoma Humano/genética , Glicoproteínas/genética , Glicoproteínas/metabolismo , Glicosaminoglicanos/biosíntesis , Proteínas Hedgehog/genética , Humanos , Péptidos y Proteínas de Señalización Intercelular/genética , Péptidos y Proteínas de Señalización Intercelular/metabolismo , Factores Reguladores del Interferón/genética , Subunidad alfa del Receptor de Interleucina-11/genética , Masculino , Herencia Multifactorial/genética , NADPH Oxidasa 4 , NADPH Oxidasas/genética , Fenotipo , Proteína Plasmática A Asociada al Embarazo/metabolismo , Procolágeno N-Endopeptidasa/genética , Proteoglicanos/biosíntesis , Proteolisis , Receptores Androgénicos/genética , Somatomedinas/metabolismo
15.
Genet Epidemiol ; 45(1): 16-23, 2021 02.
Artículo en Inglés | MEDLINE | ID: mdl-32918779

RESUMEN

Mendelian randomization (MR) is an established approach for assessing the causal effects of heritable exposures on outcomes. Outcomes of interest often include binary clinical endpoints, but may also include censored survival times. We explore the implications of both the Cox proportional hazard model and the additive hazard model in the context of MR, with a specific emphasis on two-stage methods. We show that naive application of standard MR approaches to censored survival times may induce significant bias. Through simulations and analysis of data from the Women's Health Initiative, we provide practical advice on modeling survival outcomes in MRs.


Asunto(s)
Análisis de la Aleatorización Mendeliana , Modelos Genéticos , Sesgo , Causalidad , Femenino , Humanos , Modelos de Riesgos Proporcionales
16.
Genet Epidemiol ; 45(3): 305-315, 2021 04.
Artículo en Inglés | MEDLINE | ID: mdl-33175443

RESUMEN

Familial relatedness (FR) and population structure (PS) are two major sources for genetic correlation. In the human population, both FR and PS can further break down into additive and dominant components to account for potential additive and dominant genetic effects. In this study, besides the classical additive genomic relationship matrix, a dominant genomic relationship matrix is introduced. A link between the additive/dominant genomic relationship matrices and the coancestry (or kinship)/double coancestry coefficients is also established. In addition, a way to separate the FR and PS correlations based on the estimates of coancestry and double coancestry coefficients from the genomic relationship matrices is proposed. A unified linear mixed model is also developed, which can account for both the additive and dominance effects of FR and PS correlations as well as their possible random interactions. Finally, this unified linear mixed model is applied to analyze two study cohorts from UK Biobank.


Asunto(s)
Genoma , Modelos Genéticos , Genes Dominantes , Estudios de Asociación Genética , Genómica , Humanos
17.
Stroke ; 53(3): 875-885, 2022 03.
Artículo en Inglés | MEDLINE | ID: mdl-34727735

RESUMEN

BACKGROUND AND PURPOSE: Stroke is the leading cause of death and long-term disability worldwide. Previous genome-wide association studies identified 51 loci associated with stroke (mostly ischemic) and its subtypes among predominantly European populations. Using whole-genome sequencing in ancestrally diverse populations from the Trans-Omics for Precision Medicine (TOPMed) Program, we aimed to identify novel variants, especially low-frequency or ancestry-specific variants, associated with all stroke, ischemic stroke and its subtypes (large artery, cardioembolic, and small vessel), and hemorrhagic stroke and its subtypes (intracerebral and subarachnoid). METHODS: Whole-genome sequencing data were available for 6833 stroke cases and 27 116 controls, including 22 315 European, 7877 Black, 2616 Hispanic/Latino, 850 Asian, 54 Native American, and 237 other ancestry participants. In TOPMed, we performed single variant association analysis examining 40 million common variants and aggregated association analysis focusing on rare variants. We also combined TOPMed European populations with over 28 000 additional European participants from the UK BioBank genome-wide array data through meta-analysis. RESULTS: In the single variant association analysis in TOPMed, we identified one novel locus 13q33 for large artery at whole-genome-wide significance (P<5.00×10-9) and 4 novel loci at genome-wide significance (P<5.00×10-8), all of which need confirmation in independent studies. Lead variants in all 5 loci are low-frequency but are more common in non-European populations. An aggregation of synonymous rare variants within the gene C6orf26 demonstrated suggestive evidence of association for hemorrhagic stroke (P<3.11×10-6). By meta-analyzing European ancestry samples in TOPMed and UK BioBank, we replicated several previously reported stroke loci including PITX2, HDAC9, ZFHX3, and LRCH1. CONCLUSIONS: We represent the first association analysis for stroke and its subtypes using whole-genome sequencing data from ancestrally diverse populations. While our findings suggest the potential benefits of combining whole-genome sequencing data with populations of diverse genetic backgrounds to identify possible low-frequency or ancestry-specific variants, they also highlight the need to increase genome coverage and sample sizes.


Asunto(s)
Sitios Genéticos , Predisposición Genética a la Enfermedad , Polimorfismo de Nucleótido Simple , Medicina de Precisión , Grupos Raciales/genética , Accidente Cerebrovascular/genética , Anciano , Anciano de 80 o más Años , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Persona de Mediana Edad , Secuenciación Completa del Genoma
18.
Breast Cancer Res ; 24(1): 2, 2022 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-34983606

RESUMEN

BACKGROUND: Genome-wide association studies (GWAS) have identified multiple common breast cancer susceptibility variants. Many of these variants have differential associations by estrogen receptor (ER) status, but how these variants relate with other tumor features and intrinsic molecular subtypes is unclear. METHODS: Among 106,571 invasive breast cancer cases and 95,762 controls of European ancestry with data on 173 breast cancer variants identified in previous GWAS, we used novel two-stage polytomous logistic regression models to evaluate variants in relation to multiple tumor features (ER, progesterone receptor (PR), human epidermal growth factor receptor 2 (HER2) and grade) adjusting for each other, and to intrinsic-like subtypes. RESULTS: Eighty-five of 173 variants were associated with at least one tumor feature (false discovery rate < 5%), most commonly ER and grade, followed by PR and HER2. Models for intrinsic-like subtypes found nearly all of these variants (83 of 85) associated at p < 0.05 with risk for at least one luminal-like subtype, and approximately half (41 of 85) of the variants were associated with risk of at least one non-luminal subtype, including 32 variants associated with triple-negative (TN) disease. Ten variants were associated with risk of all subtypes in different magnitude. Five variants were associated with risk of luminal A-like and TN subtypes in opposite directions. CONCLUSION: This report demonstrates a high level of complexity in the etiology heterogeneity of breast cancer susceptibility variants and can inform investigations of subtype-specific risk prediction.


Asunto(s)
Neoplasias de la Mama , Biomarcadores de Tumor/genética , Biomarcadores de Tumor/metabolismo , Neoplasias de la Mama/epidemiología , Neoplasias de la Mama/genética , Neoplasias de la Mama/metabolismo , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Receptor ErbB-2/genética , Receptor ErbB-2/metabolismo , Receptores de Estrógenos/genética , Receptores de Estrógenos/metabolismo , Receptores de Progesterona/genética , Receptores de Progesterona/metabolismo , Riesgo
19.
Am J Hum Genet ; 104(1): 21-34, 2019 01 03.
Artículo en Inglés | MEDLINE | ID: mdl-30554720

RESUMEN

Stratification of women according to their risk of breast cancer based on polygenic risk scores (PRSs) could improve screening and prevention strategies. Our aim was to develop PRSs, optimized for prediction of estrogen receptor (ER)-specific disease, from the largest available genome-wide association dataset and to empirically validate the PRSs in prospective studies. The development dataset comprised 94,075 case subjects and 75,017 control subjects of European ancestry from 69 studies, divided into training and validation sets. Samples were genotyped using genome-wide arrays, and single-nucleotide polymorphisms (SNPs) were selected by stepwise regression or lasso penalized regression. The best performing PRSs were validated in an independent test set comprising 11,428 case subjects and 18,323 control subjects from 10 prospective studies and 190,040 women from UK Biobank (3,215 incident breast cancers). For the best PRSs (313 SNPs), the odds ratio for overall disease per 1 standard deviation in ten prospective studies was 1.61 (95%CI: 1.57-1.65) with area under receiver-operator curve (AUC) = 0.630 (95%CI: 0.628-0.651). The lifetime risk of overall breast cancer in the top centile of the PRSs was 32.6%. Compared with women in the middle quintile, those in the highest 1% of risk had 4.37- and 2.78-fold risks, and those in the lowest 1% of risk had 0.16- and 0.27-fold risks, of developing ER-positive and ER-negative disease, respectively. Goodness-of-fit tests indicated that this PRS was well calibrated and predicts disease risk accurately in the tails of the distribution. This PRS is a powerful and reliable predictor of breast cancer risk that may improve breast cancer prevention programs.


Asunto(s)
Neoplasias de la Mama/clasificación , Neoplasias de la Mama/genética , Predisposición Genética a la Enfermedad , Herencia Multifactorial/genética , Adulto , Factores de Edad , Anciano , Anciano de 80 o más Años , Neoplasias de la Mama/diagnóstico , Neoplasias de la Mama/prevención & control , Femenino , Humanos , Anamnesis , Persona de Mediana Edad , Polimorfismo de Nucleótido Simple/genética , Receptores de Estrógenos/metabolismo , Reproducibilidad de los Resultados , Medición de Riesgo
20.
Am J Hum Genet ; 105(1): 15-28, 2019 07 03.
Artículo en Inglés | MEDLINE | ID: mdl-31178129

RESUMEN

Circulating levels of adiponectin, an adipocyte-secreted protein associated with cardiovascular and metabolic risk, are highly heritable. To gain insights into the biology that regulates adiponectin levels, we performed an exome array meta-analysis of 265,780 genetic variants in 67,739 individuals of European, Hispanic, African American, and East Asian ancestry. We identified 20 loci associated with adiponectin, including 11 that had been reported previously (p < 2 × 10-7). Comparison of exome array variants to regional linkage disequilibrium (LD) patterns and prior genome-wide association study (GWAS) results detected candidate variants (r2 > .60) spanning as much as 900 kb. To identify potential genes and mechanisms through which the previously unreported association signals act to affect adiponectin levels, we assessed cross-trait associations, expression quantitative trait loci in subcutaneous adipose, and biological pathways of nearby genes. Eight of the nine loci were also associated (p < 1 × 10-4) with at least one obesity or lipid trait. Candidate genes include PRKAR2A, PTH1R, and HDAC9, which have been suggested to play roles in adipocyte differentiation or bone marrow adipose tissue. Taken together, these findings provide further insights into the processes that influence circulating adiponectin levels.


Asunto(s)
Adiponectina/genética , Tejido Adiposo/patología , Exoma/genética , Predisposición Genética a la Enfermedad , Lípidos/análisis , Obesidad/etiología , Polimorfismo de Nucleótido Simple , Tejido Adiposo/metabolismo , Adolescente , Adulto , Negro o Afroamericano/genética , Anciano , Anciano de 80 o más Años , Femenino , Hispánicos o Latinos/genética , Humanos , Masculino , Persona de Mediana Edad , Obesidad/patología , Fenotipo , Sitios de Carácter Cuantitativo , Población Blanca/genética , Adulto Joven
SELECCIÓN DE REFERENCIAS
Detalles de la búsqueda