Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 14 de 14
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Hum Mol Genet ; 33(16): 1429-1441, 2024 Aug 06.
Artigo em Inglês | MEDLINE | ID: mdl-38747556

RESUMO

Inflammation biomarkers can provide valuable insight into the role of inflammatory processes in many diseases and conditions. Sequencing based analyses of such biomarkers can also serve as an exemplar of the genetic architecture of quantitative traits. To evaluate the biological insight, which can be provided by a multi-ancestry, whole-genome based association study, we performed a comprehensive analysis of 21 inflammation biomarkers from up to 38 465 individuals with whole-genome sequencing from the Trans-Omics for Precision Medicine (TOPMed) program (with varying sample size by trait, where the minimum sample size was n = 737 for MMP-1). We identified 22 distinct single-variant associations across 6 traits-E-selectin, intercellular adhesion molecule 1, interleukin-6, lipoprotein-associated phospholipase A2 activity and mass, and P-selectin-that remained significant after conditioning on previously identified associations for these inflammatory biomarkers. We further expanded upon known biomarker associations by pairing the single-variant analysis with a rare variant set-based analysis that further identified 19 significant rare variant set-based associations with 5 traits. These signals were distinct from both significant single variant association signals within TOPMed and genetic signals observed in prior studies, demonstrating the complementary value of performing both single and rare variant analyses when analyzing quantitative traits. We also confirm several previously reported signals from semi-quantitative proteomics platforms. Many of these signals demonstrate the extensive allelic heterogeneity and ancestry-differentiated variant-trait associations common for inflammation biomarkers, a characteristic we hypothesize will be increasingly observed with well-powered, large-scale analyses of complex traits.


Assuntos
Biomarcadores , Estudo de Associação Genômica Ampla , Inflamação , Medicina de Precisão , Sequenciamento Completo do Genoma , Humanos , Medicina de Precisão/métodos , Inflamação/genética , Estudo de Associação Genômica Ampla/métodos , Sequenciamento Completo do Genoma/métodos , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Predisposição Genética para Doença , Feminino , Interleucina-6/genética
2.
Am J Hum Genet ; 109(6): 1016-1025, 2022 06 02.
Artigo em Inglês | MEDLINE | ID: mdl-35659928

RESUMO

Haplotypes can be estimated from unphased genotype data via statistical methods. When parent-offspring trios are available for inferring the true phase from Mendelian inheritance rules, the accuracy of statistical phasing is usually measured by the switch error rate, which is the proportion of pairs of consecutive heterozygotes that are incorrectly phased. We present a method for estimating the genotype error rate from parent-offspring trios and a method for estimating the bias that occurs in the observed switch error rate as a result of genotype error. We apply these methods to 485,301 genotyped UK Biobank samples that include 898 White British trios and to 38,387 sequenced TOPMed samples that include 217 African Caribbean trios and 669 European American trios. We show that genotype error inflates the observed switch error rate and that the relative bias increases with sample size. For the UK Biobank White British trios, the observed switch error rate in the trio offspring is 2.4 times larger than the estimated true switch error rate (1.4 × 10-3 vs 5.8 × 10-4. We propose an alternate definition of phase error that counts two consecutive switch errors as a single error because back-to-back switch errors arise when a single heterozygote is incorrectly phased with respect to the surrounding heterozygotes. With this definition, we estimate that the average distance between phase errors is 64 megabases in the UK Biobank White British individuals.


Assuntos
Hereditariedade , Polimorfismo de Nucleotídeo Único , Viés , Genótipo , Haplótipos/genética , Humanos , Polimorfismo de Nucleotídeo Único/genética
3.
Brief Bioinform ; 25(1)2023 11 22.
Artigo em Inglês | MEDLINE | ID: mdl-38221906

RESUMO

Large-scale imputation reference panels are currently available and have contributed to efficient genome-wide association studies through genotype imputation. However, whether large-size multi-ancestry or small-size population-specific reference panels are the optimal choices for under-represented populations continues to be debated. We imputed genotypes of East Asian (180k Japanese) subjects using the Trans-Omics for Precision Medicine reference panel and found that the standard imputation quality metric (Rsq) overestimated dosage r2 (squared correlation between imputed dosage and true genotype) particularly in marginal-quality bins. Variance component analysis of Rsq revealed that the increased imputed-genotype certainty (dosages closer to 0, 1 or 2) caused upward bias, indicating some systemic bias in the imputation. Through systematic simulations using different template switching rates (θ value) in the hidden Markov model, we revealed that the lower θ value increased the imputed-genotype certainty and Rsq; however, dosage r2 was insensitive to the θ value, thereby causing a deviation. In simulated reference panels with different sizes and ancestral diversities, the θ value estimates from Minimac decreased with the size of a single ancestry and increased with the ancestral diversity. Thus, Rsq could be deviated from dosage r2 for a subpopulation in the multi-ancestry panel, and the deviation represents different imputed-dosage distributions. Finally, despite the impact of the θ value, distant ancestries in the reference panel contributed only a few additional variants passing a predefined Rsq threshold. We conclude that the θ value substantially impacts the imputed dosage and the imputation quality metric value.


Assuntos
Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Humanos , Frequência do Gene , Genótipo
4.
Am J Hum Genet ; 108(10): 1880-1890, 2021 10 07.
Artigo em Inglês | MEDLINE | ID: mdl-34478634

RESUMO

Haplotype phasing is the estimation of haplotypes from genotype data. We present a fast, accurate, and memory-efficient haplotype phasing method that scales to large-scale SNP array and sequence data. The method uses marker windowing and composite reference haplotypes to reduce memory usage and computation time. It incorporates a progressive phasing algorithm that identifies confidently phased heterozygotes in each iteration and fixes the phase of these heterozygotes in subsequent iterations. For data with many low-frequency variants, such as whole-genome sequence data, the method employs a two-stage phasing algorithm that phases high-frequency markers via progressive phasing in the first stage and phases low-frequency markers via genotype imputation in the second stage. This haplotype phasing method is implemented in the open-source Beagle 5.2 software package. We compare Beagle 5.2 and SHAPEIT 4.2.1 by using expanding subsets of 485,301 UK Biobank samples and 38,387 TOPMed samples. Both methods have very similar accuracy and computation time for UK Biobank SNP array data. However, for TOPMed sequence data, Beagle is more than 20 times faster than SHAPEIT, achieves similar accuracy, and scales to larger sample sizes.


Assuntos
Asma/genética , Fibrilação Atrial/genética , Interpretação Estatística de Dados , Genoma Humano , Haplótipos , Polimorfismo de Nucleotídeo Único , Software , Algoritmos , Feminino , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Masculino
5.
Am J Respir Crit Care Med ; 207(9): 1194-1202, 2023 05 01.
Artigo em Inglês | MEDLINE | ID: mdl-36602845

RESUMO

Rationale: Idiopathic pulmonary fibrosis (IPF) is a rare, irreversible, and progressive disease of the lungs. Common genetic variants, in addition to nongenetic factors, have been consistently associated with IPF. Rare variants identified by candidate gene, family-based, and exome studies have also been reported to associate with IPF. However, the extent to which rare variants, genome-wide, may contribute to the risk of IPF remains unknown. Objectives: We used whole-genome sequencing to investigate the role of rare variants, genome-wide, on IPF risk. Methods: As part of the Trans-Omics for Precision Medicine Program, we sequenced 2,180 cases of IPF. Association testing focused on the aggregated effect of rare variants (minor allele frequency ⩽0.01) within genes or regions. We also identified individual rare variants that are influential within genes and estimated the heritability of IPF on the basis of rare and common variants. Measurements and Main Results: Rare variants in both TERT and RTEL1 were significantly associated with IPF. A single rare variant in each of the TERT and RTEL1 genes was found to consistently influence the aggregated test statistics. There was no significant evidence of association with other previously reported rare variants. The SNP heritability of IPF was estimated to be 32% (SE = 3%). Conclusions: Rare variants within the TERT and RTEL1 genes and well-established common variants have the largest contribution to IPF risk overall. Efforts in risk profiling or the development of therapies for IPF that focus on TERT, RTEL1, common variants, and environmental risk factors are likely to have the largest impact on this complex disease.


Assuntos
Fibrose Pulmonar Idiopática , Humanos , Fibrose Pulmonar Idiopática/genética , Sequenciamento Completo do Genoma , Exoma
6.
Diabetologia ; 66(7): 1273-1288, 2023 07.
Artigo em Inglês | MEDLINE | ID: mdl-37148359

RESUMO

AIMS/HYPOTHESIS: The Latino population has been systematically underrepresented in large-scale genetic analyses, and previous studies have relied on the imputation of ungenotyped variants based on the 1000 Genomes (1000G) imputation panel, which results in suboptimal capture of low-frequency or Latino-enriched variants. The National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) released the largest multi-ancestry genotype reference panel representing a unique opportunity to analyse rare genetic variations in the Latino population. We hypothesise that a more comprehensive analysis of low/rare variation using the TOPMed panel would improve our knowledge of the genetics of type 2 diabetes in the Latino population. METHODS: We evaluated the TOPMed imputation performance using genotyping array and whole-exome sequence data in six Latino cohorts. To evaluate the ability of TOPMed imputation to increase the number of identified loci, we performed a Latino type 2 diabetes genome-wide association study (GWAS) meta-analysis in 8150 individuals with type 2 diabetes and 10,735 control individuals and replicated the results in six additional cohorts including whole-genome sequence data from the All of Us cohort. RESULTS: Compared with imputation with 1000G, the TOPMed panel improved the identification of rare and low-frequency variants. We identified 26 genome-wide significant signals including a novel variant (minor allele frequency 1.7%; OR 1.37, p=3.4 × 10-9). A Latino-tailored polygenic score constructed from our data and GWAS data from East Asian and European populations improved the prediction accuracy in a Latino target dataset, explaining up to 7.6% of the type 2 diabetes risk variance. CONCLUSIONS/INTERPRETATION: Our results demonstrate the utility of TOPMed imputation for identifying low-frequency variants in understudied populations, leading to the discovery of novel disease associations and the improvement of polygenic scores. DATA AVAILABILITY: Full summary statistics are available through the Common Metabolic Diseases Knowledge Portal ( https://t2d.hugeamp.org/downloads.html ) and through the GWAS catalog ( https://www.ebi.ac.uk/gwas/ , accession ID: GCST90255648). Polygenic score (PS) weights for each ancestry are available via the PGS catalog ( https://www.pgscatalog.org , publication ID: PGP000445, scores IDs: PGS003443, PGS003444 and PGS003445).


Assuntos
Diabetes Mellitus Tipo 2 , Saúde da População , Humanos , Estudo de Associação Genômica Ampla , Diabetes Mellitus Tipo 2/genética , Medicina de Precisão , Genótipo , Hispânico ou Latino/genética , Polimorfismo de Nucleotídeo Único/genética
7.
Am J Hum Genet ; 104(2): 260-274, 2019 02 07.
Artigo em Inglês | MEDLINE | ID: mdl-30639324

RESUMO

With advances in whole-genome sequencing (WGS) technology, more advanced statistical methods for testing genetic association with rare variants are being developed. Methods in which variants are grouped for analysis are also known as variant-set, gene-based, and aggregate unit tests. The burden test and sequence kernel association test (SKAT) are two widely used variant-set tests, which were originally developed for samples of unrelated individuals and later have been extended to family data with known pedigree structures. However, computationally efficient and powerful variant-set tests are needed to make analyses tractable in large-scale WGS studies with complex study samples. In this paper, we propose the variant-set mixed model association tests (SMMAT) for continuous and binary traits using the generalized linear mixed model framework. These tests can be applied to large-scale WGS studies involving samples with population structure and relatedness, such as in the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program. SMMATs share the same null model for different variant sets, and a virtue of this null model, which includes covariates only, is that it needs to be fit only once for all tests in each genome-wide analysis. Simulation studies show that all the proposed SMMATs correctly control type I error rates for both continuous and binary traits in the presence of population structure and relatedness. We also illustrate our tests in a real data example of analysis of plasma fibrinogen levels in the TOPMed program (n = 23,763), using the Analysis Commons, a cloud-based computing platform.


Assuntos
Estudos de Associação Genética , Modelos Genéticos , Sequenciamento Completo do Genoma , Cromossomos Humanos Par 4/genética , Computação em Nuvem , Feminino , Fibrinogênio/análise , Fibrinogênio/genética , Genética Populacional , Humanos , Masculino , National Heart, Lung, and Blood Institute (U.S.) , Medicina de Precisão , Projetos de Pesquisa , Fatores de Tempo , Estados Unidos
8.
Am J Hum Genet ; 105(5): 1057-1068, 2019 11 07.
Artigo em Inglês | MEDLINE | ID: mdl-31668705

RESUMO

Average arterial oxyhemoglobin saturation during sleep (AvSpO2S) is a clinically relevant measure of physiological stress associated with sleep-disordered breathing, and this measure predicts incident cardiovascular disease and mortality. Using high-depth whole-genome sequencing data from the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) project and focusing on genes with linkage evidence on chromosome 8p23,1,2 we observed that six coding and 51 noncoding variants in a gene that encodes the GTPase-activating protein (DLC1) are significantly associated with AvSpO2S and replicated in independent subjects. The combined DLC1 association evidence of discovery and replication cohorts reaches genome-wide significance in European Americans (p = 7.9 × 10-7). A risk score for these variants, built on an independent dataset, explains 0.97% of the AvSpO2S variation and contributes to the linkage evidence. The 51 noncoding variants are enriched in regulatory features in a human lung fibroblast cell line and contribute to DLC1 expression variation. Mendelian randomization analysis using these variants indicates a significant causal effect of DLC1 expression in fibroblasts on AvSpO2S. Multiple sources of information, including genetic variants, gene expression, and methylation, consistently suggest that DLC1 is a gene associated with AvSpO2S.


Assuntos
Cromossomos Humanos Par 8/genética , Proteínas Ativadoras de GTPase/genética , Oxiemoglobinas/genética , Sono/genética , Proteínas Supressoras de Tumor/genética , Ligação Genética/genética , Estudo de Associação Genômica Ampla , Humanos , Sequenciamento Completo do Genoma/métodos
9.
Am J Hum Genet ; 105(4): 706-718, 2019 10 03.
Artigo em Inglês | MEDLINE | ID: mdl-31564435

RESUMO

Hemoglobin A1c (HbA1c) is widely used to diagnose diabetes and assess glycemic control in individuals with diabetes. However, nonglycemic determinants, including genetic variation, may influence how accurately HbA1c reflects underlying glycemia. Analyzing the NHLBI Trans-Omics for Precision Medicine (TOPMed) sequence data in 10,338 individuals from five studies and four ancestries (6,158 Europeans, 3,123 African-Americans, 650 Hispanics, and 407 East Asians), we confirmed five regions associated with HbA1c (GCK in Europeans and African-Americans, HK1 in Europeans and Hispanics, FN3K and/or FN3KRP in Europeans, and G6PD in African-Americans and Hispanics) and we identified an African-ancestry-specific low-frequency variant (rs1039215 in HBG2 and HBE1, minor allele frequency (MAF) = 0.03). The most associated G6PD variant (rs1050828-T, p.Val98Met, MAF = 12% in African-Americans, MAF = 2% in Hispanics) lowered HbA1c (-0.88% in hemizygous males, -0.34% in heterozygous females) and explained 23% of HbA1c variance in African-Americans and 4% in Hispanics. Additionally, we identified a rare distinct G6PD coding variant (rs76723693, p.Leu353Pro, MAF = 0.5%; -0.98% in hemizygous males, -0.46% in heterozygous females) and detected significant association with HbA1c when aggregating rare missense variants in G6PD. We observed similar magnitude and direction of effects for rs1039215 (HBG2) and rs76723693 (G6PD) in the two largest TOPMed African American cohorts, and we replicated the rs76723693 association in the UK Biobank African-ancestry participants. These variants in G6PD and HBG2 were monomorphic in the European and Asian samples. African or Hispanic ancestry individuals carrying G6PD variants may be underdiagnosed for diabetes when screened with HbA1c. Thus, assessment of these variants should be considered for incorporation into precision medicine approaches for diabetes diagnosis.


Assuntos
Diabetes Mellitus/diagnóstico , Diabetes Mellitus/genética , Variação Genética , Hemoglobinas Glicadas/genética , Grupos Populacionais/genética , Medicina de Precisão , Estudos de Coortes , Feminino , Humanos , Masculino , Polimorfismo de Nucleotídeo Único
10.
Res Sq ; 2023 Feb 03.
Artigo em Inglês | MEDLINE | ID: mdl-36778386

RESUMO

Ever larger Structural Variant (SV) catalogs highlighting the diversity within and between populations help researchers better understand the links between SVs and disease. The identification of SVs from DNA sequence data is non-trivial and requires a balance between comprehensiveness and precision. Here we present a catalog of 355,667 SVs (59.34% novel) across autosomes and the X chromosome (50bp+) from 138,134 individuals in the diverse TOPMed consortium. We describe our methodologies for SV inference resulting in high variant quality and >90% allele concordance compared to long-read de-novo assemblies of well-characterized control samples. We demonstrate utility through significant associations between SVs and important various cardio-metabolic and hematologic traits. We have identified 690 SV hotspots and deserts and those that potentially impact the regulation of medically relevant genes. This catalog characterizes SVs across multiple populations and will serve as a valuable tool to understand the impact of SV on disease development and progression.

11.
bioRxiv ; 2023 Jan 25.
Artigo em Inglês | MEDLINE | ID: mdl-36747810

RESUMO

Ever larger Structural Variant (SV) catalogs highlighting the diversity within and between populations help researchers better understand the links between SVs and disease. The identification of SVs from DNA sequence data is non-trivial and requires a balance between comprehensiveness and precision. Here we present a catalog of 355,667 SVs (59.34% novel) across autosomes and the X chromosome (50bp+) from 138,134 individuals in the diverse TOPMed consortium. We describe our methodologies for SV inference resulting in high variant quality and >90% allele concordance compared to long-read de-novo assemblies of well-characterized control samples. We demonstrate utility through significant associations between SVs and important various cardio-metabolic and hemotologic traits. We have identified 690 SV hotspots and deserts and those that potentially impact the regulation of medically relevant genes. This catalog characterizes SVs across multiple populations and will serve as a valuable tool to understand the impact of SV on disease development and progression.

12.
Front Genet ; 14: 1278215, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-38162683

RESUMO

Introduction: Apparent treatment-resistant hypertension (aTRH) is characterized by the use of four or more antihypertensive (AHT) classes to achieve blood pressure (BP) control. In the current study, we conducted single-variant and gene-based analyses of aTRH among individuals from 12 Trans-Omics for Precision Medicine cohorts with whole-genome sequencing data. Methods: Cases were defined as individuals treated for hypertension (HTN) taking three different AHT classes, with average systolic BP ≥ 140 or diastolic BP ≥ 90 mmHg, or four or more medications regardless of BP (n = 1,705). A normotensive control group was defined as individuals with BP < 140/90 mmHg (n = 22,079), not on AHT medication. A second control group comprised individuals who were treatment responsive on one AHT medication with BP < 140/ 90 mmHg (n = 5,424). Logistic regression with kinship adjustment using the Scalable and Accurate Implementation of Generalized mixed models (SAIGE) was performed, adjusting for age, sex, and genetic ancestry. We assessed variants using SKAT-O in rare-variant analyses. Single-variant and gene-based tests were conducted in a pooled multi-ethnicity stratum, as well as self-reported ethnic/racial strata (European and African American). Results: One variant in the known HTN locus, KCNK3, was a top finding in the multi-ethnic analysis (p = 8.23E-07) for the normotensive control group [rs12476527, odds ratio (95% confidence interval) = 0.80 (0.74-0.88)]. This variant was replicated in the Vanderbilt University Medical Center's DNA repository data. Aggregate gene-based signals included the genes AGTPBP, MYL4, PDCD4, BBS9, ERG, and IER3. Discussion: Additional work validating these loci in larger, more diverse populations, is warranted to determine whether these regions influence the pathobiology of aTRH.

13.
Genetics ; 224(4)2023 08 09.
Artigo em Inglês | MEDLINE | ID: mdl-37348055

RESUMO

Exonic variants present some of the strongest links between genotype and phenotype. However, these variants can have significant inter-individual pathogenicity differences, known as variable penetrance. In this study, we propose a model where genetically controlled mRNA splicing modulates the pathogenicity of exonic variants. By first cataloging exonic inclusion from RNA-sequencing data in GTEx V8, we find that pathogenic alleles are depleted on highly included exons. Using a large-scale phased whole genome sequencing data from the TOPMed consortium, we observe that this effect may be driven by common splice-regulatory genetic variants, and that natural selection acts on haplotype configurations that reduce the transcript inclusion of putatively pathogenic variants, especially when limiting to haploinsufficient genes. Finally, we test if this effect may be relevant for autism risk using families from the Simons Simplex Collection, but find that splicing of pathogenic alleles has a penetrance reducing effect here as well. Overall, our results indicate that common splice-regulatory variants may play a role in reducing the damaging effects of rare exonic variants.


Assuntos
Sítios de Splice de RNA , Splicing de RNA , Penetrância , Éxons , Genótipo , RNA Mensageiro/genética , Processamento Alternativo
14.
Cell Genom ; 3(6): 100332, 2023 Jun 14.
Artigo em Inglês | MEDLINE | ID: mdl-37388906

RESUMO

Based on evaluations of imputation performed on a genotype dataset consisting of about 11,000 sub-Saharan African (SSA) participants, we show Trans-Omics for Precision Medicine (TOPMed) and the African Genome Resource (AGR) to be currently the best panels for imputing SSA datasets. We report notable differences in the number of single-nucleotide polymorphisms (SNPs) that are imputed by different panels in datasets from East, West, and South Africa. Comparisons with a subset of 95 SSA high-coverage whole-genome sequences (WGSs) show that despite being about 20-fold smaller, the AGR imputed dataset has higher concordance with the WGSs. Moreover, the level of concordance between imputed and WGS datasets was strongly influenced by the extent of Khoe-San ancestry in a genome, highlighting the need for integration of not only geographically but also ancestrally diverse WGS data in reference panels for further improvement in imputation of SSA datasets. Approaches that integrate imputed data from different panels could also lead to better imputation.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA