Results 1 - 20 of 279
1.
Hum Mol Genet ; 2024 May 15.
Article in English | MEDLINE | ID: mdl-38747556

ABSTRACT

Inflammation biomarkers can provide valuable insight into the role of inflammatory processes in many diseases and conditions. Sequencing-based analyses of such biomarkers can also serve as an exemplar of the genetic architecture of quantitative traits. To evaluate the biological insight that can be provided by a multi-ancestry, whole-genome-based association study, we performed a comprehensive analysis of 21 inflammation biomarkers from up to 38,465 individuals with whole-genome sequencing from the Trans-Omics for Precision Medicine (TOPMed) program (with varying sample size by trait, where the minimum sample size was n = 737 for MMP-1). We identified 22 distinct single-variant associations across 6 traits (E-selectin, intercellular adhesion molecule 1, interleukin-6, lipoprotein-associated phospholipase A2 activity and mass, and P-selectin) that remained significant after conditioning on previously identified associations for these inflammatory biomarkers. We further expanded upon known biomarker associations by pairing the single-variant analysis with a rare variant set-based analysis that identified 19 significant rare variant set-based associations with 5 traits. These signals were distinct from both significant single variant association signals within TOPMed and genetic signals observed in prior studies, demonstrating the complementary value of performing both single and rare variant analyses when analyzing quantitative traits. We also confirm several previously reported signals from semi-quantitative proteomics platforms. Many of these signals demonstrate the extensive allelic heterogeneity and ancestry-differentiated variant-trait associations common for inflammation biomarkers, a characteristic we hypothesize will be increasingly observed with well-powered, large-scale analyses of complex traits.

2.
Nature ; 584(7821): 420-424, 2020 08.
Article in English | MEDLINE | ID: mdl-32674112

ABSTRACT

As countries in the world review interventions for containing the pandemic of coronavirus disease 2019 (COVID-19), important lessons can be drawn from the study of the full transmission dynamics of its causative agent, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), in Wuhan (China), where vigorous non-pharmaceutical interventions have suppressed the local outbreak of this disease [1]. Here we use a modelling approach to reconstruct the full-spectrum dynamics of COVID-19 in Wuhan between 1 January and 8 March 2020 across 5 periods defined by events and interventions, on the basis of 32,583 laboratory-confirmed cases [1]. Accounting for presymptomatic infectiousness [2], time-varying ascertainment rates, transmission rates and population movements [3], we identify two key features of the outbreak: high covertness and high transmissibility. We estimate that 87% (lower bound, 53%) of the infections before 8 March 2020 were unascertained (potentially including asymptomatic and mildly symptomatic individuals), and a basic reproduction number (R0) of 3.54 (95% credible interval 3.40-3.67) in the early outbreak, much higher than that of severe acute respiratory syndrome (SARS) and Middle East respiratory syndrome (MERS) [4,5]. We observe that multipronged interventions had considerable positive effects on controlling the outbreak, decreasing the reproduction number to 0.28 (95% credible interval 0.23-0.33) and, by projection, reducing the total infections in Wuhan by 96.0% as of 8 March 2020. We also explore the probability of resurgence following the lifting of all interventions after 14 consecutive days of no ascertained infections; we estimate this probability at 0.32 and 0.06 on the basis of models with 87% and 53% unascertained cases, respectively, highlighting the risk posed by substantial covert infections when changing control measures. These results have important implications when considering strategies of continuing surveillance and interventions to eventually contain outbreaks of COVID-19.


Subject(s)
Coronavirus Infections/transmission; Models, Biological; Pneumonia, Viral/transmission; COVID-19; China/epidemiology; Coronavirus Infections/epidemiology; Coronavirus Infections/prevention & control; Epidemiological Monitoring; Female; Humans; Male; Pandemics/prevention & control; Pneumonia, Viral/epidemiology; Pneumonia, Viral/prevention & control; Reproducibility of Results; Stochastic Processes
3.
Proc Natl Acad Sci U S A ; 120(27): e2216248120, 2023 Jul 04.
Article in English | MEDLINE | ID: mdl-37368928

ABSTRACT

The US global leadership in science and technology has greatly benefitted from immigrants from other countries, most notably from China in the recent decades. However, feeling the pressure of potential federal investigations since the 2018 launch of the China Initiative, scientists of Chinese descent in the United States now face higher incentives to leave the United States and lower incentives to apply for federal grants. Analyzing data pertaining to institutional affiliations of more than 200 million scientific papers, we find a steady increase in the return migration of scientists of Chinese descent from the United States to China. We also conducted a survey of scientists of Chinese descent employed by US universities in tenured or tenure-track positions (n = 1,304), with results revealing general feelings of fear and anxiety that lead them to consider leaving the United States and/or stop applying for federal grants. If the situation is not corrected, American science will likely suffer the loss of scientific talent to China and other countries.

4.
Am J Hum Genet ; 109(3): 446-456, 2022 03 03.
Article in English | MEDLINE | ID: mdl-35216679

ABSTRACT

Attempts to identify and prioritize functional DNA elements in coding and non-coding regions, particularly through use of in silico functional annotation data, continue to increase in popularity. However, specific functional roles can vary widely from one variant to another, making it challenging to summarize different aspects of variant function with a one-dimensional rating. Here we propose multi-dimensional annotation-class integrative estimation (MACIE), an unsupervised multivariate mixed-model framework capable of integrating annotations of diverse origin to assess multi-dimensional functional roles for both coding and non-coding variants. Unlike existing one-dimensional scoring methods, MACIE views variant functionality as a composite attribute encompassing multiple characteristics and estimates the joint posterior functional probabilities of each genomic position. This estimate offers more comprehensive and interpretable information in the presence of multiple aspects of functionality. Applied to a variety of independent coding and non-coding datasets, MACIE demonstrates powerful and robust performance in discriminating between functional and non-functional variants. We also show an application of MACIE to fine-mapping and heritability enrichment analysis by using the lipids GWAS summary statistics data from the European Network for Genetic and Genomic Epidemiology Consortium.


Subject(s)
Genome, Human; Genome-Wide Association Study; Genome, Human/genetics; Genome-Wide Association Study/methods; Genomics; Humans; Molecular Sequence Annotation; Polymorphism, Single Nucleotide/genetics; Probability
5.
Nucleic Acids Res ; 51(D1): D1300-D1311, 2023 01 06.
Article in English | MEDLINE | ID: mdl-36350676

ABSTRACT

Large biobank-scale whole genome sequencing (WGS) studies are rapidly identifying a multitude of coding and non-coding variants. They provide an unprecedented resource for illuminating the genetic basis of human diseases. Variant functional annotations play a critical role in WGS analysis, result interpretation, and prioritization of disease- or trait-associated causal variants. Existing functional annotation databases have limited scope to perform online queries and functionally annotate the genotype data of large biobank-scale WGS studies. We develop the Functional Annotation of Variants Online Resources (FAVOR) to meet these pressing needs. FAVOR provides a comprehensive multi-faceted variant functional annotation online portal that summarizes and visualizes findings of all possible nine billion single nucleotide variants (SNVs) across the genome. It allows for rapid variant-, gene- and region-level queries of variant functional annotations. FAVOR integrates variant functional information from multiple sources to describe the functional characteristics of variants and facilitates prioritizing plausible causal variants influencing human phenotypes. Furthermore, we provide a scalable annotation tool, FAVORannotator, to functionally annotate large-scale WGS studies and efficiently store the genotype and their variant functional annotation data in a single file using the annotated Genomic Data Structure (aGDS) format, making downstream analysis more convenient. FAVOR and FAVORannotator are available at https://favor.genohub.org.


Subject(s)
Genome, Human; Software; Humans; Molecular Sequence Annotation; Genomics; Genotype; Genetic Variation
6.
PLoS Genet ; 17(1): e1009315, 2021 01.
Article in English | MEDLINE | ID: mdl-33476339

ABSTRACT

Inference of relationships from whole-genome genetic data of a cohort is a crucial prerequisite for genome-wide association studies. Typically, relationships are inferred by computing the kinship coefficients (ϕ) and the genome-wide probability of zero IBD sharing (π0) among all pairs of individuals. Current leading methods are based on pairwise comparisons, which may not scale up to very large cohorts (e.g., sample size >1 million). Here, we propose an efficient relationship inference method, RAFFI. RAFFI leverages the efficient RaPID method to call IBD segments first, then estimate the ϕ and π0 from detected IBD segments. This inference is achieved by a data-driven approach that adjusts the estimation based on phasing quality and genotyping quality. Using simulations, we showed that RAFFI is robust against phasing/genotyping errors, admix events, and varying marker densities, and achieves higher accuracy compared to KING, the current leading method, especially for more distant relatives. When applied to the phased UK Biobank data with ~500K individuals, RAFFI is approximately 18 times faster than KING. We expect RAFFI will offer fast and accurate relatedness inference for even larger cohorts.


Subject(s)
Genome-Wide Association Study/statistics & numerical data; Genotyping Techniques/statistics & numerical data; Models, Genetic; Biological Specimen Banks; Genome, Human/genetics; Haplotypes/genetics; Humans; Pedigree; Polymorphism, Single Nucleotide/genetics
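
The abstract of entry 6 describes estimating the kinship coefficient (ϕ) and the probability of zero IBD sharing (π0) from detected IBD segments. As a minimal sketch of that idea, the textbook relation ϕ = k1/4 + k2/2 with π0 = 1 - k1 - k2 can be written as below. This is not the RAFFI estimator, which according to the abstract also adjusts for phasing and genotyping quality; the function name and its inputs (total IBD1 and IBD2 lengths and a genome length in the same units) are hypothetical.

```python
def kinship_from_ibd(ibd1_length, ibd2_length, genome_length):
    """Estimate (phi, pi0) from total IBD1 and IBD2 segment lengths.

    Hypothetical helper: uses the standard relation between IBD-sharing
    proportions and kinship, with no quality adjustment.
    """
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    k1 = ibd1_length / genome_length  # fraction of genome shared IBD1
    k2 = ibd2_length / genome_length  # fraction of genome shared IBD2
    pi0 = 1.0 - k1 - k2               # fraction with zero IBD sharing
    phi = k1 / 4.0 + k2 / 2.0         # kinship coefficient
    return phi, pi0


# Expected sharing for full siblings: half the genome IBD1, a quarter IBD2.
phi, pi0 = kinship_from_ibd(50.0, 25.0, 100.0)
print(phi, pi0)
```

Under these assumptions a parent-child pair (whole genome IBD1, none IBD2) and an expected full-sibling pair both give ϕ = 0.25, and they are separated by π0 (0 versus 0.25), which is why both quantities are reported.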
7.
Am J Hum Genet ; 106(1): 112-120, 2020 01 02.
Article in English | MEDLINE | ID: mdl-31883642

ABSTRACT

Whole-genome sequencing (WGS) can improve assessment of low-frequency and rare variants, particularly in non-European populations that have been underrepresented in existing genomic studies. The genetic determinants of C-reactive protein (CRP), a biomarker of chronic inflammation, have been extensively studied, with existing genome-wide association studies (GWASs) conducted in >200,000 individuals of European ancestry. In order to discover novel loci associated with CRP levels, we examined a multi-ancestry population (n = 23,279) with WGS (∼38× coverage) from the Trans-Omics for Precision Medicine (TOPMed) program. We found evidence for eight distinct associations at the CRP locus, including two variants that have not been identified previously (rs11265259 and rs181704186), both of which are non-coding and more common in individuals of African ancestry (∼10% and ∼1% minor allele frequency, respectively, and rare or monomorphic in 1000 Genomes populations of East Asian, South Asian, and European ancestry). We show that the minor (G) allele of rs181704186 is associated with lower CRP levels and decreased transcriptional activity and protein binding in vitro, providing a plausible molecular mechanism for this African ancestry-specific signal. The individuals homozygous for rs181704186-G have a mean CRP level of 0.23 mg/L, in contrast to individuals heterozygous for rs181704186 with mean CRP of 2.97 mg/L and major allele homozygotes with mean CRP of 4.11 mg/L. This study demonstrates the utility of WGS in multi-ethnic populations to drive discovery of complex trait associations of large effect and to identify functional alleles in noncoding regulatory regions.


Subject(s)
Asian People/genetics; Black People/genetics; C-Reactive Protein/genetics; Genetic Predisposition to Disease; Polymorphism, Single Nucleotide; White People/genetics; Whole Genome Sequencing/methods; Cohort Studies; Gene Frequency; Genome-Wide Association Study; Humans; Linkage Disequilibrium
8.
Biostatistics ; 2022 Aug 24.
Article in English | MEDLINE | ID: mdl-36000269

ABSTRACT

Set-based association tests are widely popular in genetic association settings for their ability to aggregate weak signals and reduce multiple testing burdens. In particular, a class of set-based tests including the Higher Criticism, Berk-Jones, and other statistics have recently been popularized for reaching a so-called detection boundary when signals are rare and weak. Such tests have been applied in two subtly different settings: (a) associating a genetic variant set with a single phenotype and (b) associating a single genetic variant with a phenotype set. A significant issue in practice is the choice of test, especially when deciding between innovated and generalized type methods for detection boundary tests. Conflicting guidance is present in the literature. This work describes how correlation structures generate marked differences in relative operating characteristics for settings (a) and (b). The implications for study design are significant. We also develop novel power bounds that facilitate the aforementioned calculations and allow for analysis of individual testing settings. In more concrete terms, our investigation is motivated by translational expression quantitative trait loci (eQTL) studies in lung cancer. These studies involve both testing for groups of variants associated with a single gene expression (multiple explanatory factors) and testing whether a single variant is associated with a group of gene expressions (multiple outcomes). Results are supported by a collection of simulation studies and illustrated through lung cancer eQTL examples.
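
The abstract of entry 8 refers to the Higher Criticism statistic as one of the detection-boundary tests. As a rough illustration only, the classical (Donoho-Jin style) Higher Criticism statistic for a set of independent p-values can be sketched as follows. This is the basic form, not the innovated or generalized variants the abstract compares, and it ignores correlation between tests; the function name is hypothetical, and taking the maximum over all ordered p-values (rather than a truncated range) is an assumption of this sketch.

```python
import math


def higher_criticism(p_values):
    """Classical Higher Criticism statistic for independent p-values.

    Hypothetical helper: compares the empirical fraction i/n of p-values at
    or below the i-th smallest p-value with its null expectation, scaled by
    the binomial standard deviation, and returns the largest deviation.
    """
    n = len(p_values)
    if n == 0:
        raise ValueError("need at least one p-value")
    best = -math.inf
    for i, p in enumerate(sorted(p_values), start=1):
        if 0.0 < p < 1.0:  # skip boundary values that would divide by zero
            z = math.sqrt(n) * (i / n - p) / math.sqrt(p * (1.0 - p))
            best = max(best, z)
    return best


null_like = [0.1, 0.3, 0.5, 0.7, 0.9]
with_signal = [0.001, 0.3, 0.5, 0.7, 0.9]
print(higher_criticism(null_like), higher_criticism(with_signal))
```

A single very small p-value in an otherwise unremarkable set drives the statistic up sharply, which is the rare-and-weak-signal behavior the abstract alludes to.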

9.
Bioinformatics ; 38(9): 2661-2663, 2022 04 28.
Article in English | MEDLINE | ID: mdl-35244140

ABSTRACT

SUMMARY: Amidst the continuing spread of coronavirus disease-19 (COVID-19), real-time data analysis and visualization remain critical for the general public to track the pandemic's impact and to inform policy making by officials. Multiple metrics permit the evaluation of the spread, infection and mortality of infectious diseases. For example, numbers of new cases and deaths provide easily interpretable measures of absolute impact within a given population and time frame, while the effective reproduction rate provides an epidemiological measure of the rate of spread. By evaluating multiple metrics concurrently, users can leverage complementary insights into the impact and current state of the pandemic when formulating prevention and safety plans for themselves and others. We describe COVID-19 Spread Mapper, a unified framework for estimating and quantifying the uncertainty in the smoothed daily effective reproduction number, case rate and death rate in a region using log-linear models. We apply this framework to characterize COVID-19 impact at multiple geographic resolutions, including by US county and state as well as by country, demonstrating the variation across resolutions and the need for harmonized efforts to control the pandemic. We provide an open-source online dashboard for real-time analysis and visualization of multiple key metrics, which are critical to evaluate the impact of COVID-19 and make informed policy decisions. AVAILABILITY AND IMPLEMENTATION: Our model and tool are publicly available as implemented in R and hosted at https://metrics.covid19-analysis.org/. The source code is freely available from https://github.com/lin-lab/COVID19-Rt and https://github.com/lin-lab/COVID19-Viz. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
COVID-19; Humans; COVID-19/epidemiology; SARS-CoV-2; Pandemics/prevention & control; Software
10.
Bioinformatics ; 38(11): 3116-3117, 2022 05 26.
Article in English | MEDLINE | ID: mdl-35441669

ABSTRACT

SUMMARY: We developed the variant-Set Test for Association using Annotation infoRmation (STAAR) workflow description language (WDL) workflow to facilitate the analysis of rare variants in whole genome sequencing association studies. The open-access STAAR workflow written in the WDL allows a user to perform rare variant testing for both gene-centric and genetic region approaches, enabling genome-wide, candidate and conditional analyses. It incorporates functional annotations into the workflow as introduced in the STAAR method in order to boost the rare variant analysis power. This tool was specifically developed and optimized to be implemented on cloud-based platforms such as BioData Catalyst Powered by Terra. It provides easy-to-use functionality for rare variant analysis that can be incorporated into an exhaustive whole genome sequencing analysis pipeline. AVAILABILITY AND IMPLEMENTATION: The workflow is freely available from https://dockstore.org/workflows/github.com/sheilagaynor/STAAR_workflow. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Cloud Computing; Software; Workflow; Genome; Genome-Wide Association Study
11.
Biometrics ; 79(2): 1472-1484, 2023 06.
Article in English | MEDLINE | ID: mdl-35218565

ABSTRACT

Sample sizes vary substantially across tissues in the Genotype-Tissue Expression (GTEx) project, where considerably fewer samples are available from certain inaccessible tissues, such as the substantia nigra (SSN), than from accessible tissues, such as blood. This severely limits power for identifying tissue-specific expression quantitative trait loci (eQTL) in undersampled tissues. Here we propose Surrogate Phenotype Regression Analysis (Spray) for leveraging information from a correlated surrogate outcome (eg, expression in blood) to improve inference on a partially missing target outcome (eg, expression in SSN). Rather than regarding the surrogate outcome as a proxy for the target outcome, Spray jointly models the target and surrogate outcomes within a bivariate regression framework. Unobserved values of either outcome are treated as missing data. We describe and implement an expectation conditional maximization algorithm for performing estimation in the presence of bilateral outcome missingness. Spray estimates the same association parameter estimated by standard eQTL mapping and controls the type I error even when the target and surrogate outcomes are truly uncorrelated. We demonstrate analytically and empirically, using simulations and GTEx data, that in comparison with marginally modeling the target outcome, jointly modeling the target and surrogate outcomes increases estimation precision and improves power.


Subject(s)
Algorithms; Quantitative Trait Loci; Phenotype; Regression Analysis
12.
Eur Radiol ; 33(10): 7284-7293, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37099174

ABSTRACT

OBJECTIVES: The study investigated tumor burden dynamics on computed tomography (CT) scans in patients with advanced non-small-cell lung cancer (NSCLC) during first-line pembrolizumab plus chemotherapy, to provide imaging markers for overall survival (OS). METHODS: The study included 133 patients treated with first-line pembrolizumab plus platinum-doublet chemotherapy. Serial CT scans during therapy were assessed for tumor burden dynamics, which were studied for their association with OS. RESULTS: There were 67 responders, with an overall response rate of 50%. The tumor burden change at the best overall response ranged from -100.0% to +132.1% (median of -30%). Higher response rates were associated with younger age (p < 0.001) and higher programmed death-ligand 1 (PD-L1) expression levels (p = 0.01). Eighty-three patients (62%) showed tumor burden below the baseline burden throughout therapy. Using an 8-week landmark analysis, OS was longer in patients with tumor burden below the baseline burden in the first 8 weeks than in those who experienced ≥ 0% increase (median OS: 26.8 vs. 7.6 months, hazard ratio (HR): 0.36, p < 0.001). Tumor burden remaining below baseline throughout therapy was associated with significantly reduced hazards of death (HR: 0.72, p = 0.03) in the extended Cox models, after adjusting for other clinical variables. Pseudoprogression was noted in only one patient (0.8%). CONCLUSIONS: Tumor burden staying below the baseline burden throughout therapy was predictive of prolonged overall survival in patients with advanced NSCLC treated with first-line pembrolizumab plus chemotherapy, and may be used as a practical marker for therapeutic decisions in this widely used combination regimen.
CLINICAL RELEVANCE STATEMENT: The analysis of tumor burden dynamics on serial CT scans in reference to the baseline burden can provide an additional objective guide for treatment decision making in patients treated with first-line pembrolizumab plus chemotherapy for their advanced NSCLC.
KEY POINTS:
• Tumor burden remaining below baseline burden during therapy predicted longer survival during first-line pembrolizumab plus chemotherapy.
• Pseudoprogression was noted in 0.8%, demonstrating the rarity of the phenomenon.
• Tumor burden dynamics may serve as an objective marker for treatment benefit to guide treatment decisions during first-line pembrolizumab plus chemotherapy.


Subject(s)
Carcinoma, Non-Small-Cell Lung; Lung Neoplasms; Humans; Carcinoma, Non-Small-Cell Lung/diagnostic imaging; Carcinoma, Non-Small-Cell Lung/drug therapy; Carcinoma, Non-Small-Cell Lung/metabolism; Lung Neoplasms/diagnostic imaging; Lung Neoplasms/drug therapy; Lung Neoplasms/metabolism; Antibodies, Monoclonal, Humanized/therapeutic use; Antineoplastic Combined Chemotherapy Protocols/therapeutic use
13.
Am J Respir Crit Care Med ; 206(10): 1271-1280, 2022 11 15.
Article in English | MEDLINE | ID: mdl-35822943

ABSTRACT

Rationale: Obstructive sleep apnea (OSA) is a common disorder associated with increased risk for cardiovascular disease, diabetes, and premature mortality. There is strong clinical and epidemiologic evidence supporting the importance of genetic factors influencing OSA but limited data implicating specific genes. Objectives: To search for rare variants contributing to OSA severity. Methods: Leveraging high-depth genomic sequencing data from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program and imputed genotype data from multiple population-based studies, we performed linkage analysis in the CFS (Cleveland Family Study), followed by multistage gene-based association analyses in independent cohorts for apnea-hypopnea index (AHI) in a total of 7,708 individuals of European ancestry. Measurements and Main Results: Linkage analysis in the CFS identified a suggestive linkage peak on chromosome 7q31 (LOD = 2.31). Gene-based analysis identified 21 noncoding rare variants in CAV1 (Caveolin-1) associated with lower AHI after accounting for multiple comparisons (P = 7.4 × 10⁻⁸). These noncoding variants together significantly contributed to the linkage evidence (P < 10⁻³). Follow-up analysis revealed significant associations between these variants and increased CAV1 expression, and increased CAV1 expression in peripheral monocytes was associated with lower AHI (P = 0.024) and higher minimum overnight oxygen saturation (P = 0.007). Conclusions: Rare variants in CAV1, a membrane-scaffolding protein essential in multiple cellular and metabolic functions, are associated with higher CAV1 gene expression and lower OSA severity, suggesting a novel target for modulating OSA severity.


Subject(s)
Sleep Apnea, Obstructive; Humans; Caveolin 1/genetics; Sleep Apnea, Obstructive/genetics; Sequence Analysis, DNA; High-Throughput Nucleotide Sequencing
14.
Am J Hum Genet ; 104(3): 410-421, 2019 03 07.
Article in English | MEDLINE | ID: mdl-30849328

ABSTRACT

Set-based analysis that jointly tests the association of variants in a group has emerged as a popular tool for analyzing rare and low-frequency variants in sequencing studies. The existing set-based tests can suffer significant power loss when only a small proportion of variants are causal, and their powers can be sensitive to the number, effect sizes, and effect directions of the causal variants and the choices of weights. Here we propose an aggregated Cauchy association test (ACAT), a general, powerful, and computationally efficient p value combination method for boosting power in sequencing studies. First, by combining variant-level p values, we use ACAT to construct a set-based test (ACAT-V) that is particularly powerful in the presence of only a small number of causal variants in a variant set. Second, by combining different variant-set-level p values, we use ACAT to construct an omnibus test (ACAT-O) that combines the strength of multiple complementary set-based tests, including the burden test, sequence kernel association test (SKAT), and ACAT-V. Through analysis of extensively simulated data and the whole-genome sequencing data from the Atherosclerosis Risk in Communities (ARIC) study, we demonstrate that ACAT-V complements the SKAT and the burden test, and that ACAT-O has a substantially more robust and higher power than those of the alternative tests.


Subject(s)
Algorithms; Disease/genetics; Genetic Association Studies/methods; Genetic Variation; Genome, Human; Models, Genetic; Sequence Analysis, DNA/methods; Computer Simulation; Data Interpretation, Statistical; Humans
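
The p value combination described in the ACAT abstract (entry 14) rests on the Cauchy combination rule: each p value is mapped to a Cauchy variate, the variates are averaged with weights, and the result is converted back to a p value through the Cauchy tail. The sketch below illustrates that rule only, assuming equal weights by default and p values strictly between 0 and 1. The function name is hypothetical, and the sketch omits the numerical safeguards for extremely small p values that a production implementation would need.

```python
import math


def cauchy_combine(p_values, weights=None):
    """Combine p-values with the Cauchy combination rule.

    Hypothetical helper: weights default to equal and are normalized to
    sum to one; p-values are assumed to lie strictly inside (0, 1).
    """
    if not p_values:
        raise ValueError("need at least one p-value")
    if weights is None:
        weights = [1.0] * len(p_values)
    total = sum(weights)
    # tan((0.5 - p) * pi) is a standard Cauchy variate when p is uniform.
    stat = sum(w * math.tan((0.5 - p) * math.pi)
               for w, p in zip(weights, p_values)) / total
    # Upper-tail probability of the standard Cauchy distribution.
    return 0.5 - math.atan(stat) / math.pi


print(cauchy_combine([0.01, 0.5, 0.5]))
```

With one small p value among otherwise null results, the combined p value stays small (between 0.01 and 0.05 in this example), consistent with the abstract's claim that the approach is suited to sets with only a few causal variants.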
15.
Am J Hum Genet ; 104(5): 802-814, 2019 05 02.
Article in English | MEDLINE | ID: mdl-30982610

ABSTRACT

Whole-genome sequencing (WGS) studies are being widely conducted in order to identify rare variants associated with human diseases and disease-related traits. Classical single-marker association analyses for rare variants have limited power, and variant-set-based analyses are commonly used by researchers for analyzing rare variants. However, existing variant-set-based approaches need to pre-specify genetic regions for analysis; hence, they are not directly applicable to WGS data because of the large number of intergenic and intron regions that consist of a massive number of non-coding variants. The commonly used sliding-window method requires the pre-specification of fixed window sizes, which are often unknown a priori, are difficult to specify in practice, and are subject to limitations given that the sizes of genetic-association regions are likely to vary across the genome and phenotypes. We propose a computationally efficient and dynamic scan-statistic method (Scan the Genome [SCANG]) for analyzing WGS data; this method flexibly detects the sizes and the locations of rare-variant association regions without the need to specify a prior, fixed window size. The proposed method controls for the genome-wise type I error rate and accounts for the linkage disequilibrium among genetic variants. It allows the detected sizes of rare-variant association regions to vary across the genome. Through extensive simulation studies that consider a wide variety of scenarios, we show that SCANG substantially outperforms several alternative methods for detecting rare-variant associations while controlling for the genome-wise type I error rates. We illustrate SCANG by analyzing the WGS lipids data from the Atherosclerosis Risk in Communities (ARIC) study.


Subject(s)
Algorithms; Computational Biology/methods; Genetic Variation; Genome, Human; Genome-Wide Association Study; Whole Genome Sequencing/methods; Humans; Linkage Disequilibrium; Models, Genetic
16.
Am J Hum Genet ; 105(5): 1057-1068, 2019 11 07.
Article in English | MEDLINE | ID: mdl-31668705

ABSTRACT

Average arterial oxyhemoglobin saturation during sleep (AvSpO2S) is a clinically relevant measure of physiological stress associated with sleep-disordered breathing, and this measure predicts incident cardiovascular disease and mortality. Using high-depth whole-genome sequencing data from the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) project and focusing on genes with linkage evidence on chromosome 8p23 [1,2], we observed that six coding and 51 noncoding variants in a gene that encodes the GTPase-activating protein (DLC1) are significantly associated with AvSpO2S and replicated in independent subjects. The combined DLC1 association evidence of discovery and replication cohorts reaches genome-wide significance in European Americans (p = 7.9 × 10⁻⁷). A risk score for these variants, built on an independent dataset, explains 0.97% of the AvSpO2S variation and contributes to the linkage evidence. The 51 noncoding variants are enriched in regulatory features in a human lung fibroblast cell line and contribute to DLC1 expression variation. Mendelian randomization analysis using these variants indicates a significant causal effect of DLC1 expression in fibroblasts on AvSpO2S. Multiple sources of information, including genetic variants, gene expression, and methylation, consistently suggest that DLC1 is a gene associated with AvSpO2S.


Subject(s)
Chromosomes, Human, Pair 8/genetics; GTPase-Activating Proteins/genetics; Oxyhemoglobins/genetics; Sleep/genetics; Tumor Suppressor Proteins/genetics; Genetic Linkage/genetics; Genome-Wide Association Study; Humans; Whole Genome Sequencing/methods
17.
Am J Hum Genet ; 104(2): 260-274, 2019 02 07.
Article in English | MEDLINE | ID: mdl-30639324

ABSTRACT

With advances in whole-genome sequencing (WGS) technology, more advanced statistical methods for testing genetic association with rare variants are being developed. Methods in which variants are grouped for analysis are also known as variant-set, gene-based, and aggregate unit tests. The burden test and sequence kernel association test (SKAT) are two widely used variant-set tests, which were originally developed for samples of unrelated individuals and later have been extended to family data with known pedigree structures. However, computationally efficient and powerful variant-set tests are needed to make analyses tractable in large-scale WGS studies with complex study samples. In this paper, we propose the variant-set mixed model association tests (SMMAT) for continuous and binary traits using the generalized linear mixed model framework. These tests can be applied to large-scale WGS studies involving samples with population structure and relatedness, such as in the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program. SMMATs share the same null model for different variant sets, and a virtue of this null model, which includes covariates only, is that it needs to be fit only once for all tests in each genome-wide analysis. Simulation studies show that all the proposed SMMATs correctly control type I error rates for both continuous and binary traits in the presence of population structure and relatedness. We also illustrate our tests in a real data example of analysis of plasma fibrinogen levels in the TOPMed program (n = 23,763), using the Analysis Commons, a cloud-based computing platform.


Subject(s)
Genetic Association Studies , Models, Genetic , Whole Genome Sequencing , Chromosomes, Human, Pair 4/genetics , Cloud Computing , Female , Fibrinogen/analysis , Fibrinogen/genetics , Genetics, Population , Humans , Male , National Heart, Lung, and Blood Institute (U.S.) , Precision Medicine , Research Design , Time Factors , United States
18.
PLoS Genet ; 15(3): e1007530, 2019 03.
Article in English | MEDLINE | ID: mdl-30875371

ABSTRACT

A common complementary strategy in Genome-Wide Association Studies (GWAS) is to perform Gene Set Analysis (GSA), which tests for the association between one phenotype of interest and an entire set of Single Nucleotide Polymorphisms (SNPs) residing in selected genes. While there exist many tools for performing GSA, popular methods often include a number of ad-hoc steps that are difficult to justify statistically, provide complicated interpretations based on permutation inference, and demonstrate poor operating characteristics. Additionally, the lack of gold standard gene set lists can produce misleading results and create difficulties in comparing analyses even across the same phenotype. We introduce the Generalized Berk-Jones (GBJ) statistic for GSA, a permutation-free parametric framework that offers asymptotic power guarantees in certain set-based testing settings. To adjust for confounding introduced by different gene set lists, we further develop a GBJ step-down inference technique that can discriminate between gene sets driven to significance by single genes and those demonstrating group-level effects. We compare GBJ to popular alternatives through simulation and re-analysis of summary statistics from a large breast cancer GWAS, and we show how GBJ can increase power by incorporating information from multiple signals in the same gene. In addition, we illustrate how breast cancer pathway analysis can be confounded by the frequency of FGFR2 in pathway lists. Our approach is further validated on two other datasets of summary statistics generated from GWAS of height and schizophrenia.


Subject(s)
Genome-Wide Association Study/statistics & numerical data , Body Height/genetics , Breast Neoplasms/genetics , Chromosome Mapping/statistics & numerical data , Computational Biology/methods , Computer Simulation , Databases, Genetic , Female , Gene Regulatory Networks , Humans , Models, Genetic , Models, Statistical , Polymorphism, Single Nucleotide , Receptor, Fibroblast Growth Factor, Type 2/genetics , Schizophrenia/genetics
19.
PLoS Genet ; 15(4): e1007739, 2019 04.
Article in English | MEDLINE | ID: mdl-30990817

ABSTRACT

Sleep disordered breathing (SDB)-related overnight hypoxemia is associated with cardiometabolic disease and other comorbidities. Understanding the genetic bases for variations in nocturnal hypoxemia may help understand mechanisms influencing oxygenation and SDB-related mortality. We conducted genome-wide association tests across 10 cohorts and 4 populations to identify genetic variants associated with three correlated measures of overnight oxyhemoglobin saturation: average and minimum oxyhemoglobin saturation during sleep and the percent of sleep with oxyhemoglobin saturation under 90%. The discovery sample consisted of 8,326 individuals. Variants with p < 1 × 10^-6 were analyzed in a replication group of 14,410 individuals. We identified 3 significantly associated regions, including 2 regions in multi-ethnic analyses (2q12, 10q22). SNPs in the 2q12 region were associated with minimum SpO2 (rs78136548 p = 2.70 × 10^-10). SNPs at 10q22 were associated with all three traits including average SpO2 (rs72805692 p = 4.58 × 10^-8). SNPs in both regions were associated in over 20,000 individuals and are supported by prior associations or functional evidence. Four additional significant regions were detected in secondary sex-stratified and combined discovery and replication analyses, including a region overlapping Reelin, a known marker of respiratory complex neurons. These are the first genome-wide significant findings reported for oxyhemoglobin saturation during sleep, a phenotype of high clinical interest. Our replicated associations with HK1 and IL18R1 suggest that variants in inflammatory pathways, such as the biologically-plausible NLRP3 inflammasome, may contribute to nocturnal hypoxemia.


Subject(s)
Hexokinase/genetics , Interleukin-18 Receptor alpha Subunit/genetics , Oxyhemoglobins/metabolism , Sleep/genetics , Adolescent , Adult , Aged , Aged, 80 and over , Cell Adhesion Molecules, Neuronal/genetics , Computational Biology , Extracellular Matrix Proteins/genetics , Female , Gene Regulatory Networks , Genetic Variation , Genome-Wide Association Study , Humans , Hypoxia/blood , Hypoxia/genetics , Male , Middle Aged , NLR Family, Pyrin Domain-Containing 3 Protein/genetics , Nerve Tissue Proteins/genetics , Oxygen/blood , Polymorphism, Single Nucleotide , Quantitative Trait Loci , Reelin Protein , Serine Endopeptidases/genetics , Sleep Apnea Syndromes/blood , Sleep Apnea Syndromes/genetics , Young Adult
20.
Hum Mol Genet ; 28(4): 675-687, 2019 02 15.
Article in English | MEDLINE | ID: mdl-30403821

ABSTRACT

Obstructive sleep apnea (OSA) is a common disorder associated with increased risk of cardiovascular disease and mortality. Its prevalence and severity vary across ancestral background. Although OSA traits are heritable, few genetic associations have been identified. To identify genetic regions associated with OSA and improve statistical power, we applied admixture mapping on three primary OSA traits [the apnea hypopnea index (AHI), overnight average oxyhemoglobin saturation (SaO2) and percentage time SaO2 < 90%] and a secondary trait (respiratory event duration) in a Hispanic/Latino American population study of 11 575 individuals with significant variation in ancestral background. Linear mixed models were performed using previously inferred African, European and Amerindian local genetic ancestry markers. Global African ancestry was associated with a lower AHI, higher SaO2 and shorter event duration. Admixture mapping analysis of the primary OSA traits identified local African ancestry at the chromosomal region 2q37 as genome-wide significantly associated with AHI (P < 5.7 × 10^-5), and European and Amerindian ancestries at 18q21 suggestively associated with both AHI and percentage time SaO2 < 90% (P < 10^-3). Follow-up joint ancestry-SNP association analyses identified novel variants in ferrochelatase (FECH), significantly associated with AHI and percentage time SaO2 < 90% after adjusting for multiple tests (P < 8 × 10^-6). These signals contributed to the admixture mapping associations and were replicated in independent cohorts. In this first admixture mapping study of OSA, novel associations with variants in the iron/heme metabolism pathway suggest a role for iron in influencing respiratory traits underlying OSA.


Subject(s)
Ferrochelatase/genetics , Genome-Wide Association Study , Sleep Apnea, Obstructive/genetics , Aged , Chromosome Mapping , Female , Genotype , Hispanic or Latino/genetics , Humans , Male , Middle Aged , Polymorphism, Single Nucleotide/genetics , Polysomnography , Sleep Apnea, Obstructive/diagnostic imaging , Sleep Apnea, Obstructive/physiopathology , White People/genetics