Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 374
Filtrar
1.
Artigo em Inglês | MEDLINE | ID: mdl-38724019

RESUMO

Significant progress has been made in augmenting clinical decision-making using artificial intelligence (AI) in the context of secondary and tertiary care at large academic medical centers. For such innovations to have an impact across the spectrum of care, additional challenges must be addressed, including inconsistent use of preventative care and gaps in chronic care management. The integration of additional data, including genomics and data from wearables, could prove critical in addressing these gaps, but technical, legal, and ethical challenges arise. On the technical side, approaches for integrating complex and messy data are needed. Data and design imperfections like selection bias, missing data, and confounding must be addressed. In terms of legal and ethical challenges, while AI has the potential to aid in leveraging patient data to make clinical care decisions, we also risk exacerbating existing disparities. Organizations implementing AI solutions must carefully consider how they can improve care for all and reduce inequities.

2.
PLoS Genet ; 19(11): e1011022, 2023 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-37934796

RESUMO

Epigenetic researchers often evaluate DNA methylation as a potential mediator of the effect of social/environmental exposures on a health outcome. Modern statistical methods for jointly evaluating many mediators have not been widely adopted. We compare seven methods for high-dimensional mediation analysis with continuous outcomes through both diverse simulations and analysis of DNAm data from a large multi-ethnic cohort in the United States, while providing an R package for their seamless implementation and adoption. Among the considered choices, the best-performing methods for detecting active mediators in simulations are the Bayesian sparse linear mixed model (BSLMM) and high-dimensional mediation analysis (HDMA); while the preferred methods for estimating the global mediation effect are high-dimensional linear mediation analysis (HILMA) and principal component mediation analysis (PCMA). We provide guidelines for epigenetic researchers on choosing the best method in practice and offer suggestions for future methodological development.


Assuntos
Metilação de DNA , Análise de Mediação , Humanos , Metilação de DNA/genética , Teorema de Bayes , Modelos Lineares , Exposição Ambiental
3.
PLoS Genet ; 19(12): e1010907, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-38113267

RESUMO

OBJECTIVE: To overcome the limitations associated with the collection and curation of COVID-19 outcome data in biobanks, this study proposes the use of polygenic risk scores (PRS) as reliable proxies of COVID-19 severity across three large biobanks: the Michigan Genomics Initiative (MGI), UK Biobank (UKB), and NIH All of Us. The goal is to identify associations between pre-existing conditions and COVID-19 severity. METHODS: Drawing on a sample of more than 500,000 individuals from the three biobanks, we conducted a phenome-wide association study (PheWAS) to identify associations between a PRS for COVID-19 severity, derived from a genome-wide association study on COVID-19 hospitalization, and clinical pre-existing, pre-pandemic phenotypes. We performed cohort-specific PRS PheWAS and a subsequent fixed-effects meta-analysis. RESULTS: The current study uncovered 23 pre-existing conditions significantly associated with the COVID-19 severity PRS in cohort-specific analyses, of which 21 were observed in the UKB cohort and two in the MGI cohort. The meta-analysis yielded 27 significant phenotypes predominantly related to obesity, metabolic disorders, and cardiovascular conditions. After adjusting for body mass index, several clinical phenotypes, such as hypercholesterolemia and gastrointestinal disorders, remained associated with an increased risk of hospitalization following COVID-19 infection. CONCLUSION: By employing PRS as a proxy for COVID-19 severity, we corroborated known risk factors and identified novel associations between pre-existing clinical phenotypes and COVID-19 severity. Our study highlights the potential value of using PRS when actual outcome data may be limited or inadequate for robust analyses.


Assuntos
COVID-19 , Saúde da População , Humanos , Estudo de Associação Genômica Ampla , Estratificação de Risco Genético , COVID-19/genética , Bancos de Espécimes Biológicos , Cobertura de Condição Pré-Existente , Fatores de Risco , Predisposição Genética para Doença
4.
Am J Hum Genet ; 109(11): 1998-2008, 2022 11 03.
Artigo em Inglês | MEDLINE | ID: mdl-36240765

RESUMO

As most existing genome-wide association studies (GWASs) were conducted in European-ancestry cohorts, and as the existing polygenic risk score (PRS) models have limited transferability across ancestry groups, PRS research on non-European-ancestry groups needs to make efficient use of available data until we attain large sample sizes across all ancestry groups. Here we propose a PRS method using transfer learning techniques. Our approach, TL-PRS, uses gradient descent to fine-tune the baseline PRS model from an ancestry group with large sample GWASs to the dataset of target ancestry. In our application of constructing PRS for seven quantitative and two dichotomous traits for 10,285 individuals of South Asian ancestry and 8,168 individuals of African ancestry in UK Biobank, TL-PRS using PRS-CS as a baseline method obtained 25% average relative improvement for South Asian samples and 29% for African samples compared to the standard PRS-CS method in terms of predicted R2. Our approach increases the transferability of PRSs across ancestries and thereby helps reduce existing inequities in genetics research.


Assuntos
Estudo de Associação Genômica Ampla , Herança Multifatorial , Humanos , Herança Multifatorial/genética , Predisposição Genética para Doença , Polimorfismo de Nucleotídeo Único/genética , Fatores de Risco , Aprendizado de Máquina
5.
Am J Hum Genet ; 109(10): 1742-1760, 2022 10 06.
Artigo em Inglês | MEDLINE | ID: mdl-36152628

RESUMO

Complex traits are influenced by genetic risk factors, lifestyle, and environmental variables, so-called exposures. Some exposures, e.g., smoking or lipid levels, have common genetic modifiers identified in genome-wide association studies. Because measurements are often unfeasible, exposure polygenic risk scores (ExPRSs) offer an alternative to study the influence of exposures on various phenotypes. Here, we collected publicly available summary statistics for 28 exposures and applied four common PRS methods to generate ExPRSs in two large biobanks: the Michigan Genomics Initiative and the UK Biobank. We established ExPRSs for 27 exposures and demonstrated their applicability in phenome-wide association studies and as predictors for common chronic conditions. Especially the addition of multiple ExPRSs showed, for several chronic conditions, an improvement compared to prediction models that only included traditional, disease-focused PRSs. To facilitate follow-up studies, we share all ExPRS constructs and generated results via an online repository called ExPRSweb.


Assuntos
Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Lipídeos , Herança Multifatorial/genética , Fatores de Risco
6.
BMC Bioinformatics ; 25(1): 65, 2024 Feb 09.
Artigo em Inglês | MEDLINE | ID: mdl-38336614

RESUMO

BACKGROUND: Genetic variants can contribute differently to trait heritability by their functional categories, and recent studies have shown that incorporating functional annotation can improve the predictive performance of polygenic risk scores (PRSs). In addition, when only a small proportion of variants are causal variants, PRS methods that employ a Bayesian framework with shrinkage can account for such sparsity. It is possible that the annotation group level effect is also sparse. However, the number of PRS methods that incorporate both annotation information and shrinkage on effect sizes is limited. We propose a PRS method, PRSbils, which utilizes the functional annotation information with a bilevel continuous shrinkage prior to accommodate the varying genetic architectures both on the variant-specific level and on the functional annotation level. RESULTS: We conducted simulation studies and investigated the predictive performance in settings with different genetic architectures. Results indicated that when there was a relatively large variability of group-wise heritability contribution, the gain in prediction performance from the proposed method was on average 8.0% higher AUC compared to the benchmark method PRS-CS. The proposed method also yielded higher predictive performance compared to PRS-CS in settings with different overlapping patterns of annotation groups and obtained on average 6.4% higher AUC. We applied PRSbils to binary and quantitative traits in three real world data sources (the UK Biobank, the Michigan Genomics Initiative (MGI), and the Korean Genome and Epidemiology Study (KoGES)), and two sources of annotations: ANNOVAR, and pathway information from the Kyoto Encyclopedia of Genes and Genomes (KEGG), and demonstrated that the proposed method holds the potential for improving predictive performance by incorporating functional annotations. CONCLUSIONS: By utilizing a bilevel shrinkage framework, PRSbils enables the incorporation of both overlapping and non-overlapping annotations into PRS construction to improve the performance of genetic risk prediction. The software is available at https://github.com/styvon/PRSbils .


Assuntos
Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Teorema de Bayes , Estudo de Associação Genômica Ampla/métodos , Herança Multifatorial , Software , Fatores de Risco
7.
Genet Epidemiol ; 47(2): 167-184, 2023 03.
Artigo em Inglês | MEDLINE | ID: mdl-36465006

RESUMO

Mediation hypothesis testing for a large number of mediators is challenging due to the composite structure of the null hypothesis, H 0 : α ß = 0 ${H}_{0}:\alpha \beta =0$ ( α $\alpha $ : effect of the exposure on the mediator after adjusting for confounders; ß $\beta $ : effect of the mediator on the outcome after adjusting for exposure and confounders). In this paper, we reviewed three classes of methods for large-scale one at a time mediation hypothesis testing. These methods are commonly used for continuous outcomes and continuous mediators assuming there is no exposure-mediator interaction so that the product α ß $\alpha \beta $ has a causal interpretation as the indirect effect. The first class of methods ignores the impact of different structures under the composite null hypothesis, namely, (1) α = 0 , ß ≠ 0 $\alpha =0,\beta \ne 0$ ; (2) α ≠ 0 , ß = 0 $\alpha \ne 0,\beta =0$ ; and (3) α = ß = 0 $\alpha =\beta =0$ . The second class of methods weights the reference distribution under each case of the null to form a mixture reference distribution. The third class constructs a composite test statistic using the three p values obtained under each case of the null so that the reference distribution of the composite statistic is approximately U ( 0 , 1 ) $U(0,1)$ . In addition to these existing methods, we developed the Sobel-comp method belonging to the second class, which uses a corrected mixture reference distribution for Sobel's test statistic. We performed extensive simulation studies to compare all six methods belonging to these three classes in terms of the false positive rates (FPRs) under the null hypothesis and the true positive rates under the alternative hypothesis. We found that the second class of methods which uses a mixture reference distribution could best maintain the FPRs at the nominal level under the null hypothesis and had the greatest true positive rates under the alternative hypothesis. We applied all methods to study the mediation mechanism of DNA methylation sites in the pathway from adult socioeconomic status to glycated hemoglobin level using data from the Multi-Ethnic Study of Atherosclerosis (MESA). We provide guidelines for choosing the optimal mediation hypothesis testing method in practice and develop an R package medScan available on the CRAN for implementing all the six methods.


Assuntos
Modelos Genéticos , Modelos Estatísticos , Adulto , Humanos , Simulação por Computador , Projetos de Pesquisa
8.
Am J Hum Genet ; 108(5): 825-839, 2021 05 06.
Artigo em Inglês | MEDLINE | ID: mdl-33836139

RESUMO

In genome-wide association studies, ordinal categorical phenotypes are widely used to measure human behaviors, satisfaction, and preferences. However, because of the lack of analysis tools, methods designed for binary or quantitative traits are commonly used inappropriately to analyze categorical phenotypes. To accurately model the dependence of an ordinal categorical phenotype on covariates, we propose an efficient mixed model association test, proportional odds logistic mixed model (POLMM). POLMM is computationally efficient to analyze large datasets with hundreds of thousands of samples, can control type I error rates at a stringent significance level regardless of the phenotypic distribution, and is more powerful than alternative methods. In contrast, the standard linear mixed model approaches cannot control type I error rates for rare variants when the phenotypic distribution is unbalanced, although they performed well when testing common variants. We applied POLMM to 258 ordinal categorical phenotypes on array genotypes and imputed samples from 408,961 individuals in UK Biobank. In total, we identified 5,885 genome-wide significant variants, of which, 424 variants (7.2%) are rare variants with MAF < 0.01.


Assuntos
Simulação por Computador , Estudo de Associação Genômica Ampla , Modelos Genéticos , Fenótipo , Bancos de Espécimes Biológicos , Criança , Feminino , Humanos , Masculino , Projetos de Pesquisa , Reino Unido
9.
Ann Surg ; 279(4): 555-560, 2024 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-37830271

RESUMO

OBJECTIVE: To evaluate severe complications and mortality over years of independent practice among general surgeons. BACKGROUND: Despite concerns that newly graduated general surgeons may be unprepared for independent practice, it is unclear whether patient outcomes differ between early and later career surgeons. METHODS: We used Medicare claims for patients discharged between July 1, 2007 and December 31, 2019 to evaluate 30-day severe complications and mortality for 26 operations defined as core procedures by the American Board of Surgery. Generalized additive mixed models were used to assess the association between surgeon years in practice and 30-day outcomes while adjusting for differences in patient, hospital, and surgeon characteristics. RESULTS: The cohort included 1,329,358 operations performed by 14,399 surgeons. In generalized mixed models, the relative risk (RR) of mortality was higher among surgeons in their first year of practice compared with surgeons in their 15th year of practice [5.5% (95% CI: 4.1%-7.3%) vs 4.7% (95% CI: 3.5%-6.3%), RR: 1.17 (95% CI: 1.11-1.22)]. Similarly, the RR of severe complications was higher among surgeons in their first year of practice compared with surgeons in their 15th year of practice [7.5% (95% CI: 6.6%-8.5%) versus 6.9% (95% CI: 6.1%-7.9%), RR: 1.08 (95% CI: 1.03-1.14)]. When stratified by individual operation, 21 operations had a significantly higher RR of mortality and all 26 operations had a significantly higher RR of severe complications in the first compared with the 15th year of practice. CONCLUSIONS: Among general surgeons performing common operations, rates of mortality and severe complications were higher among newly graduated surgeons compared with later career surgeons.


Assuntos
Medicare , Cirurgiões , Humanos , Estados Unidos/epidemiologia , Idoso , Hospitais , Mortalidade Hospitalar , Competência Clínica , Complicações Pós-Operatórias/epidemiologia , Estudos Retrospectivos
10.
Biostatistics ; 24(2): 406-424, 2023 04 14.
Artigo em Inglês | MEDLINE | ID: mdl-34269371

RESUMO

It is becoming increasingly common for researchers to consider incorporating external information from large studies to improve the accuracy of statistical inference instead of relying on a modestly sized data set collected internally. With some new predictors only available internally, we aim to build improved regression models based on individual-level data from an "internal" study while incorporating summary-level information from "external" models. We propose a meta-analysis framework along with two weighted estimators as the composite of empirical Bayes estimators, which combines the estimates from different external models. The proposed framework is flexible and robust in the ways that (i) it is capable of incorporating external models that use a slightly different set of covariates; (ii) it is able to identify the most relevant external information and diminish the influence of information that is less compatible with the internal data; and (iii) it nicely balances the bias-variance trade-off while preserving the most efficiency gain. The proposed estimators are more efficient than the naïve analysis of the internal data and other naïve combinations of external estimators.


Assuntos
Modelos Estatísticos , Humanos , Teorema de Bayes , Interpretação Estatística de Dados , Viés
11.
J Neurol Neurosurg Psychiatry ; 95(3): 241-248, 2024 Feb 14.
Artigo em Inglês | MEDLINE | ID: mdl-37758454

RESUMO

BACKGROUND: Amyotrophic lateral sclerosis (ALS) is a fatal, progressive neurogenerative disease caused by combined genetic susceptibilities and environmental exposures. Identifying and validating these exposures are of paramount importance to modify disease risk. We previously reported that persistent organic pollutants (POPs) associate with ALS risk and survival and aimed to replicate these findings in a new cohort. METHOD: Participants with and without ALS recruited in Michigan provided plasma samples for POPs analysis by isotope dilution with triple quadrupole mass spectrometry. ORs for risk models and hazard ratios for survival models were calculated for individual POPs. POP mixtures were represented by environmental risk scores (ERS), a summation of total exposures, to evaluate the association with risk (ERSrisk) and survival (ERSsurvival). RESULTS: Samples from 164 ALS and 105 control participants were analysed. Several individual POPs significantly associated with ALS, including 8 of 22 polychlorinated biphenyls and 7 of 10 organochlorine pesticides (OCPs). ALS risk was most strongly represented by the mixture effects of OCPs alpha-hexachlorocyclohexane, hexachlorobenzene, trans-nonachlor and cis-nonachlor and an interquartile increase in ERSrisk enhanced ALS risk 2.58 times (p<0.001). ALS survival was represented by the combined mixture of all POPs and an interquartile increase in ERSsurvival enhanced ALS mortality rate 1.65 times (p=0.008). CONCLUSIONS: These data continue to support POPs as important factors for ALS risk and progression and replicate findings in a new cohort. The assessments of POPs in non-Michigan ALS cohorts are encouraged to better understand the global effect and the need for targeted disease risk reduction strategies.


Assuntos
Esclerose Lateral Amiotrófica , Poluentes Ambientais , Hidrocarbonetos Clorados , Humanos , Poluentes Orgânicos Persistentes , Michigan/epidemiologia , Poluentes Ambientais/efeitos adversos , Fatores de Risco
12.
Environ Sci Technol ; 58(19): 8264-8277, 2024 May 14.
Artigo em Inglês | MEDLINE | ID: mdl-38691655

RESUMO

Prenatal per- and poly-fluoroalkyl substances (PFAS) exposure may influence gestational outcomes through bioactive lipids─metabolic and inflammation pathway indicators. We estimated associations between prenatal PFAS exposure and bioactive lipids, measuring 12 serum PFAS and 50 plasma bioactive lipids in 414 pregnant women (median 17.4 weeks' gestation) from three Environmental influences on Child Health Outcomes Program cohorts. Pairwise association estimates across cohorts were obtained through linear mixed models and meta-analysis, adjusting the former for false discovery rates. Associations between the PFAS mixture and bioactive lipids were estimated using quantile g-computation. Pairwise analyses revealed bioactive lipid levels associated with PFDeA, PFNA, PFOA, and PFUdA (p < 0.05) across three enzymatic pathways (cyclooxygenase, cytochrome p450, lipoxygenase) in at least one combined cohort analysis, and PFOA and PFUdA (q < 0.2) in one linear mixed model. The strongest signature revealed doubling in PFOA corresponding with PGD2 (cyclooxygenase pathway; +24.3%, 95% CI: 7.3-43.9%) in the combined cohort. Mixture analysis revealed nine positive associations across all pathways with the PFAS mixture, the strongest signature indicating a quartile increase in the PFAS mixture associated with PGD2 (+34%, 95% CI: 8-66%), primarily driven by PFOS. Bioactive lipids emerged as prenatal PFAS exposure biomarkers, deepening insights into PFAS' influence on pregnancy outcomes.


Assuntos
Fluorocarbonos , Lipídeos , Humanos , Feminino , Gravidez , Lipídeos/sangue , Fluorocarbonos/sangue , Saúde da Criança , Estudos de Coortes , Estudos Transversais , Adulto , Poluentes Ambientais/sangue , Exposição Ambiental , Exposição Materna , Criança
14.
Environ Res ; 255: 119205, 2024 Aug 15.
Artigo em Inglês | MEDLINE | ID: mdl-38782334

RESUMO

BACKGROUND: Polycyclic aromatic hydrocarbons (PAHs) are endocrine disruptors resulting from incomplete combustion. Pregnancy represents a particularly vulnerable period to such exposures, given the significant influence of hormone physiology on fetal growth and pregnancy outcomes. Maternal thyroid hormones play crucial roles in fetal development and pregnancy outcomes. However, limited studies have examined gestational PAH exposure and maternal thyroid hormones during pregnancy. METHODS: Our study included 439 women enrolled in the LIFECODES birth cohort in Boston, aiming to explore the relationship between urinary PAH metabolites and thyroid hormones throughout pregnancy. Urine samples for PAH metabolite analysis and plasma samples for thyroid hormone were measured up to four visits throughout gestation. Single pollutant analyses employed linear mixed effect models to investigate individual associations between each PAH metabolite and thyroid hormone concentration. Sensitivity analyses were conducted to assess potential susceptibility windows and fetal-sex-specific effects of PAH exposure. Mixture analyses utilized quantile g-computation to evaluate the collective impact of eight PAH metabolites on thyroid hormone concentrations. Additionally, Bayesian kernel machine regression (BKMR) was employed to explore potential non-linear associations and interactions between PAH metabolites. Subject-specific random intercepts were incorporated to address intra-individual correlation of serial measurements over time in both single pollutant and mixture analyses. RESULTS: Our findings revealed positive trends in associations between PAH metabolites and thyroid hormones, both individually and collectively as a mixture. Sensitivity analyses indicated that these associations were influenced by the study visit and fetal sex. Mixture analyses suggested non-linear relationships and interactions between different PAH exposures. CONCLUSIONS: This comprehensive investigation underscores the critical importance of understanding the impact of PAH exposures on thyroid hormone physiology during pregnancy. The findings highlight the intricate interplay between environmental pollutants and human pregnancy physiology, emphasizing the need for targeted interventions and public health policies to mitigate adverse outcomes associated with prenatal PAH exposure.


Assuntos
Exposição Materna , Hidrocarbonetos Policíclicos Aromáticos , Hormônios Tireóideos , Humanos , Feminino , Gravidez , Hidrocarbonetos Policíclicos Aromáticos/urina , Hormônios Tireóideos/sangue , Adulto , Exposição Materna/efeitos adversos , Poluentes Ambientais/urina , Poluentes Ambientais/sangue , Boston , Estudos de Coortes , Adulto Jovem , Disruptores Endócrinos/urina
15.
PLoS Genet ; 17(9): e1009670, 2021 09.
Artigo em Inglês | MEDLINE | ID: mdl-34529658

RESUMO

Polygenic risk scores (PRS) can provide useful information for personalized risk stratification and disease risk assessment, especially when combined with non-genetic risk factors. However, their construction depends on the availability of summary statistics from genome-wide association studies (GWAS) independent from the target sample. For best compatibility, it was reported that GWAS and the target sample should match in terms of ancestries. Yet, GWAS, especially in the field of cancer, often lack diversity and are predominated by European ancestry. This bias is a limiting factor in PRS research. By using electronic health records and genetic data from the UK Biobank, we contrast the utility of breast and prostate cancer PRS derived from external European-ancestry-based GWAS across African, East Asian, European, and South Asian ancestry groups. We highlight differences in the PRS distributions of these groups that are amplified when PRS methods condense hundreds of thousands of variants into a single score. While European-GWAS-derived PRS were not directly transferrable across ancestries on an absolute scale, we establish their predictive potential when considering them separately within each group. For example, the top 10% of the breast cancer PRS distributions within each ancestry group each revealed significant enrichments of breast cancer cases compared to the bottom 90% (odds ratio of 2.81 [95%CI: 2.69,2.93] in European, 2.88 [1.85, 4.48] in African, 2.60 [1.25, 5.40] in East Asian, and 2.33 [1.55, 3.51] in South Asian individuals). Our findings highlight a compromise solution for PRS research to compensate for the lack of diversity in well-powered European GWAS efforts while recruitment of diverse participants in the field catches up.


Assuntos
Neoplasias da Mama/genética , Predisposição Genética para Doença , Herança Multifatorial , Feminino , Estudo de Associação Genômica Ampla , Humanos
16.
Am J Epidemiol ; 192(3): 328-333, 2023 02 24.
Artigo em Inglês | MEDLINE | ID: mdl-36446573

RESUMO

The widespread testing for severe acute respiratory syndrome coronavirus 2 infection has facilitated the use of test-negative designs (TNDs) for modeling coronavirus disease 2019 (COVID-19) vaccination and outcomes. Despite the comprehensive literature on TND, the use of TND in COVID-19 studies is relatively new and calls for robust design and analysis to adapt to a rapidly changing and dynamically evolving pandemic and to account for changes in testing and reporting practices. In this commentary, we aim to draw the attention of researchers to COVID-specific challenges in using TND as we are analyzing data amassed over more than two years of the pandemic. We first review when and why TND works and general challenges in TND studies presented in the literature. We then discuss COVID-specific challenges which have not received adequate acknowledgment but may add to the risk of invalid conclusions in TND studies of COVID-19.


Assuntos
COVID-19 , Humanos , Vacinas contra COVID-19 , Teste para COVID-19 , Vacinação
17.
Am J Hum Genet ; 107(2): 222-233, 2020 08 06.
Artigo em Inglês | MEDLINE | ID: mdl-32589924

RESUMO

With increasing biobanking efforts connecting electronic health records and national registries to germline genetics, the time-to-event data analysis has attracted increasing attention in the genetics studies of human diseases. In time-to-event data analysis, the Cox proportional hazards (PH) regression model is one of the most used approaches. However, existing methods and tools are not scalable when analyzing a large biobank with hundreds of thousands of samples and endpoints, and they are not accurate when testing low-frequency and rare variants. Here, we propose a scalable and accurate method, SPACox (a saddlepoint approximation implementation based on the Cox PH regression model), that is applicable for genome-wide scale time-to-event data analysis. SPACox requires fitting a Cox PH regression model only once across the genome-wide analysis and then uses a saddlepoint approximation (SPA) to calibrate the test statistics. Simulation studies show that SPACox is 76-252 times faster than other existing alternatives, such as gwasurvivr, 185-511 times faster than the standard Wald test, and more than 6,000 times faster than the Firth correction and can control type I error rates at the genome-wide significance level regardless of minor allele frequencies. Through the analysis of UK Biobank inpatient data of 282,871 white British European ancestry samples, we show that SPACox can efficiently analyze large sample sizes and accurately control type I error rates. We identified 611 loci associated with time-to-event phenotypes of 12 common diseases, of which 38 loci would be missed within a logistic regression framework with a binary phenotype defined as event occurrence status during the follow-up period.


Assuntos
Estudo de Associação Genômica Ampla/métodos , Bancos de Espécimes Biológicos , Estudos de Casos e Controles , Análise de Dados , Frequência do Gene/genética , Humanos , Modelos Logísticos , Fenótipo , Modelos de Riscos Proporcionais , Tamanho da Amostra , Reino Unido , População Branca/genética
18.
Am J Hum Genet ; 107(5): 815-836, 2020 11 05.
Artigo em Inglês | MEDLINE | ID: mdl-32991828

RESUMO

To facilitate scientific collaboration on polygenic risk scores (PRSs) research, we created an extensive PRS online repository for 35 common cancer traits integrating freely available genome-wide association studies (GWASs) summary statistics from three sources: published GWASs, the NHGRI-EBI GWAS Catalog, and UK Biobank-based GWASs. Our framework condenses these summary statistics into PRSs using various approaches such as linkage disequilibrium pruning/p value thresholding (fixed or data-adaptively optimized thresholds) and penalized, genome-wide effect size weighting. We evaluated the PRSs in two biobanks: the Michigan Genomics Initiative (MGI), a longitudinal biorepository effort at Michigan Medicine, and the population-based UK Biobank (UKB). For each PRS construct, we provide measures on predictive performance and discrimination. Besides PRS evaluation, the Cancer-PRSweb platform features construct downloads and phenome-wide PRS association study results (PRS-PheWAS) for predictive PRSs. We expect this integrated platform to accelerate PRS-related cancer research.


Assuntos
Bancos de Espécimes Biológicos/estatística & dados numéricos , Predisposição Genética para Doença , Genoma Humano , Genômica/métodos , Herança Multifatorial , Neoplasias/genética , Adulto , Idoso , Feminino , Estudo de Associação Genômica Ampla , Humanos , Internet , Desequilíbrio de Ligação , Masculino , Pessoa de Meia-Idade , Neoplasias/classificação , Neoplasias/diagnóstico , Neoplasias/epidemiologia , Fenótipo , Característica Quantitativa Herdável , Fatores de Risco , Reino Unido/epidemiologia , Estados Unidos/epidemiologia
19.
Bioinformatics ; 38(18): 4337-4343, 2022 09 15.
Artigo em Inglês | MEDLINE | ID: mdl-35876838

RESUMO

MOTIVATION: In the genome-wide association analysis of population-based biobanks, most diseases have low prevalence, which results in low detection power. One approach to tackle the problem is using family disease history, yet existing methods are unable to address type I error inflation induced by increased correlation of phenotypes among closely related samples, as well as unbalanced phenotypic distribution. RESULTS: We propose a new method for genetic association test with family disease history, mixed-model-based Test with Adjusted Phenotype and Empirical saddlepoint approximation, which controls for increased phenotype correlation by adopting a two-variance-component mixed model, accounts for case-control imbalance by using empirical saddlepoint approximation, and is flexible to incorporate any existing adjusted phenotypes, such as phenotypes from the LT-FH method. We show through simulation studies and analysis of UK Biobank data of white British samples and the Korean Genome and Epidemiology Study of Korean samples that the proposed method is robust and yields better calibration compared to existing methods while gaining power for detection of variant-phenotype associations. AVAILABILITY AND IMPLEMENTATION: The summary statistics and code generated in this study are available at https://github.com/styvon/TAPE. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Estudo de Associação Genômica Ampla/métodos , Estudos de Casos e Controles , Fenótipo , Simulação por Computador
20.
Muscle Nerve ; 67(3): 208-216, 2023 03.
Artigo em Inglês | MEDLINE | ID: mdl-36321729

RESUMO

INTRODUCTION/AIMS: Body mass index (BMI) is linked to amyotrophic lateral sclerosis (ALS) risk and prognosis, but additional research is needed. The aim of this study was to identify whether and when historical changes in BMI occurred in ALS participants, how these longer term trajectories associated with survival, and whether metabolomic profiles provided insight into potential mechanisms. METHODS: ALS and control participants self-reported body height and weight 10 (reference) and 5 years earlier, and at study entry (diagnosis for ALS participants). Generalized estimating equations evaluated differences in BMI trajectories between cases and controls. ALS survival was evaluated by BMI trajectory group using accelerated failure time models. BMI trajectories and survival associations were explored using published metabolomic profiling and correlation networks. RESULTS: Ten-year BMI trends differed between ALS and controls, with BMI loss in the 5 years before diagnosis despite BMI gains 10 to 5 years beforehand in both groups. An overall 10-year drop in BMI associated with a 27.1% decrease in ALS survival (P = .010). Metabolomic networks in ALS participants showed dysregulation in sphingomyelin, bile acid, and plasmalogen subpathways. DISCUSSION: ALS participants lost weight in the 5-year period before enrollment. BMI trajectories had three distinct groups and the group with significant weight loss in the past 10 years had the worst survival. Participants with a high BMI and increase in weight in the 10 years before symptom onset also had shorter survival. Certain metabolomics profiles were associated with the BMI trajectories. Replicating these findings in prospective cohorts is warranted.


Assuntos
Esclerose Lateral Amiotrófica , Humanos , Esclerose Lateral Amiotrófica/diagnóstico , Índice de Massa Corporal , Estudos Prospectivos , Metabolômica , Prognóstico
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA