Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 51
Filtrar
Más filtros

Base de datos
Tipo del documento
Intervalo de año de publicación
1.
Nat Commun ; 15(1): 5357, 2024 Jun 25.
Artículo en Inglés | MEDLINE | ID: mdl-38918381

RESUMEN

Large national-level electronic health record (EHR) datasets offer new opportunities for disentangling the role of genes and environment through deep phenotype information and approximate pedigree structures. Here we use the approximate geographical locations of patients as a proxy for spatially correlated community-level environmental risk factors. We develop a spatial mixed linear effect (SMILE) model that incorporates both genetics and environmental contribution. We extract EHR and geographical locations from 257,620 nuclear families and compile 1083 disease outcome measurements from the MarketScan dataset. We augment the EHR with publicly available environmental data, including levels of particulate matter 2.5 (PM2.5), nitrogen dioxide (NO2), climate, and sociodemographic data. We refine the estimates of genetic heritability and quantify community-level environmental contributions. We also use wind speed and direction as instrumental variables to assess the causal effects of air pollution. In total, we find PM2.5 or NO2 have statistically significant causal effects on 135 diseases, including respiratory, musculoskeletal, digestive, metabolic, and sleep disorders, where PM2.5 and NO2 tend to affect biologically distinct disease categories. These analyses showcase several robust strategies for jointly modeling genetic and environmental effects on disease risk using large EHR datasets and will benefit upcoming biobank studies in the era of precision medicine.


Asunto(s)
Contaminación del Aire , Dióxido de Nitrógeno , Material Particulado , Humanos , Contaminación del Aire/efectos adversos , Material Particulado/efectos adversos , Dióxido de Nitrógeno/efectos adversos , Dióxido de Nitrógeno/análisis , Factores de Riesgo , Exposición a Riesgos Ambientales/efectos adversos , Masculino , Femenino , Registros Electrónicos de Salud , Contaminantes Atmosféricos/efectos adversos , Contaminantes Atmosféricos/análisis , Contaminantes Atmosféricos/toxicidad , Predisposición Genética a la Enfermedad , Interacción Gen-Ambiente , Persona de Mediana Edad , Adulto
2.
Nat Commun ; 15(1): 4260, 2024 May 20.
Artículo en Inglés | MEDLINE | ID: mdl-38769300

RESUMEN

Transcriptome-wide association study (TWAS) is a popular approach to dissect the functional consequence of disease associated non-coding variants. Most existing TWAS use bulk tissues and may not have the resolution to reveal cell-type specific target genes. Single-cell expression quantitative trait loci (sc-eQTL) datasets are emerging. The largest bulk- and sc-eQTL datasets are most conveniently available as summary statistics, but have not been broadly utilized in TWAS. Here, we present a new method EXPRESSO (EXpression PREdiction with Summary Statistics Only), to analyze sc-eQTL summary statistics, which also integrates 3D genomic data and epigenomic annotation to prioritize causal variants. EXPRESSO substantially improves existing methods. We apply EXPRESSO to analyze multi-ancestry GWAS datasets for 14 autoimmune diseases. EXPRESSO uniquely identifies 958 novel gene x trait associations, which is 26% more than the second-best method. Among them, 492 are unique to cell type level analysis and missed by TWAS using whole blood. We also develop a cell type aware drug repurposing pipeline, which leverages EXPRESSO results to identify drug compounds that can reverse disease gene expressions in relevant cell types. Our results point to multiple drugs with therapeutic potentials, including metformin for type 1 diabetes, and vitamin K for ulcerative colitis.


Asunto(s)
Estudio de Asociación del Genoma Completo , Sitios de Carácter Cuantitativo , Análisis de la Célula Individual , Humanos , Análisis de la Célula Individual/métodos , Estudio de Asociación del Genoma Completo/métodos , Predisposición Genética a la Enfermedad/genética , Transcriptoma/genética , Enfermedades Autoinmunes/genética , Polimorfismo de Nucleótido Simple , Herencia Multifactorial/genética , Perfilación de la Expresión Génica/métodos
3.
Nat Commun ; 15(1): 2359, 2024 Mar 19.
Artículo en Inglés | MEDLINE | ID: mdl-38504097

RESUMEN

Genetic mechanisms of blood pressure (BP) regulation remain poorly defined. Using kidney-specific epigenomic annotations and 3D genome information we generated and validated gene expression prediction models for the purpose of transcriptome-wide association studies in 700 human kidneys. We identified 889 kidney genes associated with BP of which 399 were prioritised as contributors to BP regulation. Imputation of kidney proteome and microRNAome uncovered 97 renal proteins and 11 miRNAs associated with BP. Integration with plasma proteomics and metabolomics illuminated circulating levels of myo-inositol, 4-guanidinobutanoate and angiotensinogen as downstream effectors of several kidney BP genes (SLC5A11, AGMAT, AGT, respectively). We showed that genetically determined reduction in renal expression may mimic the effects of rare loss-of-function variants on kidney mRNA/protein and lead to an increase in BP (e.g., ENPEP). We demonstrated a strong correlation (r = 0.81) in expression of protein-coding genes between cells harvested from urine and the kidney highlighting a diagnostic potential of urinary cell transcriptomics. We uncovered adenylyl cyclase activators as a repurposing opportunity for hypertension and illustrated examples of BP-elevating effects of anticancer drugs (e.g. tubulin polymerisation inhibitors). Collectively, our studies provide new biological insights into genetic regulation of BP with potential to drive clinical translation in hypertension.


Asunto(s)
Hipertensión , Proteoma , Humanos , Presión Sanguínea/genética , Proteoma/genética , Proteoma/metabolismo , Transcriptoma/genética , Multiómica , Hipertensión/metabolismo , Riñón/metabolismo , Proteínas de Transporte de Sodio-Glucosa/genética , Proteínas de Transporte de Sodio-Glucosa/metabolismo
4.
Commun Med (Lond) ; 3(1): 76, 2023 May 27.
Artículo en Inglés | MEDLINE | ID: mdl-37244961

RESUMEN

BACKGROUND: Previous studies have demonstrated epidemiological trends in individual metastatic cancer subtypes; however, research forecasting long-term incidence trends and projected survivorship of metastatic cancers is lacking. We assess the burden of metastatic cancer to 2040 by (1) characterizing past, current, and forecasted incidence trends, and (2) estimating odds of long-term (5-year) survivorship. METHODS: This retrospective, serial cross-sectional, population-based study used registry data from the Surveillance, Epidemiology, and End Results (SEER 9) database. Average annual percentage change (AAPC) was calculated to describe cancer incidence trends from 1988 to 2018. Autoregressive integrating moving average (ARIMA) models were used to forecast the distribution of primary metastatic cancer and metastatic cancer to specific sites from 2019 to 2040 and JoinPoint models were fitted to estimate mean projected annual percentage change (APC). RESULTS: The average annual percent change (AAPC) in incidence of metastatic cancer decreased by 0.80 per 100,000 individuals (1988-2018) and we forecast an APC decrease by 0.70 per 100,000 individuals (2018-2040). Analyses predict a decrease in metastases to liver (APC = -3.40, 95% CI [-3.50, -3.30]), lung (APC (2019-2030) = -1.90, 95% CI [-2.90, -1.00]); (2030-2040) = -3.70, 95% CI [-4.60, -2.80]), bone (APC = -4.00, 95% CI [-4.30, -3.70]), and brain (APC = -2.30, 95% CI [-2.60, -2.00]). By 2040, patients with metastatic cancer are predicted to have 46.7% greater odds of long-term survivorship, driven by increasing plurality of patients with more indolent forms of metastatic disease. CONCLUSIONS: By 2040, the distribution of metastatic cancer patients is predicted to shift in predominance from invariably fatal to indolent cancers subtypes. Continued research on metastatic cancers is important to guide health policy and clinical intervention efforts, and direct allocations of healthcare resources.


Cancer that has spread beyond the area where it originated and into different organs is called metastatic cancer. This study analyzed trends in metastatic cancer incidence, the proportion of those with metastatic cancer surviving 5 years after diagnosis and the locations in the body each cancer had spread to. The incidence of metastatic cancer decreased between 1988 and 2018 and is expected to continue to decrease until 2040. Some of the most common locations cancer spreads to is the lung, liver, brain, and bone. Metastatic cancer incidence to these areas is predicted to decrease. Also, the likelihood of surviving for more than 5 years after diagnosis with metastatic cancer is predicted to increase by 2040. This research should facilitate optimal planning of future healthcare resources and policy.

5.
Nat Commun ; 14(1): 668, 2023 02 07.
Artículo en Inglés | MEDLINE | ID: mdl-36750564

RESUMEN

Systemic lupus erythematosus is a heritable autoimmune disease that predominantly affects young women. To improve our understanding of genetic etiology, we conduct multi-ancestry and multi-trait meta-analysis of genome-wide association studies, encompassing 12 systemic lupus erythematosus cohorts from 3 different ancestries and 10 genetically correlated autoimmune diseases, and identify 16 novel loci. We also perform transcriptome-wide association studies, computational drug repurposing analysis, and cell type enrichment analysis. We discover putative drug classes, including a histone deacetylase inhibitor that could be repurposed to treat lupus. We also identify multiple cell types enriched with putative target genes, such as non-classical monocytes and B cells, which may be targeted for future therapeutics. Using this newly assembled result, we further construct polygenic risk score models and demonstrate that integrating polygenic risk score with clinical lab biomarkers improves the diagnostic accuracy of systemic lupus erythematosus using the Vanderbilt BioVU and Michigan Genomics Initiative biobanks.


Asunto(s)
Enfermedades Autoinmunes , Lupus Eritematoso Sistémico , Humanos , Femenino , Estudio de Asociación del Genoma Completo , Predisposición Genética a la Enfermedad , Fenotipo , Polimorfismo de Nucleótido Simple
6.
J Mol Biol ; 434(15): 167693, 2022 08 15.
Artículo en Inglés | MEDLINE | ID: mdl-35777465

RESUMEN

Human microbiome consists of trillions of microorganisms. Microbiota can modulate the host physiology through molecule and metabolite interactions. Integrating microbiome and metabolomics data have the potential to predict different diseases more accurately. Yet, most datasets only measure microbiome data but without paired metabolome data. Here, we propose a novel integrative modeling framework, Microbiome-based Supervised Contrastive Learning Framework (MB-SupCon). MB-SupCon integrates microbiome and metabolome data to generate microbiome embeddings, which can be used to improve the prediction accuracy in datasets that only measure microbiome data. As a proof of concept, we applied MB-SupCon on 720 samples with paired 16S microbiome data and metabolomics data from patients with type 2 diabetes. MB-SupCon outperformed existing prediction methods and achieved high average prediction accuracies for insulin resistance status (84.62%), sex (78.98%), and race (80.04%). Moreover, the microbiome embeddings form separable clusters for different covariate groups in the lower-dimensional space, which enhances data visualization. We also applied MB-SupCon on a large inflammatory bowel disease study and observed similar advantages. Thus, MB-SupCon could be broadly applicable to improve microbiome prediction models in multi-omics disease studies.


Asunto(s)
Metaboloma , Microbiota , Aprendizaje Automático Supervisado , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/microbiología , Humanos , Enfermedades Inflamatorias del Intestino/genética , Enfermedades Inflamatorias del Intestino/microbiología , Metabolómica/métodos , ARN Ribosómico 16S/genética
7.
Front Immunol ; 13: 889296, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35833142

RESUMEN

Genome-wide association studies (GWAS) have identified hundreds of genetic variants associated with autoimmune diseases and provided unique mechanistic insights and informed novel treatments. These individual genetic variants on their own typically confer a small effect of disease risk with limited predictive power; however, when aggregated (e.g., via polygenic risk score method), they could provide meaningful risk predictions for a myriad of diseases. In this review, we describe the recent advances in GWAS for autoimmune diseases and the practical application of this knowledge to predict an individual's susceptibility/severity for autoimmune diseases such as systemic lupus erythematosus (SLE) via the polygenic risk score method. We provide an overview of methods for deriving different polygenic risk scores and discuss the strategies to integrate additional information from correlated traits and diverse ancestries. We further advocate for the need to integrate clinical features (e.g., anti-nuclear antibody status) with genetic profiling to better identify patients at high risk of disease susceptibility/severity even before clinical signs or symptoms develop. We conclude by discussing future challenges and opportunities of applying polygenic risk score methods in clinical care.


Asunto(s)
Enfermedades Autoinmunes , Lupus Eritematoso Sistémico , Enfermedades Autoinmunes/genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo/métodos , Humanos , Lupus Eritematoso Sistémico/diagnóstico , Lupus Eritematoso Sistémico/genética , Factores de Riesgo
8.
Nat Commun ; 13(1): 3258, 2022 06 07.
Artículo en Inglés | MEDLINE | ID: mdl-35672318

RESUMEN

Transcriptome-wide association studies (TWAS) are popular approaches to test for association between imputed gene expression levels and traits of interest. Here, we propose an integrative method PUMICE (Prediction Using Models Informed by Chromatin conformations and Epigenomics) to integrate 3D genomic and epigenomic data with expression quantitative trait loci (eQTL) to more accurately predict gene expressions. PUMICE helps define and prioritize regions that harbor cis-regulatory variants, which outperforms competing methods. We further describe an extension to our method PUMICE +, which jointly combines TWAS results from single- and multi-tissue models. Across 79 traits, PUMICE + identifies 22% more independent novel genes and increases median chi-square statistics values at known loci by 35% compared to the second-best method, as well as achieves the narrowest credible interval size. Lastly, we perform computational drug repurposing and confirm that PUMICE + outperforms other TWAS methods.


Asunto(s)
Estudio de Asociación del Genoma Completo , Transcriptoma , Reposicionamiento de Medicamentos , Epigenómica , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo/métodos , Genómica , Humanos , Polimorfismo de Nucleótido Simple , Transcriptoma/genética
9.
JAMA Netw Open ; 5(1): e2145876, 2022 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-35099546

RESUMEN

Importance: The results of studies evaluating spinal cord stimulation (SCS) for postlaminectomy syndrome (PLS) have yielded mixed results. This has led to an increased emphasis on objective outcome measures such as opioid prescribing. Objective: To determine the association between SCS and long-term opioid therapy (LOT) for PLS. Design, Setting, and Participants: In this cohort study, adults with PLS were identified using the TriNetx Diamond Network and separated based on whether they underwent SCS. Patients were stratified according to baseline opioid use (opioid-naive or receiving LOT) and subsequent opioid therapy over the 12-month period ranging from 3 to 15 months post-SCS implantation or post-PLS index date. Statistical analysis was performed from June to December 2021. Exposure: SCS. Main Outcomes and Measures: The main outcome was cessation of opioid use among patients receiving LOT or abstinence from opioids among opioid-naive patients. Opioid-naive patients were defined as those receiving at most 2 opioid prescriptions per year, and patients on LOT were those receiving at least 6 opioid prescriptions per year. Results: Among 552 937 eligible patients treated between December 2015 and May 2021, 26 179 with PLS received an SCS implant. The median (IQR) patient age was 60 (51-69) years; 305 802 patients (55.3%) were female. Among those reporting racial identify (37.0% [204 758 patients]), 9.3% (18 971 patients) were African American, 0.3% (648 patients) were Asian, and 90.4% (185 139 patients) were White. Compared with those who did not receive an SCS, individuals who received an SCS were more likely to be using opioids preimplantation (mean [SD] prescriptions: 4.3 [8.5] vs 4.1 [9.3]; P < .001) but less likely to be using opioids after SCS implantation (mean [SD] prescriptions: 3.8 [8.2] vs 4.0 [9.4]; P = .006). In the 12-month study period, similar proportions in the SCS and no-SCS groups receiving baseline LOT remained on LOT (70.3% [n = 74 585] vs 69.2% [n = 3882], respectively; P = .10). In opioid-naive patients, SCS was associated with a small decreased likelihood of patients subsequently receiving LOT (7.6% vs 7.0%; difference, -0.6% [95% CI, -1.0% to -0.2%]; P = .003). In multivariable analysis, SCS was associated with an increased likelihood of not being on opioids in both opioid-naive (adjusted odds ratio [OR], 0.90 [95% CI, 0.85-0.96]; P < .001) and LOT patients (adjusted OR, 0.93 [95% CI, 0.88-0.99]; P = .02). White patients were significantly more likely to be diagnosed with PLS (ie, underwent surgery) (90.4% vs 85.2%; difference, 5.2% [95% CI, 5.1%-5.4%]; P < .001) and receive an SCS (93.7% vs 90.3%; difference, 3.4% [95% CI, 2.9% to 4.0%]; P < .001) than patients of other racial identities. Conclusions and Relevance: These findings suggest that under real-life conditions, SCS was associated with small, clinically questionable associations with opioid discontinuation and not starting opioids in the context of PLS.


Asunto(s)
Analgésicos Opioides/uso terapéutico , Prescripciones de Medicamentos/estadística & datos numéricos , Síndrome de Fracaso de la Cirugía Espinal Lumbar/terapia , Laminectomía/efectos adversos , Pautas de la Práctica en Medicina/estadística & datos numéricos , Estimulación de la Médula Espinal/estadística & datos numéricos , Anciano , Femenino , Humanos , Masculino , Persona de Mediana Edad , Análisis Multivariante , Oportunidad Relativa , Periodo Posoperatorio , Implantación de Prótesis
10.
Psychol Med ; 52(5): 968-978, 2022 04.
Artículo en Inglés | MEDLINE | ID: mdl-32762793

RESUMEN

BACKGROUND: Substance use occurs at a high rate in persons with a psychiatric disorder. Genetically informative studies have the potential to elucidate the etiology of these phenomena. Recent developments in genome-wide association studies (GWAS) allow new avenues of investigation. METHOD: Using results of GWAS meta-analyses, we performed a factor analysis of the genetic correlation structure, a genome-wide search of shared loci, and causally informative tests for six substance use phenotypes (four smoking, one alcohol, and one cannabis use) and five psychiatric disorders (ADHD, anorexia, depression, bipolar disorder, and schizophrenia). RESULTS: Two correlated externalizing and internalizing/psychosis factor were found, although model fit was beneath conventional standards. Of 458 loci reported in previous univariate GWAS of substance use and psychiatric disorders, about 50% (230 loci) were pleiotropic with additional 111 pleiotropic loci not reported from past GWAS. Of the 341 pleiotropic loci, 152 were associated with both substance use and psychiatric disorders, implicating neurodevelopment, cell morphogenesis, biological adhesion pathways, and enrichment in 13 different brain tissues. Seventy-five and 114 pleiotropic loci were specific to either psychiatric disorders or substance use phenotypes, implicating neuronal signaling pathway and clathrin-binding functions/structures, respectively. No consistent evidence for phenotypic causation was found across different Mendelian randomization methods. CONCLUSIONS: Genetic etiology of substance use and psychiatric disorders is highly pleiotropic and involves shared neurodevelopmental path, neurotransmission, and intracellular trafficking. In aggregate, the patterns are not consistent with vertical pleiotropy, more likely reflecting horizontal pleiotropy or more complex forms of phenotypic causation.


Asunto(s)
Trastornos Mentales , Esquizofrenia , Trastornos Relacionados con Sustancias , Pleiotropía Genética , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Trastornos Mentales/epidemiología , Trastornos Mentales/genética , Fenotipo , Polimorfismo de Nucleótido Simple , Esquizofrenia/epidemiología , Esquizofrenia/genética , Trastornos Relacionados con Sustancias/epidemiología , Trastornos Relacionados con Sustancias/genética
11.
Am J Hum Genet ; 109(1): 81-96, 2022 01 06.
Artículo en Inglés | MEDLINE | ID: mdl-34932938

RESUMEN

Large-scale gene sequencing studies for complex traits have the potential to identify causal genes with therapeutic implications. We performed gene-based association testing of blood lipid levels with rare (minor allele frequency < 1%) predicted damaging coding variation by using sequence data from >170,000 individuals from multiple ancestries: 97,493 European, 30,025 South Asian, 16,507 African, 16,440 Hispanic/Latino, 10,420 East Asian, and 1,182 Samoan. We identified 35 genes associated with circulating lipid levels; some of these genes have not been previously associated with lipid levels when using rare coding variation from population-based samples. We prioritize 32 genes in array-based genome-wide association study (GWAS) loci based on aggregations of rare coding variants; three (EVI5, SH2B3, and PLIN1) had no prior association of rare coding variants with lipid levels. Most of our associated genes showed evidence of association among multiple ancestries. Finally, we observed an enrichment of gene-based associations for low-density lipoprotein cholesterol drug target genes and for genes closest to GWAS index single-nucleotide polymorphisms (SNPs). Our results demonstrate that gene-based associations can be beneficial for drug target development and provide evidence that the gene closest to the array-based GWAS index SNP is often the functional gene for blood lipid levels.


Asunto(s)
Exoma , Variación Genética , Estudio de Asociación del Genoma Completo , Lípidos/sangre , Sistemas de Lectura Abierta , Alelos , Glucemia/genética , Estudios de Casos y Controles , Biología Computacional/métodos , Bases de Datos Genéticas , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/metabolismo , Predisposición Genética a la Enfermedad , Genética de Población , Estudio de Asociación del Genoma Completo/métodos , Humanos , Metabolismo de los Lípidos/genética , Hígado/metabolismo , Hígado/patología , Anotación de Secuencia Molecular , Herencia Multifactorial , Fenotipo , Polimorfismo de Nucleótido Simple
12.
JAMA Netw Open ; 4(10): e2127784, 2021 10 01.
Artículo en Inglés | MEDLINE | ID: mdl-34613403

RESUMEN

Importance: Currently, there are limited published data regarding resource use and spending on cancer care in the US. Objective: To characterize the most frequent medical services provided and the associated spending for privately insured patients with cancer in the US. Design, Setting, and Participants: This cohort study used data from the MarketScan database for the calendar year 2018 from a sample of 27.1 million privately insured individuals, including patients with a diagnosis of the 15 most prevalent cancers, predominantly from large insurers and self-insured employers. Overall societal health care spending was estimated for each cancer type by multiplying the mean total spending per patient (estimated from MarketScan) by the number of privately insured patients living with that cancer in 2018, as reported by the National Cancer Institute's Surveillance, Epidemiology, and End Results program. Analyses were performed from February 1, 2018, to July 8, 2021. Exposures: Evaluation and management as prescribed by treating care team. Main Outcomes and Measures: Current Procedural Terminology and Healthcare Common Procedure Coding System codes based on cancer diagnosis code. Results: The estimated cost of cancer care in 2018 for 402 115 patients with the 15 most prevalent cancer types was approximately $156.2 billion for privately insured adults younger than 65 years in the US. There were a total of 38.4 million documented procedure codes for 15 cancers in the MarketScan database, totaling $10.8 billion. Patients with breast cancer contributed the greatest total number of services (10.9 million [28.4%]), followed by those with colorectal cancer (3.9 million [10.2%]) and prostate cancer (3.6 million [9.4%]). Pathology and laboratory tests contributed the highest number of services performed (11.7 million [30.5%]), followed by medical services (6.3 million [16.4%]) and medical supplies and nonphysician services (6.1 million [15.9%]). The costliest cancers were those of the breast ($3.4 billion [31.5%]), followed by lung ($1.1 billion [10.2%]) and colorectum ($1.1 billion [10.2%]). Medical supplies and nonphysician services contributed the highest total spent ($4.0 billion [37.0%]), followed by radiology ($2.1 billion [19.4%]) and surgery ($1.8 billion [16.7%]). Conclusions and Relevance: This analysis suggests that patients with breast, colorectal, and prostate cancers had the greatest number of services performed, particularly for pathology and laboratory tests, whereas patients with breast, lung, lymphoma, and colorectal cancer incurred the greatest costs, particularly for medical supplies and nonphysician services. The cost of cancer care in 2018 for the 15 most prevalent cancer types was estimated to be approximately $156.2 billion for privately insured adults younger than 65 years in the US.


Asunto(s)
Planes de Seguro con Fines de Lucro/normas , Costos de la Atención en Salud/estadística & datos numéricos , Neoplasias/economía , Aceptación de la Atención de Salud/estadística & datos numéricos , Adulto , Estudios de Cohortes , Femenino , Planes de Seguro con Fines de Lucro/estadística & datos numéricos , Humanos , Masculino , Persona de Mediana Edad , Neoplasias/epidemiología , Estados Unidos/epidemiología
13.
Genome Res ; 31(9): 1629-1637, 2021 09.
Artículo en Inglés | MEDLINE | ID: mdl-34426515

RESUMEN

The X Chromosome plays an important role in human development and disease. However, functional genomic and disease association studies of X genes greatly lag behind autosomal gene studies, in part owing to the unique biology of X-Chromosome inactivation (XCI). Because of XCI, most genes are only expressed from one allele. Yet, ∼30% of X genes "escape" XCI and are transcribed from both alleles, many only in a proportion of the population. Such interindividual differences are likely to be disease relevant, particularly for sex-biased disorders. To understand the functional biology for X-linked genes, we developed X-Chromosome inactivation for RNA-seq (XCIR), a novel approach to identify escape genes using bulk RNA-seq data. Our method, available as an R package, is more powerful than alternative approaches and is computationally efficient to handle large population-scale data sets. Using annotated XCI states, we examined the contribution of X-linked genes to the disease heritability in the United Kingdom Biobank data set. We show that escape and variable escape genes explain the largest proportion of X heritability, which is in large part attributable to X genes with Y homology. Finally, we investigated the role of each XCI state in sex-biased diseases and found that although XY homologous gene pairs have a larger overall effect size, enrichment for variable escape genes is significantly increased in female-biased diseases. Our results, for the first time, quantitate the importance of variable escape genes for the etiology of sex-biased disease, and our pipeline allows analysis of larger data sets for a broad range of phenotypes.


Asunto(s)
Genes Ligados a X , Inactivación del Cromosoma X , Alelos , Animales , Femenino , Genómica , Cromosoma X/genética
14.
Nat Neurosci ; 24(10): 1367-1376, 2021 10.
Artículo en Inglés | MEDLINE | ID: mdl-34446935

RESUMEN

Behaviors and disorders related to self-regulation, such as substance use, antisocial behavior and attention-deficit/hyperactivity disorder, are collectively referred to as externalizing and have shared genetic liability. We applied a multivariate approach that leverages genetic correlations among externalizing traits for genome-wide association analyses. By pooling data from ~1.5 million people, our approach is statistically more powerful than single-trait analyses and identifies more than 500 genetic loci. The loci were enriched for genes expressed in the brain and related to nervous system development. A polygenic score constructed from our results predicts a range of behavioral and medical outcomes that were not part of genome-wide analyses, including traits that until now lacked well-performing polygenic scores, such as opioid use disorder, suicide, HIV infections, criminal convictions and unemployment. Our findings are consistent with the idea that persistent difficulties in self-regulation can be conceptualized as a neurodevelopmental trait with complex and far-reaching social and health correlates.


Asunto(s)
Conducta Adictiva/genética , Estudios de Asociación Genética , Autocontrol , Trastorno por Déficit de Atención con Hiperactividad/genética , Conducta Adictiva/psicología , Síntomas Conductuales/genética , Síntomas Conductuales/psicología , Biología Computacional , Crimen/psicología , Estudio de Asociación del Genoma Completo , Infecciones por VIH/genética , Infecciones por VIH/psicología , Humanos , Metaanálisis como Asunto , Herencia Multifactorial , Análisis Multivariante , Trastornos Relacionados con Opioides/genética , Trastornos Relacionados con Opioides/psicología , Reproducibilidad de los Resultados , Suicidio , Desempleo
15.
Nat Commun ; 12(1): 1964, 2021 03 30.
Artículo en Inglés | MEDLINE | ID: mdl-33785739

RESUMEN

Genome-wide association meta-analysis (GWAMA) is an effective approach to enlarge sample sizes and empower the discovery of novel associations between genotype and phenotype. Independent replication has been used as a gold-standard for validating genetic associations. However, as current GWAMA often seeks to aggregate all available datasets, it becomes impossible to find a large enough independent dataset to replicate new discoveries. Here we introduce a method, MAMBA (Meta-Analysis Model-based Assessment of replicability), for assessing the "posterior-probability-of-replicability" for identified associations by leveraging the strength and consistency of association signals between contributing studies. We demonstrate using simulations that MAMBA is more powerful and robust than existing methods, and produces more accurate genetic effects estimates. We apply MAMBA to a large-scale meta-analysis of addiction phenotypes with 1.2 million individuals. In addition to accurately identifying replicable common variant associations, MAMBA also pinpoints novel replicable rare variant associations from imputation-based GWAMA and hence greatly expands the set of analyzable variants.


Asunto(s)
Algoritmos , Biología Computacional/métodos , Estudio de Asociación del Genoma Completo/métodos , Metaanálisis como Asunto , Modelos Genéticos , Polimorfismo de Nucleótido Simple , Estudios de Asociación Genética/métodos , Genotipo , Fenotipo , Reproducibilidad de los Resultados , Tamaño de la Muestra , Programas Informáticos
16.
G3 (Bethesda) ; 11(4)2021 04 15.
Artículo en Inglés | MEDLINE | ID: mdl-33713107

RESUMEN

In microbiome research, metagenomic sequencing generates enormous amounts of data. These data are typically classified into taxa for taxonomy analysis, or into genes for functional analysis. However, a joint analysis where the reads are classified into taxa-specific genes is often overlooked. To enable the analysis of this biologically meaningful feature, we developed a novel bioinformatic toolkit, MetaPrism, which can analyze sequence reads for a set of joint taxa/gene analyses to: 1) classify sequence reads and estimate the abundances for taxa-specific genes; 2) tabularize and visualize taxa-specific gene abundances; 3) compare the abundances between groups; and 4) build prediction models for clinical outcome. We illustrated these functions using a published microbiome metagenomics dataset from patients treated with immune checkpoint inhibitor therapy and showed the joint features can serve as potential biomarkers to predict therapeutic responses. MetaPrism is a toolkit for joint taxa and gene analysis. It offers biological insights on the taxa-specific genes on top of the taxa-alone or gene-alone analysis. MetaPrism is open-source software and freely available at https://github.com/jiwoongbio/MetaPrism. The example script to reproduce the manuscript is also provided in the above code repository.


Asunto(s)
Metagenómica , Microbiota , Algoritmos , Biología Computacional , Humanos , Metagenoma , Microbiota/genética , Análisis de Secuencia de ADN , Programas Informáticos
18.
Gigascience ; 10(2)2021 02 05.
Artículo en Inglés | MEDLINE | ID: mdl-33543271

RESUMEN

BACKGROUND: Trillions of microbes inhabit the human body and have a profound effect on human health. The recent development of metagenome-wide association studies and other quantitative analysis methods accelerate the discovery of the associations between human microbiome and diseases. To assess the strengths and limitations of these analytical tools, simulating realistic microbiome datasets is critically important. However, simulating the real microbiome data is challenging because it is difficult to model their correlation structure using explicit statistical models. RESULTS: To address the challenge of simulating realistic microbiome data, we designed a novel simulation framework termed MB-GAN, by using a generative adversarial network (GAN) and utilizing methodology advancements from the deep learning community. MB-GAN can automatically learn from given microbial abundances and compute simulated abundances that are indistinguishable from them. In practice, MB-GAN showed the following advantages. First, MB-GAN avoids explicit statistical modeling assumptions, and it only requires real datasets as inputs. Second, unlike the traditional GANs, MB-GAN is easily applicable and can converge efficiently. CONCLUSIONS: By applying MB-GAN to a case-control gut microbiome study of 396 samples, we demonstrated that the simulated data and the original data had similar first-order and second-order properties, including sparsity, diversities, and taxa-taxa correlations. These advantages are suitable for further microbiome methodology development where high-fidelity microbiome data are needed.


Asunto(s)
Microbiota , Redes Neurales de la Computación , Simulación por Computador , Humanos , Procesamiento de Imagen Asistido por Computador , Proteínas
19.
J Invest Dermatol ; 141(6): 1493-1502, 2021 06.
Artículo en Inglés | MEDLINE | ID: mdl-33385400

RESUMEN

Psoriasis and type 2 diabetes (T2D) are complex conditions with significant impacts on health. Patients with psoriasis have a higher risk of T2D (∼1.5 OR) and vice versa, controlling for body mass index; yet, there has been a limited study comparing their genetic architecture. We hypothesized that there are shared genetic components between psoriasis and T2D. Trans-disease meta-analysis was applied to 8,016,731 well-imputed genetic markers from large-scale meta-analyses of psoriasis (11,024 cases and 16,336 controls) and T2D (74,124 cases and 824,006 controls), adjusted for body mass index. We confirmed our findings in a hospital-based study (42,112 patients) and tested for causal relationships with multivariable Mendelian randomization. Mendelian randomization identified a causal relationship between psoriasis and T2D (P = 1.6 × 10‒4, OR = 1.01) and highlighted the impact of body mass index. Trans-disease meta-analysis further revealed four genome-wide significant loci (P < 5 × 10‒8) with evidence of colocalization and shared directions of effect between psoriasis and T2D not present in body mass index. The proteins coded by genes in these loci (ACTR2, ERLIN1, TRMT112, and BECN1) are connected through NF-κB signaling. Our results provide insight into the immunological components that connect immune-mediated skin conditions and metabolic diseases, independent of confounding factors.


Asunto(s)
Diabetes Mellitus Tipo 2/genética , Sitios Genéticos/inmunología , Psoriasis/genética , Índice de Masa Corporal , Causalidad , Diabetes Mellitus Tipo 2/epidemiología , Diabetes Mellitus Tipo 2/inmunología , Predisposición Genética a la Enfermedad/epidemiología , Estudio de Asociación del Genoma Completo , Humanos , Análisis de la Aleatorización Mendeliana , FN-kappa B/metabolismo , Polimorfismo de Nucleótido Simple , Psoriasis/epidemiología , Psoriasis/inmunología , Transducción de Señal/genética , Transducción de Señal/inmunología
20.
Nat Genet ; 52(12): 1314-1332, 2020 12.
Artículo en Inglés | MEDLINE | ID: mdl-33230300

RESUMEN

Genetic studies of blood pressure (BP) to date have mainly analyzed common variants (minor allele frequency > 0.05). In a meta-analysis of up to ~1.3 million participants, we discovered 106 new BP-associated genomic regions and 87 rare (minor allele frequency ≤ 0.01) variant BP associations (P < 5 × 10-8), of which 32 were in new BP-associated loci and 55 were independent BP-associated single-nucleotide variants within known BP-associated regions. Average effects of rare variants (44% coding) were ~8 times larger than common variant effects and indicate potential candidate causal genes at new and known loci (for example, GATA5 and PLCB3). BP-associated variants (including rare and common) were enriched in regions of active chromatin in fetal tissues, potentially linking fetal development with BP regulation in later life. Multivariable Mendelian randomization suggested possible inverse effects of elevated systolic and diastolic BP on large artery stroke. Our study demonstrates the utility of rare-variant analyses for identifying candidate genes and the results highlight potential therapeutic targets.


Asunto(s)
Presión Sanguínea/genética , Frecuencia de los Genes/genética , Predisposición Genética a la Enfermedad/genética , Hipertensión/genética , Factor de Transcripción GATA5/genética , Estudio de Asociación del Genoma Completo , Genotipo , Humanos , Mutación/genética , Fosfolipasa C beta/genética , Polimorfismo de Nucleótido Simple/genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA