RESUMO
Cigarette smoking and alcohol use are among the most prevalent substances used worldwide and account for a substantial proportion of preventable morbidity and mortality, underscoring the public health significance of understanding their etiology. Genome-wide association studies (GWAS) have successfully identified genetic variants associated with cigarette smoking and alcohol use traits. However, the vast majority of risk variants reside in non-coding regions of the genome, and their target genes and neurobiological mechanisms are unknown. Chromosomal conformation mappings can address this knowledge gap by charting the interaction profiles of risk-associated regulatory variants with target genes. To investigate the functional impact of common variants associated with cigarette smoking and alcohol use traits, we applied Hi-C coupled MAGMA (H-MAGMA) built upon cortical and newly generated midbrain dopaminergic neuronal Hi-C datasets to GWAS summary statistics of nicotine dependence, cigarettes per day, problematic alcohol use, and drinks per week. The identified risk genes mapped to key pathways associated with cigarette smoking and alcohol use traits, including drug metabolic processes and neuronal apoptosis. Risk genes were highly expressed in cortical glutamatergic, midbrain dopaminergic, GABAergic, and serotonergic neurons, suggesting them as relevant cell types in understanding the mechanisms by which genetic risk factors influence cigarette smoking and alcohol use. Lastly, we identified pleiotropic genes between cigarette smoking and alcohol use traits under the assumption that they may reveal substance-agnostic, shared neurobiological mechanisms of addiction. The number of pleiotropic genes was ~26-fold higher in dopaminergic neurons than in cortical neurons, emphasizing the critical role of ascending dopaminergic pathways in mediating general addiction phenotypes. Collectively, brain region- and neuronal subtype-specific 3D genome architecture helps refine neurobiological hypotheses for smoking, alcohol, and general addiction phenotypes by linking genetic risk factors to their target genes.
Assuntos
Comportamento Aditivo , Fumar Cigarros , Comportamento Aditivo/genética , Cromatina , Etanol , Estudo de Associação Genômica Ampla , FenótipoRESUMO
Rationale: The ability of peripheral blood biomarkers to assess chronic obstructive pulmonary disease (COPD) risk and progression is unknown. Genetics and gene expression may capture important aspects of COPD-related biology that predict disease activity. Objectives: Develop a transcriptional risk score (TRS) for COPD and assess the contribution of the TRS and a polygenic risk score (PRS) for disease susceptibility and progression. Methods: We randomly split 2,569 COPDGene (Genetic Epidemiology of COPD) participants with whole-blood RNA sequencing into training (n = 1,945) and testing (n = 624) samples and used 468 ECLIPSE (Evaluation of COPD Longitudinally to Identify Predictive Surrogate End-points) COPD cases with microarray data for replication. We developed a TRS using penalized regression (least absolute shrinkage and selection operator) to model FEV1/FVC and studied the predictive value of TRS for COPD (Global Initiative for Chronic Obstructive Lung Disease 2-4), prospective FEV1 change (ml/yr), and additional COPD-related traits. We adjusted for potential confounders, including age and smoking. We evaluated the predictive performance of the TRS in the context of a previously derived PRS and clinical factors. Measurements and Main Results: The TRS included 147 transcripts and was associated with COPD (odds ratio, 3.3; 95% confidence interval [CI], 2.4-4.5; P < 0.001), FEV1 change (ß, -17 ml/yr; 95% CI, -28 to -6.6; P = 0.002), and other COPD-related traits. In ECLIPSE cases, we replicated the association with FEV1 change (ß, -8.2; 95% CI, -15 to -1; P = 0.025) and the majority of other COPD-related traits. Models including PRS, TRS, and clinical factors were more predictive of COPD (area under the receiver operator characteristic curve, 0.84) and annualized FEV1 change compared with models with one risk score or clinical factors alone. Conclusions: Blood transcriptomics can improve prediction of COPD and lung function decline when added to a PRS and clinical risk factors.
Assuntos
Biomarcadores/sangue , Progressão da Doença , Doença Pulmonar Obstrutiva Crônica/sangue , Doença Pulmonar Obstrutiva Crônica/genética , Doença Pulmonar Obstrutiva Crônica/fisiopatologia , Medição de Risco/métodos , Idoso , Feminino , Regulação da Expressão Gênica , Predisposição Genética para Doença , Humanos , Masculino , Pessoa de Meia-Idade , Herança Multifatorial , Razão de Chances , Fenótipo , Valor Preditivo dos Testes , Estudos Prospectivos , Fatores de Risco , Índice de Gravidade de Doença , Fatores de TranscriçãoRESUMO
RATIONALE: COPD can be assessed using multidimensional grading systems with components from three domains: pulmonary function tests, symptoms and systemic features. Clinically, measures may be used interchangeably, though it is not known if they share similar pathobiology. OBJECTIVE: To use RNA sequencing (RNA-seq) to determine if there is an overlap in the underlying biological mechanisms and consequences driving different components of the multidimensional grading systems. METHODS: Whole blood was collected for RNA-seq from current and former smokers in the Genetic Epidemiology of COPD study. We tested the overlap in gene expression and biological pathways associated with case-control status and quantitative COPD phenotypes within and between the three domains. RESULTS: In 2647 subjects, there were 3030 genes differentially expressed in any of the three domains or case-control status. There were five genes that overlapped between the three domains and case-control status, including G protein-coupled receptor 15(GPR15), sestrin 1 (SESN1) and interferon-induced guanylate-binding protein 1 (GBP1), which were associated with longitudinal decline in FEV1. The overlap between the three domains was enriched for pathways related to cellular components. CONCLUSIONS: We identified gene sets and pathways that overlap between 12 COPD-related phenotypes and case-control status. There were no pathways represented in the overlap between the three domains and case-control status, but we identified multiple genes that demonstrated a consistent pattern of expression across several of the phenotypes. Patterns of gene expression correlation were generally similar to the correlation of clinical phenotypes in the PFT and symptom domains but not the systemic features.
Assuntos
Doença Pulmonar Obstrutiva Crônica , Expressão Gênica , Homologia de Genes , Humanos , Fenótipo , Doença Pulmonar Obstrutiva Crônica/genética , Análise de Sequência de RNARESUMO
BACKGROUND: The molecular basis of airway remodelling in chronic obstructive pulmonary disease (COPD) remains poorly understood. We identified gene expression signatures associated with chest computed tomography (CT) scan airway measures to understand molecular pathways associated with airway disease. METHODS: In 2396 subjects in the COPDGene Study, we examined the relationship between quantitative CT airway phenotypes and blood transcriptomes to identify airway disease-specific genes and to define an airway wall thickness (AWT) gene set score. Multivariable regression analyses were performed to identify associations of the AWT score with clinical phenotypes, bronchial gene expression and genetic variants. RESULTS: Type 1 interferon (IFN)-induced genes were consistently associated with AWT, square root wall area of a hypothetical airway with 10â mm internal perimeter (Pi10) and wall area percentage, with the strongest enrichment in AWT. A score derived from 18 genes whose expression was associated with AWT was associated with COPD-related phenotypes including reduced lung function (forced expiratory volume in 1â s percentage predicted ß=â-3.4; p<0.05) and increased exacerbations (incidence rate ratio 1.7; p<0.05). The AWT score was reproducibly associated with AWT in bronchial samples from 23 subjects (ß=3.22; p<0.05). The blood AWT score was associated with genetic variant rs876039, an expression quantitative trait locus for IKZF1, a gene that regulates IFN signalling and is associated with inflammatory diseases. CONCLUSIONS: A gene expression signature with IFN-stimulated genes from peripheral blood and bronchial brushings is associated with CT AWT, lung function and exacerbations. Shared genes and genetic associations suggest viral responses and/or autoimmune dysregulation as potential underlying mechanisms of airway disease in COPD.
Assuntos
Doença Pulmonar Obstrutiva Crônica , Enfisema Pulmonar , Transtornos Respiratórios , Volume Expiratório Forçado , Perfilação da Expressão Gênica , Humanos , Interferons/genética , Pulmão , Doença Pulmonar Obstrutiva Crônica/genéticaRESUMO
Most predictive models based on gene expression data do not leverage information related to gene splicing, despite the fact that splicing is a fundamental feature of eukaryotic gene expression. Cigarette smoking is an important environmental risk factor for many diseases, and it has profound effects on gene expression. Using smoking status as a prediction target, we developed deep neural network predictive models using gene, exon, and isoform level quantifications from RNA sequencing data in 2,557 subjects in the COPDGene Study. We observed that models using exon and isoform quantifications clearly outperformed gene-level models when using data from 5 genes from a previously published prediction model. Whereas the test set performance of the previously published model was 0.82 in the original publication, our exon-based models including an exon-to-isoform mapping layer achieved a test set AUC (area under the receiver operating characteristic) of 0.88, which improved to an AUC of 0.94 using exon quantifications from a larger set of genes. Isoform variability is an important source of latent information in RNA-seq data that can be used to improve clinical prediction models.
Assuntos
Aprendizado Profundo , Modelos Estatísticos , RNA-Seq/métodos , Fumar , Idoso , Biologia Computacional , Éxons/genética , Feminino , Perfilação da Expressão Gênica , Humanos , Masculino , Pessoa de Meia-Idade , Isoformas de Proteínas/genética , Curva ROC , Fumar/epidemiologia , Fumar/genéticaRESUMO
Cigarette smoking induces a profound transcriptomic and systemic inflammatory response. Previous studies have focused on gene level differential expression of smoking, but the genome-wide effects of smoking on alternative isoform regulation have not yet been described. We conducted RNA sequencing in whole-blood samples of 454 current and 767 former smokers in the COPDGene Study, and we analyzed the effects of smoking on differential usage of isoforms and exons. At 10% FDR, we detected 3167 differentially expressed genes, 945 differentially used isoforms and 160 differentially used exons. Isoform switch analysis revealed widespread 3' UTR lengthening associated with cigarette smoking. The lengthening of these 3' UTRs was consistent with alternative usage of distal polyadenylation sites, and these extended 3' UTR regions were significantly enriched with functional sequence elements including microRNA and RNA-protein binding sites. These findings warrant further studies on alternative polyadenylation events as potential biomarkers and novel therapeutic targets for smoking-related diseases.
Assuntos
Fumar Cigarros , Poliadenilação , Regiões 3' não Traduzidas , Fumar Cigarros/efeitos adversos , Fumar Cigarros/genética , Isoformas de Proteínas/genética , Fumar/efeitos adversos , Fumar/genéticaRESUMO
INTRODUCTION: Cachexia contributes to increased mortality and reduced quality of life in Chronic Obstructive Pulmonary Disease (COPD) and may be associated with underlying gene expression changes. Our goal was to identify differential gene expression signatures associated with COPD cachexia in current and former smokers. METHODS: We analyzed whole-blood gene expression data from participants with COPD in a discovery cohort (COPDGene, N = 400) and assessed replication (ECLIPSE, N = 114). To approximate the consensus definition using available criteria, cachexia was defined as weight-loss > 5% in the past 12 months or low body mass index (BMI) (< 20 kg/m2) and 1/3 criteria: decreased muscle strength (six-minute walk distance < 350 m), anemia (hemoglobin < 12 g/dl), and low fat-free mass index (FFMI) (< 15 kg/m2 among women and < 17 kg/m2 among men) in COPDGene. In ECLIPSE, cachexia was defined as weight-loss > 5% in the past 12 months or low BMI and 3/5 criteria: decreased muscle strength, anorexia, abnormal biochemistry (anemia or high c-reactive protein (> 5 mg/l)), fatigue, and low FFMI. Differential gene expression was assessed between cachectic and non-cachectic subjects, adjusting for age, sex, white blood cell counts, and technical covariates. Gene set enrichment analysis was performed using MSigDB. RESULTS: The prevalence of COPD cachexia was 13.7% in COPDGene and 7.9% in ECLIPSE. Fourteen genes were differentially downregulated in cachectic versus non-cachectic COPD patients in COPDGene (FDR < 0.05) and ECLIPSE (FDR < 0.05). DISCUSSION: Several replicated genes regulating heme metabolism were downregulated among participants with COPD cachexia. Impaired heme biosynthesis may contribute to cachexia development through free-iron buildup and oxidative tissue damage.
Assuntos
Caquexia/genética , Caquexia/metabolismo , Heme/genética , Heme/metabolismo , Doença Pulmonar Obstrutiva Crônica/genética , Doença Pulmonar Obstrutiva Crônica/metabolismo , Idoso , Idoso de 80 Anos ou mais , Caquexia/epidemiologia , Estudos de Coortes , Regulação para Baixo/fisiologia , Feminino , Seguimentos , Estudo de Associação Genômica Ampla/métodos , Humanos , Estudos Longitudinais , Masculino , Pessoa de Meia-Idade , Doença Pulmonar Obstrutiva Crônica/epidemiologiaRESUMO
Genome-wide association studies (GWAS) have identified multiple associations with emphysema apicobasal distribution (EABD), but the biological functions of these variants are unknown. To characterize the functions of EABD-associated variants, we integrated GWAS results with 1) expression quantitative trait loci (eQTL) from the Genotype Tissue Expression (GTEx) project and subjects in the COPDGene (Genetic Epidemiology of COPD) study and 2) cell type epigenomic marks from the Roadmap Epigenomics project. On the basis of these analyses, we selected a variant near ACVR1B (activin A receptor type 1B) for functional validation. SNPs from 168 loci with P values less than 5 × 10-5 in the largest GWAS meta-analysis of EABD were analyzed. Eighty-four loci overlapped eQTL, with 12 of these loci showing greater than 80% likelihood of harboring a single, shared GWAS and eQTL causal variant. Seventeen cell types were enriched for overlap between EABD loci and Roadmap Epigenomics marks (permutation P < 0.05), with the strongest enrichment observed in CD4+, CD8+, and regulatory T cells. We selected a putative causal variant, rs7962469, associated with ACVR1B expression in lung tissue for additional functional investigation, and reporter assays confirmed allele-specific regulatory activity for this variant in human bronchial epithelial and Jurkat immune cell lines. ACVR1B expression levels exhibit a nominally significant association with emphysema distribution. EABD-associated loci are preferentially enriched in regulatory elements of multiple cell types, most notably T-cell subsets. Multiple EABD loci colocalize to regulatory elements that are active across multiple tissues and cell types, and functional analyses confirm the presence of an EABD-associated functional variant that regulates ACVR1B expression, indicating that transforming growth factor-ß signaling plays a role in the EABD phenotype. Clinical trial registered with www.clinicaltrials.gov (NCT00608764).
Assuntos
Receptores de Ativinas Tipo I/genética , Predisposição Genética para Doença/genética , Enfisema Pulmonar/genética , Fator de Crescimento Transformador beta1/metabolismo , Linhagem Celular Tumoral , Estudo de Associação Genômica Ampla , Humanos , Células Jurkat , Pulmão/patologia , Polimorfismo de Nucleotídeo Único/genética , Estudo de Prova de Conceito , Locos de Características Quantitativas/genética , Subpopulações de Linfócitos T/imunologiaRESUMO
Genome-wide association studies have identified loci associated with Alzheimer's Disease (AD), but identifying the exact causal variants and genes at each locus is challenging due to linkage disequilibrium and their largely non-coding nature. To address this, we performed a massively parallel reporter assay of 3,576 AD-associated variants in THP-1 macrophages in both resting and proinflammatory states and identified 47 expression-modulating variants (emVars). To understand the endogenous chromatin context of emVars, we built an activity-by-contact model using epigenomic maps of macrophage inflammation and inferred condition-specific enhancer-promoter pairs. Intersection of emVars with enhancer-promoter pairs and microglia expression quantitative trait loci allowed us to connect 39 emVars to 76 putative AD risk genes enriched for AD-associated molecular signatures. Overall, systematic characterization of AD-associated variants enhances our understanding of the regulatory mechanisms underlying AD pathogenesis.
RESUMO
Genome-wide association studies (GWASs) have successfully identified 145 genomic regions that contribute to schizophrenia risk, but linkage disequilibrium makes it challenging to discern causal variants. We performed a massively parallel reporter assay (MPRA) on 5,173 fine-mapped schizophrenia GWAS variants in primary human neural progenitors and identified 439 variants with allelic regulatory effects (MPRA-positive variants). Transcription factor binding had modest predictive power, while fine-map posterior probability, enhancer overlap, and evolutionary conservation failed to predict MPRA-positive variants. Furthermore, 64% of MPRA-positive variants did not exhibit expressive quantitative trait loci signature, suggesting that MPRA could identify yet unexplored variants with regulatory potentials. To predict the combinatorial effect of MPRA-positive variants on gene regulation, we propose an accessibility-by-contact model that combines MPRA-measured allelic activity with neuronal chromatin architecture.
RESUMO
INTRODUCTION: Chronic obstructive pulmonary disease (COPD) can progress across several domains, complicating the identification of the determinants of disease progression. In our previous work, we applied k-means clustering to spirometric and chest radiological measures to identify four COPD-related subtypes: 'relatively resistant smokers (RRS)', 'mild upper lobe-predominant emphysema (ULE)', 'airway-predominant disease (AD)' and 'severe emphysema (SE)'. In the current study, we examined the associations of these subtypes to longitudinal COPD-related health measures as well as blood transcriptomic and plasma proteomic biomarkers. METHODS: We included 8266 non-Hispanic white and African-American smokers from the COPDGene study. We used linear regression to investigate cluster associations to 5-year prospective changes in spirometric and radiological measures and to gene expression and protein levels. We used Cox-proportional hazard test to test for cluster associations to prospective exacerbations, comorbidities and mortality. RESULTS: The RRS, ULE, AD and SE clusters represented 39%, 15%, 26% and 20% of the studied cohort at baseline, respectively. The SE cluster had the greatest 5-year FEV1 (forced expiratory volume in 1 s) and emphysema progression, and the highest risks of exacerbations, cardiovascular disease and mortality. The AD cluster had the highest diabetes risk. After adjustments, only the SE cluster had an elevated respiratory mortality risk, while the ULE, AD and SE clusters had elevated all-cause mortality risks. These clusters also demonstrated differential protein and gene expression biomarker associations, mostly related to inflammatory and immune processes. CONCLUSION: COPD k-means subtypes demonstrate varying rates of disease progression, prospective comorbidities, mortality and associations to transcriptomic and proteomic biomarkers. These findings emphasise the clinical and biological relevance of these subtypes, which call for more study for translation into clinical practice. TRAIL REGISTRATION NUMBER: NCT00608764.
Assuntos
Enfisema , Doença Pulmonar Obstrutiva Crônica , Enfisema Pulmonar , Biomarcadores , Análise por Conglomerados , Progressão da Doença , Enfisema/complicações , Humanos , Estudos Prospectivos , Proteômica , Doença Pulmonar Obstrutiva Crônica/complicações , Enfisema Pulmonar/complicações , Enfisema Pulmonar/diagnóstico por imagem , Tomografia Computadorizada por Raios XRESUMO
The human microbiome has a role in the development of multiple diseases. Individual microbiome profiles are highly personalized, though many species are shared. Understanding the relationship between the human microbiome and disease may inform future individualized treatments. We hypothesize the blood microbiome signature may be a surrogate for some lung microbial characteristics. We sought associations between the blood microbiome signature and lung-relevant host factors. Based on reads not mapped to the human genome, we detected microbial nucleic acids through secondary use of peripheral blood RNA-sequencing from 2,590 current and former smokers with and without chronic obstructive pulmonary disease (COPD) from the COPDGene study. We used the Genome Analysis Toolkit (GATK) microbial pipeline PathSeq to infer microbial profiles. We tested associations between the inferred profiles and lung disease relevant phenotypes and examined links to host gene expression pathways. We replicated our analyses using a second independent set of blood RNA-seq data from 1,065 COPDGene study subjects and performed a meta-analysis across the two studies. The four phyla with highest abundance across all subjects were Proteobacteria, Actinobacteria, Firmicutes and Bacteroidetes. In our meta-analysis, we observed associations (q-value < 0.05) between Acinetobacter, Serratia, Streptococcus and Bacillus inferred abundances and Modified Medical Research Council (mMRC) dyspnea score. Current smoking status was associated (q < 0.05) with Acinetobacter, Serratia and Cutibacterium abundance. All 12 taxa investigated were associated with at least one white blood cell distribution variable. Abundance for nine of the 12 taxa was associated with sex, and seven of the 12 taxa were associated with race. Host-microbiome interaction analysis revealed clustering of genera associated with mMRC dyspnea score and smoking status, through shared links to several host pathways. This study is the first to identify a bacterial microbiome signature in the peripheral blood of current and former smokers. Understanding the relationships between systemic microbial signatures and lung-related phenotypes may inform novel interventions and aid understanding of the systemic effects of smoking.