RESUMO
Mutations in a diverse set of driver genes increase the fitness of haematopoietic stem cells (HSCs), leading to clonal haematopoiesis1. These lesions are precursors for blood cancers2-6, but the basis of their fitness advantage remains largely unknown, partly owing to a paucity of large cohorts in which the clonal expansion rate has been assessed by longitudinal sampling. Here, to circumvent this limitation, we developed a method to infer the expansion rate from data from a single time point. We applied this method to 5,071 people with clonal haematopoiesis. A genome-wide association study revealed that a common inherited polymorphism in the TCL1A promoter was associated with a slower expansion rate in clonal haematopoiesis overall, but the effect varied by driver gene. Those carrying this protective allele exhibited markedly reduced growth rates or prevalence of clones with driver mutations in TET2, ASXL1, SF3B1 and SRSF2, but this effect was not seen in clones with driver mutations in DNMT3A. TCL1A was not expressed in normal or DNMT3A-mutated HSCs, but the introduction of mutations in TET2 or ASXL1 led to the expression of TCL1A protein and the expansion of HSCs in vitro. The protective allele restricted TCL1A expression and expansion of mutant HSCs, as did experimental knockdown of TCL1A expression. Forced expression of TCL1A promoted the expansion of human HSCs in vitro and mouse HSCs in vivo. Our results indicate that the fitness advantage of several commonly mutated driver genes in clonal haematopoiesis may be mediated by TCL1A activation.
Assuntos
Hematopoiese Clonal , Células-Tronco Hematopoéticas , Animais , Humanos , Camundongos , Alelos , Hematopoiese Clonal/genética , Estudo de Associação Genômica Ampla , Hematopoese/genética , Células-Tronco Hematopoéticas/citologia , Células-Tronco Hematopoéticas/metabolismo , Mutação , Regiões Promotoras GenéticasRESUMO
Gene regulatory networks (GRNs) are effective tools for inferring complex interactions between molecules that regulate biological processes and hence can provide insights into drivers of biological systems. Inferring coexpression networks is a critical element of GRN inference, as the correlation between expression patterns may indicate that genes are coregulated by common factors. However, methods that estimate coexpression networks generally derive an aggregate network representing the mean regulatory properties of the population and so fail to fully capture population heterogeneity. Bayesian optimized networks obtained by assimilating omic data (BONOBO) is a scalable Bayesian model for deriving individual sample-specific coexpression matrices that recognizes variations in molecular interactions across individuals. For each sample, BONOBO assumes a Gaussian distribution on the log-transformed centered gene expression and a conjugate prior distribution on the sample-specific coexpression matrix constructed from all other samples in the data. Combining the sample-specific gene coexpression with the prior distribution, BONOBO yields a closed-form solution for the posterior distribution of the sample-specific coexpression matrices, thus allowing the analysis of large data sets. We demonstrate BONOBO's utility in several contexts, including analyzing gene regulation in yeast transcription factor knockout studies, the prognostic significance of miRNA-mRNA interaction in human breast cancer subtypes, and sex differences in gene regulation within human thyroid tissue. We find that BONOBO outperforms other methods that have been used for sample-specific coexpression network inference and provides insight into individual differences in the drivers of biological processes.
Assuntos
Teorema de Bayes , Neoplasias da Mama , Redes Reguladoras de Genes , MicroRNAs , Humanos , Neoplasias da Mama/genética , Neoplasias da Mama/metabolismo , Feminino , Masculino , MicroRNAs/genética , MicroRNAs/metabolismo , Perfilação da Expressão Gênica/métodos , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , AlgoritmosRESUMO
Innate immune responses such as phagocytosis are critically linked to the generation of adaptive immune responses against the neoantigens in cancer and the efferocytosis that is essential for homeostasis in diseases characterized by lung injury, inflammation, and remodeling as in chronic obstructive pulmonary disease (COPD). Chitinase 3-like-1 (CHI3L1) is induced in many cancers where it inhibits adaptive immune responses by stimulating immune checkpoint molecules (ICPs) and portends a poor prognosis. CHI3L1 is also induced in COPD where it regulates epithelial cell death. In this study, we demonstrate that pulmonary melanoma metastasis inhibits macrophage phagocytosis by stimulating the CD47-SIRPα and CD24-Siglec10 phagocytosis checkpoint pathways while inhibiting macrophage "eat me" signals from calreticulin and HMGB1. We also demonstrate that these effects on macrophage phagocytosis are associated with CHI3L1 stimulation of the SHP-1 and SHP-2 phosphatases and inhibition of the accumulation and phosphorylation of cytoskeleton-regulating nonmuscle myosin IIa. This inhibition of innate immune responses such as phagocytosis provides a mechanistic explanation for the ability of CHI3L1 to stimulate ICPs and inhibit adaptive immune responses in cancer and diseases such as COPD. The ability of CHI3L1 to simultaneously inhibit innate immune responses, stimulate ICPs, inhibit T cell costimulation, and regulate a number of other oncogenic and inflammation pathways suggests that CHI3L1-targeted therapeutics are promising interventions in cancer, COPD, and other disorders.
Assuntos
Antígeno CD47 , Proteína 1 Semelhante à Quitinase-3 , Imunidade Inata , Fagocitose , Receptores Imunológicos , Animais , Fagocitose/imunologia , Camundongos , Antígeno CD47/imunologia , Antígeno CD47/metabolismo , Imunidade Inata/imunologia , Proteína 1 Semelhante à Quitinase-3/metabolismo , Proteína 1 Semelhante à Quitinase-3/imunologia , Receptores Imunológicos/imunologia , Receptores Imunológicos/metabolismo , Camundongos Endogâmicos C57BL , Macrófagos/imunologia , Antígeno CD24/imunologia , Antígeno CD24/metabolismo , Neoplasias Pulmonares/imunologia , Neoplasias Pulmonares/patologia , Antígenos de Diferenciação/imunologia , Humanos , Lectinas/metabolismo , Lectinas/imunologia , Linhagem Celular TumoralRESUMO
BACKGROUND: Asthma is a heterogeneous common respiratory disease that remains poorly understood. The established genetic associations fail to explain the high estimated heritability, and the prevalence of asthma differs between populations and geographic regions. Robust association analyses incorporating different genetic ancestries and whole-genome sequencing data may identify novel genetic associations. METHODS: We performed family-based genome-wide association analyses of childhood-onset asthma based on whole-genome sequencing (WGS) data for the 'The Genetic Epidemiology of Asthma in Costa Rica' study (GACRS) and the Childhood Asthma Management Program (CAMP). Based on parent-child trios with children diagnosed with asthma, we performed a single variant analysis using an additive and a recessive genetic model and a region-based association analysis of low-frequency and rare variants. RESULTS: Based on 1180 asthmatic trios (894 GACRS trios and 286 CAMP trios, a total of 3540 samples with WGS data), we identified three novel genetic loci associated with childhood-onset asthma: rs4832738 on 4p14 ($P=1.72\ast{10}^{-9}$, recessive model), rs1581479 on 8p22 ($P=1.47\ast{10}^{-8}$, additive model) and rs73367537 on 10q26 ($P=1.21\ast{10}^{-8}$, additive model in GACRS only). Integrative analyses suggested potential novel candidate genes underlying these associations: PGM2 on 4p14 and FGF20 on 8p22. CONCLUSION: Our family-based whole-genome sequencing analysis identified three novel genetic loci for childhood-onset asthma. Gene expression data and integrative analyses point to PGM2 on 4p14 and FGF20 on 8p22 as linked genes. Furthermore, region-based analyses suggest independent potential low-frequency/rare variant associations on 8p22. Follow-up analyses are needed to understand the functional mechanisms and generalizability of these associations.
Assuntos
Asma , Estudo de Associação Genômica Ampla , Humanos , Predisposição Genética para Doença , Asma/genética , Loci Gênicos , Sequenciamento Completo do Genoma , Polimorfismo de Nucleotídeo Único/genética , Fatores de Crescimento de Fibroblastos/genéticaRESUMO
Extensive evidence indicates that the pathobiological processes of a complex disease are associated with perturbation in specific neighborhoods of the human protein-protein interaction (PPI) network (also known as the interactome), often referred to as the disease module. Many computational methods have been developed to integrate the interactome and omics profiles to extract context-dependent disease modules. Yet, existing methods all have fundamental limitations in terms of rigor and/or efficiency. Here, we developed a statistical physics approach based on the random-field Ising model (RFIM) for disease module detection, which is both mathematically rigorous and computationally efficient. We applied our RFIM approach to genome-wide association studies (GWAS) of ten complex diseases to examine its performance for disease module detection. We found that our RFIM approach outperforms existing methods in terms of computational efficiency, connectivity of disease modules, and robustness to the interactome incompleteness.
Assuntos
Estudo de Associação Genômica Ampla , Mapas de Interação de Proteínas , Humanos , Estudo de Associação Genômica Ampla/métodos , Física , AlgoritmosRESUMO
Genetic similarity matrices are commonly used to assess population substructure (PS) in genetic studies. Through simulation studies and by the application to whole-genome sequencing (WGS) data, we evaluate the performance of three genetic similarity matrices: the unweighted and weighted Jaccard similarity matrices and the genetic relationship matrix. We describe different scenarios that can create numerical pitfalls and lead to incorrect conclusions in some instances. We consider scenarios in which PS is assessed based on loci that are located across the genome ('globally') and based on loci from a specific genomic region ('locally'). We also compare scenarios in which PS is evaluated based on loci from different minor allele frequency bins: common (>5%), low-frequency (5-0.5%) and rare (<0.5%) single-nucleotide variations (SNVs). Overall, we observe that all approaches provide the best clustering performance when computed based on rare SNVs. The performance of the similarity matrices is very similar for common and low-frequency variants, but for rare variants, the unweighted Jaccard matrix provides preferable clustering features. Based on visual inspection and in terms of standard clustering metrics, its clusters are the densest and the best separated in the principal component analysis of variants with rare SNVs compared with the other methods and different allele frequency cutoffs. In an application, we assessed the role of rare variants on local and global PS, using WGS data from multiethnic Alzheimer's disease data sets and European or East Asian populations from the 1000 Genome Project.
Assuntos
Genoma , Genômica , Análise de Componente Principal , Frequência do Gene , Simulação por Computador , Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo ÚnicoRESUMO
Rationale: The identification of early chronic obstructive pulmonary disease (COPD) is essential to appropriately counsel patients regarding smoking cessation, provide symptomatic treatment, and eventually develop disease-modifying treatments. Disease severity in COPD is defined using race-specific spirometry equations. These may disadvantage non-White individuals in diagnosis and care. Objectives: Determine the impact of race-specific equations on African American (AA) versus non-Hispanic White individuals. Methods: Cross-sectional analyses of the COPDGene (Genetic Epidemiology of Chronic Obstructive Pulmonary Disease) cohort were conducted, comparing non-Hispanic White (n = 6,766) and AA (n = 3,366) participants for COPD manifestations. Measurements and Main Results: Spirometric classifications using race-specific, multiethnic, and "race-reversed" prediction equations (NHANES [National Health and Nutrition Examination Survey] and Global Lung Function Initiative "Other" and "Global") were compared, as were respiratory symptoms, 6-minute-walk distance, computed tomography imaging, respiratory exacerbations, and St. George's Respiratory Questionnaire. Application of different prediction equations to the cohort resulted in different classifications by stage, with NHANES and Global Lung Function Initiative race-specific equations being minimally different, but race-reversed equations moving AA participants to more severe stages and especially between the Global Initiative for Chronic Obstructive Lung Disease (GOLD) stage 0 and preserved ratio impaired spirometry groups. Classification using the established NHANES race-specific equations demonstrated that for each of GOLD stages 1-4, AA participants were younger, had fewer pack-years and more current smoking, but had more exacerbations, shorter 6-minute-walk distance, greater dyspnea, and worse BODE (body mass index, airway obstruction, dyspnea, and exercise capacity) scores and St. George's Respiratory Questionnaire scores. Differences were greatest in GOLD stages 1 and 2. Race-reversed equations reclassified 774 AA participants (43%) from GOLD stage 0 to preserved ratio impaired spirometry. Conclusions: Race-specific equations underestimated disease severity among AA participants. These effects were particularly evident in early disease and may result in late detection of COPD.
Assuntos
Obstrução das Vias Respiratórias , Doença Pulmonar Obstrutiva Crônica , Humanos , Inquéritos Nutricionais , Estudos Transversais , Doença Pulmonar Obstrutiva Crônica/diagnóstico , Doença Pulmonar Obstrutiva Crônica/epidemiologia , Dispneia/diagnóstico , Espirometria , Volume Expiratório ForçadoRESUMO
The increasing quantity of multi-omic data, such as methylomic and transcriptomic profiles collected on the same specimen or even on the same cell, provides a unique opportunity to explore the complex interactions that define cell phenotype and govern cellular responses to perturbations. We propose a network approach based on Gaussian Graphical Models (GGMs) that facilitates the joint analysis of paired omics data. This method, called DRAGON (Determining Regulatory Associations using Graphical models on multi-Omic Networks), calibrates its parameters to achieve an optimal trade-off between the network's complexity and estimation accuracy, while explicitly accounting for the characteristics of each of the assessed omics 'layers.' In simulation studies, we show that DRAGON adapts to edge density and feature size differences between omics layers, improving model inference and edge recovery compared to state-of-the-art methods. We further demonstrate in an analysis of joint transcriptome - methylome data from TCGA breast cancer specimens that DRAGON can identify key molecular mechanisms such as gene regulation via promoter methylation. In particular, we identify Transcription Factor AP-2 Beta (TFAP2B) as a potential multi-omic biomarker for basal-type breast cancer. DRAGON is available as open-source code in Python through the Network Zoo package (netZooPy v0.8; netzoo.github.io).
Assuntos
Multiômica , Neoplasias , Humanos , Software , Simulação por Computador , Transcriptoma , Neoplasias/genética , Redes Reguladoras de GenesRESUMO
The identification and understanding of gene-environment interactions can provide insights into the pathways and mechanisms underlying complex diseases. However, testing for gene-environment interaction remains a challenge since a.) statistical power is often limited and b.) modeling of environmental effects is nontrivial and such model misspecifications can lead to false positive interaction findings. To address the lack of statistical power, recent methods aim to identify interactions on an aggregated level using, for example, polygenic risk scores. While this strategy can increase the power to detect interactions, identifying contributing genes and pathways is difficult based on these relatively global results. Here, we propose RITSS (Robust Interaction Testing using Sample Splitting), a gene-environment interaction testing framework for quantitative traits that is based on sample splitting and robust test statistics. RITSS can incorporate sets of genetic variants and/or multiple environmental factors. Based on the user's choice of statistical/machine learning approaches, a screening step selects and combines potential interactions into scores with improved interpretability. In the testing step, the application of robust statistics minimizes the susceptibility to main effect misspecifications. Using extensive simulation studies, we demonstrate that RITSS controls the type 1 error rate in a wide range of scenarios, and we show how the screening strategy influences statistical power. In an application to lung function phenotypes and human height in the UK Biobank, RITSS identified highly significant interactions based on subcomponents of genetic risk scores. While the contributing single variant interaction signals are weak, our results indicate interaction patterns that result in strong aggregated effects, providing potential insights into underlying gene-environment interaction mechanisms.
Assuntos
Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Humanos , Loci Gênicos , Interação Gene-Ambiente , Fenótipo , Simulação por Computador , Estudo de Associação Genômica AmplaRESUMO
Compared to men, women often develop COPD at an earlier age with worse respiratory symptoms despite lower smoking exposure. However, most preventive, and therapeutic strategies ignore biological sex differences in COPD. Our goal was to better understand sex-specific gene regulatory processes in lung tissue and the molecular basis for sex differences in COPD onset and severity. We analyzed lung tissue gene expression and DNA methylation data from 747 individuals in the Lung Tissue Research Consortium (LTRC), and 85 individuals in an independent dataset. We identified sex differences in COPD-associated gene regulation using gene regulatory networks. We used linear regression to test for sex-biased associations of methylation with lung function, emphysema, smoking, and age. Analyzing gene regulatory networks in the control group, we identified that genes involved in the extracellular matrix (ECM) have higher transcriptional factor targeting in females than in males. However, this pattern is reversed in COPD, with males showing stronger regulatory targeting of ECM-related genes than females. Smoking exposure, age, lung function, and emphysema were all associated with sex-specific differential methylation of ECM-related genes. We identified sex-based gene regulatory patterns of ECM-related genes associated with lung function and emphysema. Multiple factors including epigenetics, smoking, aging, and cell heterogeneity influence sex-specific gene regulation in COPD. Our findings underscore the importance of considering sex as a key factor in disease susceptibility and severity.
RESUMO
BACKGROUND: Studies have identified individual blood biomarkers associated with chronic obstructive pulmonary disease (COPD) and related phenotypes. However, complex diseases such as COPD typically involve changes in multiple molecules with interconnections that may not be captured when considering single molecular features. METHODS: Leveraging proteomic data from 3,173 COPDGene Non-Hispanic White (NHW) and African American (AA) participants, we applied sparse multiple canonical correlation network analysis (SmCCNet) to 4,776 proteins assayed on the SomaScan v4.0 platform to derive sparse networks of proteins associated with current vs. former smoking status, airflow obstruction, and emphysema quantitated from high-resolution computed tomography scans. We then used NetSHy, a dimension reduction technique leveraging network topology, to produce summary scores of each proteomic network, referred to as NetSHy scores. We next performed a genome-wide association study (GWAS) to identify variants associated with the NetSHy scores, or network quantitative trait loci (nQTLs). Finally, we evaluated the replicability of the networks in an independent cohort, SPIROMICS. RESULTS: We identified networks of 13 to 104 proteins for each phenotype and exposure in NHW and AA, and the derived NetSHy scores significantly associated with the variable of interests. Networks included known (sRAGE, ALPP, MIP1) and novel molecules (CA10, CPB1, HIS3, PXDN) and interactions involved in COPD pathogenesis. We observed 7 nQTL loci associated with NetSHy scores, 4 of which remained after conditional analysis. Networks for smoking status and emphysema, but not airflow obstruction, demonstrated a high degree of replicability across race groups and cohorts. CONCLUSIONS: In this work, we apply state-of-the-art molecular network generation and summarization approaches to proteomic data from COPDGene participants to uncover protein networks associated with COPD phenotypes. We further identify genetic associations with networks. This work discovers protein networks containing known and novel proteins and protein interactions associated with clinically relevant COPD phenotypes across race groups and cohorts.
Assuntos
Estudo de Associação Genômica Ampla , Proteômica , Doença Pulmonar Obstrutiva Crônica , Fumar , Humanos , Doença Pulmonar Obstrutiva Crônica/genética , Fumar/genética , Masculino , Feminino , Pessoa de Meia-Idade , Idoso , Locos de Características Quantitativas , Fenótipo , Polimorfismo de Nucleotídeo Único , Variação GenéticaRESUMO
BACKGROUND: Beyond exposure to cigarette smoking and aging, the factors that influence lung function decline to incident chronic obstructive pulmonary disease (COPD) remain unclear. Advancements have been made in categorizing COPD into emphysema and airway predominant disease subtypes; however, predicting which healthy individuals will progress to COPD is difficult because they can exhibit profoundly different disease trajectories despite similar initial risk factors. This study aimed to identify clinical, genetic, and radiological features that are directly linked-and subsequently predict-abnormal lung function. METHODS AND FINDINGS: We employed graph modeling on 2,643 COPDGene participants (aged 45 to 80 years, 51.25% female, 35.1% African Americans; enrollment 11/2007-4/2011) with smoking history but normal spirometry at study enrollment to identify variables that are directly linked to future lung function abnormalities. We developed logistic regression and random forest predictive models for distinguishing individuals who maintain lung function from those who decline. Of the 131 variables analyzed, 6 were identified as informative to future lung function abnormalities, namely forced expiratory flow in the middle range (FEF25-75%), average lung wall thickness in a 10 mm radius (Pi10), severe emphysema, age, sex, and height. We investigated whether these features predict individuals leaving GOLD 0 status (normal spirometry according to Global Initiative for Obstructive Lung Disease (GOLD) criteria). Linear models, trained with these features, were quite predictive (area under receiver operator characteristic curve or AUROC = 0.75). Random forest predictors performed similarly to logistic regression (AUROC = 0.7), indicating that no significant nonlinear effects were present. The results were externally validated on 150 participants from Specialized Center for Clinically Oriented Research (SCCOR) cohort (aged 45 to 80 years, 52.7% female, 4.7% African Americans; enrollment: 7/2007-12/2012) (AUROC = 0.89). The main limitation of longitudinal studies with 5- and 10-year follow-up is the introduction of mortality bias that disproportionately affects the more severe cases. However, our study focused on spirometrically normal individuals, who have a lower mortality rate. Another limitation is the use of strict criteria to define spirometrically normal individuals, which was unavoidable when studying factors associated with changes in normalized forced expiratory volume in 1 s (FEV1%predicted) or the ratio of FEV1/FVC (forced vital capacity). CONCLUSIONS: This study took an agnostic approach to identify which baseline measurements differentiate and predict the early stages of lung function decline in individuals with previous smoking history. Our analysis suggests that emphysema affects obstruction onset, while airway predominant pathology may play a more important role in future FEV1 (%predicted) decline without obstruction, and FEF25-75% may affect both.
Assuntos
Doença Pulmonar Obstrutiva Crônica , Humanos , Doença Pulmonar Obstrutiva Crônica/fisiopatologia , Doença Pulmonar Obstrutiva Crônica/diagnóstico , Doença Pulmonar Obstrutiva Crônica/epidemiologia , Feminino , Masculino , Idoso , Pessoa de Meia-Idade , Fatores de Risco , Idoso de 80 Anos ou mais , Espirometria , Pulmão/fisiopatologia , Pulmão/diagnóstico por imagem , Incidência , Volume Expiratório ForçadoRESUMO
INTRODUCTION: The ability to identify early epigenetic signatures underlying the inheritance of cardiovascular risk, including trans- and intergenerational effects, may help to stratify people before cardiac symptoms occur. METHODS: Prospective and retrospective cohorts and case-control studies focusing on DNA methylation and maternal/paternal effects were searched in Pubmed from 1997 to 2023 by using the following keywords: DNA methylation, genomic imprinting, and network analysis in combination with transgenerational/intergenerational effects. RESULTS: Maternal and paternal exposures to traditional cardiovascular risk factors during critical temporal windows, including the preconceptional period or early pregnancy, may perturb the plasticity of the epigenome (mainly DNA methylation) of the developing fetus especially at imprinted loci, such as the insulin-like growth factor type 2 (IGF2) gene. Thus, the epigenome is akin to a "molecular archive" able to memorize parental environmental insults and predispose an individual to cardiovascular diseases onset in later life. Direct evidence for human transgenerational epigenetic inheritance (at least three generations) of cardiovascular risk is lacking but it is supported by epidemiological studies. Several blood-based association studies showed potential intergenerational epigenetic effects (single-generation studies) which may mediate the transmittance of cardiovascular risk from parents to offspring. DISCUSSION: In this narrative review, we discuss some relevant examples of trans- and intergenerational epigenetic associations with cardiovascular risk. In our perspective, we propose three network-oriented approaches which may help to clarify the unsolved issues regarding transgenerational epigenetic inheritance of cardiovascular risk and provide potential early biomarkers for primary prevention.
Assuntos
Doenças Cardiovasculares , Epigênese Genética , Masculino , Gravidez , Feminino , Humanos , Doenças Cardiovasculares/genética , Estudos Retrospectivos , Estudos Prospectivos , Metilação de DNARESUMO
BACKGROUND: Sex differences related to immune responses can influence atopic manifestations in childhood asthma. While genome-wide association studies have investigated a sex-specific genetic architecture of the immune response, gene-by-sex interactions have not been extensively analysed for atopy-related markers including allergy skin tests, IgE and eosinophils in asthmatic children. METHODS: We performed a genome-wide gene-by-sex interaction analysis for atopy-related markers using whole-genome sequencing data based on 889 trios from the Genetic Epidemiology of Asthma in Costa Rica Study (GACRS) and 284 trios from the Childhood Asthma Management Program (CAMP). We also tested the findings in UK Biobank participants with self-reported childhood asthma. Furthermore, downstream analyses in GACRS integrated gene expression to disentangle observed associations. RESULTS: Single nucleotide polymorphism (SNP) rs1255383 at 10q11.21 demonstrated a genome-wide significant gene-by-sex interaction (pinteraction=9.08×10-10) for atopy (positive skin test) with opposite direction of effects between females and males. In the UK Biobank participants with a history of childhood asthma, the signal was consistently observed with the same sex-specific effect directions for high eosinophil count (pinteraction=0.0058). Gene expression of ZNF33B (zinc finger protein 33B), located at 10q11.21, was moderately associated with atopy in girls, but not in boys. CONCLUSIONS: We report SNPs in/near a zinc finger gene as novel sex-differential loci for atopy-related markers with opposite effect directions in females and males. A potential role for ZNF33B should be studied further as an important driver of sex-divergent features of atopy in childhood asthma.
Assuntos
Asma , Hipersensibilidade Imediata , Criança , Humanos , Masculino , Feminino , Estudo de Associação Genômica Ampla , Imunoglobulina E , Asma/epidemiologia , Hipersensibilidade Imediata/genética , Hipersensibilidade Imediata/epidemiologia , Eosinófilos , Polimorfismo de Nucleotídeo Único , Predisposição Genética para DoençaRESUMO
OBJECTIVE: Higher optimism is associated with reduced mortality and a lower risk of age-related chronic diseases. DNA methylation (DNAm) may provide insight into mechanisms underlying these relationships. We hypothesized that DNAm would differ among older individuals who are more versus less optimistic. METHODS: Using cross-sectional data from two population-based cohorts of women with diverse races/ethnicities ( n = 3816) and men (only White, n = 667), we investigated the associations of optimism with epigenome-wide leukocyte DNAm. Random-effects meta-analyses were subsequently used to pool the individual results. Significantly differentially methylated cytosine-phosphate-guanines (CpGs) were identified by the "number of independent degrees of freedom" approach: effective degrees of freedom correction using the number of principal components (PCs), explaining >95% of the variation of the DNAm data (PC-correction). We performed regional analyses using comb-p and pathway analyses using the Ingenuity Pathway Analysis software. RESULTS: We found that essentially all CpGs (total probe N = 359,862) were homogeneous across sex and race/ethnicity in the DNAm-optimism association. In the single CpG site analyses based on homogeneous CpGs, we identified 13 significantly differentially methylated probes using PC-correction. We found four significantly differentially methylated regions and two significantly differentially methylated pathways. The annotated genes from the single CpG site and regional analyses are involved in psychiatric disorders, cardiovascular disease, cognitive impairment, and cancer. Identified pathways were related to cancer, and neurodevelopmental and neurodegenerative disorders. CONCLUSION: Our findings provide new insights into possible mechanisms underlying optimism and health.
Assuntos
Metilação de DNA , Epigenoma , Masculino , Humanos , Feminino , Epigênese Genética , Estudos Transversais , Estudo de Associação Genômica Ampla , Ilhas de CpG/genéticaRESUMO
BACKGROUND: The association between genetic variants on the X chromosome to risk of COPD has not been fully explored. We hypothesize that the X chromosome harbors variants important in determining risk of COPD related phenotypes and may drive sex differences in COPD manifestations. METHODS: Using X chromosome data from three COPD-enriched cohorts of adult smokers, we performed X chromosome specific quality control, imputation, and testing for association with COPD case-control status, lung function, and quantitative emphysema. Analyses were performed among all subjects, then stratified by sex, and subsequently combined in meta-analyses. RESULTS: Among 10,193 subjects of non-Hispanic white or European ancestry, a variant near TMSB4X, rs5979771, reached genome-wide significance for association with lung function measured by FEV1/FVC ([Formula: see text] 0.020, SE 0.004, p 4.97 × 10-08), with suggestive evidence of association with FEV1 ([Formula: see text] 0.092, SE 0.018, p 3.40 × 10-07). Sex-stratified analyses revealed X chromosome variants that were differentially trending in one sex, with significantly different effect sizes or directions. CONCLUSIONS: This investigation identified loci influencing lung function, COPD, and emphysema in a comprehensive genetic association meta-analysis of X chromosome genetic markers from multiple COPD-related datasets. Sex differences play an important role in the pathobiology of complex lung disease, including X chromosome variants that demonstrate differential effects by sex and variants that may be relevant through escape from X chromosome inactivation. Comprehensive interrogation of the X chromosome to better understand genetic control of COPD and lung function is important to further understanding of disease pathology. Trial registration Genetic Epidemiology of COPD Study (COPDGene) is registered at ClinicalTrials.gov, NCT00608764 (Active since January 28, 2008). Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints Study (ECLIPSE), GlaxoSmithKline study code SCO104960, is registered at ClinicalTrials.gov, NCT00292552 (Active since February 16, 2006). Genetics of COPD in Norway Study (GenKOLS) holds GlaxoSmithKline study code RES11080, Genetics of Chronic Obstructive Lung Disease.
Assuntos
Enfisema , Doença Pulmonar Obstrutiva Crônica , Enfisema Pulmonar , Feminino , Masculino , Humanos , Predisposição Genética para Doença/genética , Estudo de Associação Genômica Ampla , Doença Pulmonar Obstrutiva Crônica/diagnóstico , Doença Pulmonar Obstrutiva Crônica/epidemiologia , Doença Pulmonar Obstrutiva Crônica/genética , Fenótipo , Cromossomo XRESUMO
BACKGROUND: Asthma is a heterogeneous disease with high morbidity. Advancement in high-throughput multi-omics approaches has enabled the collection of molecular assessments at different layers, providing a complementary perspective of complex diseases. Numerous computational methods have been developed for the omics-based patient classification or disease outcome prediction. Yet, a systematic benchmarking of those methods using various combinations of omics data for the prediction of asthma development is still lacking. OBJECTIVE: We aimed to investigate the computational methods in disease status prediction using multi-omics data. METHOD: We systematically benchmarked 18 computational methods using all the 63 combinations of six omics data (GWAS, miRNA, mRNA, microbiome, metabolome, DNA methylation) collected in The Vitamin D Antenatal Asthma Reduction Trial (VDAART) cohort. We evaluated each method using standard performance metrics for each of the 63 omics combinations. RESULTS: Our results indicate that overall Logistic Regression, Multi-Layer Perceptron, and MOGONET display superior performance, and the combination of transcriptional, genomic and microbiome data achieves the best prediction. Moreover, we find that including the clinical data can further improve the prediction performance for some but not all the omics combinations. CONCLUSIONS: Specific omics combinations can reach the optimal prediction of asthma development in children. And certain computational methods showed superior performance than other methods.
Assuntos
Asma , MicroRNAs , Gravidez , Humanos , Feminino , Criança , Benchmarking , Genômica/métodos , Asma/diagnóstico , Asma/epidemiologia , Asma/genética , PrognósticoRESUMO
BACKGROUND: COPD diagnosis is tightly linked to the fixed-ratio spirometry criteria of FEV1/FVC < 0.7. African-Americans are less often diagnosed with COPD. OBJECTIVE: Compare COPD diagnosis by fixed-ratio with findings and outcomes by race. DESIGN: Genetic Epidemiology of COPD (COPDGene) (2007-present), cross-sectional comparing non-Hispanic white (NHW) and African-American (AA) participants for COPD diagnosis, manifestations, and outcomes. SETTING: Multicenter, longitudinal US cohort study. PARTICIPANTS: Current or former smokers with ≥ 10-pack-year smoking history enrolled at 21 clinical centers including over-sampling of participants with known COPD and AA. Exclusions were pre-existing non-COPD lung disease, except for a history of asthma. MEASUREMENTS: Subject diagnosis by conventional criteria. Mortality, imaging, respiratory symptoms, function, and socioeconomic characteristics, including area deprivation index (ADI). Matched analysis (age, sex, and smoking status) of AA vs. NHW within participants without diagnosed COPD (GOLD 0; FEV1 ≥ 80% predicted and FEV1/FVC ≥ 0.7). RESULTS: Using the fixed ratio, 70% of AA (n = 3366) were classified as non-COPD, versus 49% of NHW (n = 6766). AA smokers were younger (55 vs. 62 years), more often current smoking (80% vs. 39%), with fewer pack-years but similar 12-year mortality. Density distribution plots for FEV1 and FVC raw spirometry values showed disproportionate reductions in FVC relative to FEV1 in AA that systematically led to higher ratios. The matched analysis demonstrated GOLD 0 AA had greater symptoms, worse DLCO, spirometry, BODE scores (1.03 vs 0.54, p < 0.0001), and greater deprivation than NHW. LIMITATIONS: Lack of an alternative diagnostic metric for comparison. CONCLUSIONS: The fixed-ratio spirometric criteria for COPD underdiagnosed potential COPD in AA participants when compared to broader diagnostic criteria. Disproportionate reductions in FVC relative to FEV1 leading to higher FEV1/FVC were identified in these participants and associated with deprivation. Broader diagnostic criteria for COPD are needed to identify the disease across all populations.
Assuntos
Doença Pulmonar Obstrutiva Crônica , Humanos , Negro ou Afro-Americano , Estudos de Coortes , Estudos Transversais , Volume Expiratório Forçado , Estudos Longitudinais , Doença Pulmonar Obstrutiva Crônica/diagnóstico , Doença Pulmonar Obstrutiva Crônica/epidemiologia , Espirometria , Capacidade Vital , Pessoa de Meia-Idade , Brancos , Fumar/efeitos adversosRESUMO
Rationale: Multiple studies have demonstrated an increased risk of chronic obstructive pulmonary disease (COPD) in heterozygous carriers of the AAT (alpha-1 antitrypsin) Z allele. However, it is not known if MZ subjects with COPD are phenotypically different from noncarriers (MM genotype) with COPD. Objectives: To assess if MZ subjects with COPD have different clinical features compared with MM subjects with COPD. Methods: Genotypes of SERPINA1 were ascertained by using whole-genome sequencing data in three independent studies. We compared outcomes between MM subjects with COPD and MZ subjects with COPD in each study and combined the results in a meta-analysis. We performed longitudinal and survival analyses to compare outcomes in MM and MZ subjects with COPD over time. Measurements and Main Results: We included 290 MZ subjects with COPD and 6,184 MM subjects with COPD across the three studies. MZ subjects had a lower FEV1% predicted and greater quantitative emphysema on chest computed tomography scans compared with MM subjects. In a meta-analysis, the FEV1 was 3.9% lower (95% confidence interval [CI], -6.55% to -1.26%) and emphysema (the percentage of lung attenuation areas <-950 HU) was 4.14% greater (95% CI, 1.44% to 6.84%) in MZ subjects. We found one gene, PGF (placental growth factor), to be differentially expressed in lung tissue from one study between MZ subjects and MM subjects. Conclusions: Carriers of the AAT Z allele (those who were MZ heterozygous) with COPD had lower lung function and more emphysema than MM subjects with COPD. Taken with the subtle differences in gene expression between the two groups, our findings suggest that MZ subjects represent an endotype of COPD.
Assuntos
Genótipo , Heterozigoto , Fenótipo , Doença Pulmonar Obstrutiva Crônica/genética , alfa 1-Antitripsina/genética , Adulto , Idoso , Idoso de 80 Anos ou mais , Estudos de Casos e Controles , Feminino , Marcadores Genéticos , Humanos , Estudos Longitudinais , Masculino , Pessoa de Meia-Idade , Doença Pulmonar Obstrutiva Crônica/diagnóstico , Doença Pulmonar Obstrutiva Crônica/mortalidade , Doença Pulmonar Obstrutiva Crônica/fisiopatologia , Testes de Função Respiratória , Análise de Sobrevida , Sequenciamento Completo do GenomaRESUMO
BACKGROUND: Chronic obstructive pulmonary disease (COPD) is a highly morbid and heterogenous disease. While COPD is defined by spirometry, many COPD characteristics are seen in cigarette smokers with normal spirometry. The extent to which COPD and COPD heterogeneity is captured in omics of lung tissue is not known. METHODS: We clustered gene expression and methylation data in 78 lung tissue samples from former smokers with normal lung function or severe COPD. We applied two integrative omics clustering methods: (1) Similarity Network Fusion (SNF) and (2) Entropy-Based Consensus Clustering (ECC). RESULTS: SNF clusters were not significantly different by the percentage of COPD cases (48.8% vs. 68.6%, p = 0.13), though were different according to median forced expiratory volume in one second (FEV1) % predicted (82 vs. 31, p = 0.017). In contrast, the ECC clusters showed stronger evidence of separation by COPD case status (48.2% vs. 81.8%, p = 0.013) and similar stratification by median FEV1% predicted (82 vs. 30.5, p = 0.0059). ECC clusters using both gene expression and methylation were identical to the ECC clustering solution generated using methylation data alone. Both methods selected clusters with differentially expressed transcripts enriched for interleukin signaling and immunoregulatory interactions between lymphoid and non-lymphoid cells. CONCLUSIONS: Unsupervised clustering analysis from integrated gene expression and methylation data in lung tissue resulted in clusters with modest concordance with COPD, though were enriched in pathways potentially contributing to COPD-related pathology and heterogeneity.