RESUMO
Gene regulatory networks (GRNs) are effective tools for inferring complex interactions between molecules that regulate biological processes and hence can provide insights into drivers of biological systems. Inferring coexpression networks is a critical element of GRN inference, as the correlation between expression patterns may indicate that genes are coregulated by common factors. However, methods that estimate coexpression networks generally derive an aggregate network representing the mean regulatory properties of the population and so fail to fully capture population heterogeneity. Bayesian optimized networks obtained by assimilating omic data (BONOBO) is a scalable Bayesian model for deriving individual sample-specific coexpression matrices that recognizes variations in molecular interactions across individuals. For each sample, BONOBO assumes a Gaussian distribution on the log-transformed centered gene expression and a conjugate prior distribution on the sample-specific coexpression matrix constructed from all other samples in the data. Combining the sample-specific gene coexpression with the prior distribution, BONOBO yields a closed-form solution for the posterior distribution of the sample-specific coexpression matrices, thus allowing the analysis of large data sets. We demonstrate BONOBO's utility in several contexts, including analyzing gene regulation in yeast transcription factor knockout studies, the prognostic significance of miRNA-mRNA interaction in human breast cancer subtypes, and sex differences in gene regulation within human thyroid tissue. We find that BONOBO outperforms other methods that have been used for sample-specific coexpression network inference and provides insight into individual differences in the drivers of biological processes.
Assuntos
Teorema de Bayes , Neoplasias da Mama , Redes Reguladoras de Genes , MicroRNAs , Humanos , Neoplasias da Mama/genética , Neoplasias da Mama/metabolismo , Feminino , Masculino , MicroRNAs/genética , MicroRNAs/metabolismo , Perfilação da Expressão Gênica/métodos , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , AlgoritmosRESUMO
Compared to men, women often develop COPD at an earlier age with worse respiratory symptoms despite lower smoking exposure. However, most preventive, and therapeutic strategies ignore biological sex differences in COPD. Our goal was to better understand sex-specific gene regulatory processes in lung tissue and the molecular basis for sex differences in COPD onset and severity. We analyzed lung tissue gene expression and DNA methylation data from 747 individuals in the Lung Tissue Research Consortium (LTRC), and 85 individuals in an independent dataset. We identified sex differences in COPD-associated gene regulation using gene regulatory networks. We used linear regression to test for sex-biased associations of methylation with lung function, emphysema, smoking, and age. Analyzing gene regulatory networks in the control group, we identified that genes involved in the extracellular matrix (ECM) have higher transcriptional factor targeting in females than in males. However, this pattern is reversed in COPD, with males showing stronger regulatory targeting of ECM-related genes than females. Smoking exposure, age, lung function, and emphysema were all associated with sex-specific differential methylation of ECM-related genes. We identified sex-based gene regulatory patterns of ECM-related genes associated with lung function and emphysema. Multiple factors including epigenetics, smoking, aging, and cell heterogeneity influence sex-specific gene regulation in COPD. Our findings underscore the importance of considering sex as a key factor in disease susceptibility and severity.
RESUMO
Gene regulation plays a fundamental role in shaping tissue identity, function, and response to perturbation. Regulatory processes are controlled by complex networks of interacting elements, including transcription factors, miRNAs and their target genes. The structure of these networks helps to determine phenotypes and can ultimately influence the development of disease or response to therapy. We developed GRAND (https://grand.networkmedicine.org) as a database for computationally-inferred, context-specific gene regulatory network models that can be compared between biological states, or used to predict which drugs produce changes in regulatory network structure. The database includes 12 468 genome-scale networks covering 36 human tissues, 28 cancers, 1378 unperturbed cell lines, as well as 173 013 TF and gene targeting scores for 2858 small molecule-induced cell line perturbation paired with phenotypic information. GRAND allows the networks to be queried using phenotypic information and visualized using a variety of interactive tools. In addition, it includes a web application that matches disease states to potentially therapeutic small molecule drugs using regulatory network properties.
Assuntos
Bases de Dados Genéticas , Bases de Dados de Produtos Farmacêuticos , Redes Reguladoras de Genes/genética , Software , Regulação da Expressão Gênica/genética , Genoma Humano/genética , Humanos , MicroRNAs/classificação , MicroRNAs/genética , Fatores de Transcrição/classificação , Fatores de Transcrição/genéticaRESUMO
BACKGROUND: The association between genetic variants on the X chromosome to risk of COPD has not been fully explored. We hypothesize that the X chromosome harbors variants important in determining risk of COPD related phenotypes and may drive sex differences in COPD manifestations. METHODS: Using X chromosome data from three COPD-enriched cohorts of adult smokers, we performed X chromosome specific quality control, imputation, and testing for association with COPD case-control status, lung function, and quantitative emphysema. Analyses were performed among all subjects, then stratified by sex, and subsequently combined in meta-analyses. RESULTS: Among 10,193 subjects of non-Hispanic white or European ancestry, a variant near TMSB4X, rs5979771, reached genome-wide significance for association with lung function measured by FEV1/FVC ([Formula: see text] 0.020, SE 0.004, p 4.97 × 10-08), with suggestive evidence of association with FEV1 ([Formula: see text] 0.092, SE 0.018, p 3.40 × 10-07). Sex-stratified analyses revealed X chromosome variants that were differentially trending in one sex, with significantly different effect sizes or directions. CONCLUSIONS: This investigation identified loci influencing lung function, COPD, and emphysema in a comprehensive genetic association meta-analysis of X chromosome genetic markers from multiple COPD-related datasets. Sex differences play an important role in the pathobiology of complex lung disease, including X chromosome variants that demonstrate differential effects by sex and variants that may be relevant through escape from X chromosome inactivation. Comprehensive interrogation of the X chromosome to better understand genetic control of COPD and lung function is important to further understanding of disease pathology. Trial registration Genetic Epidemiology of COPD Study (COPDGene) is registered at ClinicalTrials.gov, NCT00608764 (Active since January 28, 2008). Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints Study (ECLIPSE), GlaxoSmithKline study code SCO104960, is registered at ClinicalTrials.gov, NCT00292552 (Active since February 16, 2006). Genetics of COPD in Norway Study (GenKOLS) holds GlaxoSmithKline study code RES11080, Genetics of Chronic Obstructive Lung Disease.
Assuntos
Enfisema , Doença Pulmonar Obstrutiva Crônica , Enfisema Pulmonar , Feminino , Masculino , Humanos , Predisposição Genética para Doença/genética , Estudo de Associação Genômica Ampla , Doença Pulmonar Obstrutiva Crônica/diagnóstico , Doença Pulmonar Obstrutiva Crônica/epidemiologia , Doença Pulmonar Obstrutiva Crônica/genética , Fenótipo , Cromossomo XRESUMO
Characterizing the collective regulatory impact of genetic variants on complex phenotypes is a major challenge in developing a genotype to phenotype map. Using expression quantitative trait locus (eQTL) analyses, we constructed bipartite networks in which edges represent significant associations between genetic variants and gene expression levels and found that the network structure informs regulatory function. We show, in 13 tissues, that these eQTL networks are organized into dense, highly modular communities grouping genes often involved in coherent biological processes. We find communities representing shared processes across tissues, as well as communities associated with tissue-specific processes that coalesce around variants in tissue-specific active chromatin regions. Node centrality is also highly informative, with the global and community hubs differing in regulatory potential and likelihood of being disease associated.
Assuntos
Estudo de Associação Genômica Ampla/métodos , Especificidade de Órgãos/genética , Locos de Características Quantitativas/genética , Expressão Gênica/genética , Regulação da Expressão Gênica/genética , Redes Reguladoras de Genes/genética , Predisposição Genética para Doença/genética , Variação Genética , Genótipo , Humanos , Fenótipo , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/fisiologia , Transcriptoma/genéticaRESUMO
An important premise of epidemiology is that individuals with the same disease share similar underlying etiologies and clinical outcomes. In the past few decades, our knowledge of disease pathogenesis has improved, and disease classification systems have evolved to the point where no complex disease processes are considered homogenous. As a result, pathology and epidemiology have been integrated into the single, unified field of molecular pathological epidemiology (MPE). Advancing integrative molecular and population-level health sciences and addressing the unique research challenges specific to the field of MPE necessitates assembling experts in diverse fields, including epidemiology, pathology, biostatistics, computational biology, bioinformatics, genomics, immunology, and nutritional and environmental sciences. Integrating these seemingly divergent fields can lead to a greater understanding of pathogenic processes. The International MPE Meeting Series fosters discussion that addresses the specific research questions and challenges in this emerging field. The purpose of the meeting series is to: discuss novel methods to integrate pathology and epidemiology; discuss studies that provide pathogenic insights into population impact; and educate next-generation scientists. Herein, we share the proceedings of the Fourth International MPE Meeting, held in Boston, MA, USA, on 30 May-1 June, 2018. Major themes of this meeting included 'integrated genetic and molecular pathologic epidemiology', 'immunology-MPE', and 'novel disease phenotyping'. The key priority areas for future research identified by meeting attendees included integration of tumor immunology and cancer disparities into epidemiologic studies, further collaboration between computational and population-level scientists to gain new insight on exposure-disease associations, and future pooling projects of studies with comparable data.
Assuntos
Epidemiologia , Patologia Molecular , Humanos , Neoplasias/epidemiologia , Neoplasias/genética , Neoplasias/imunologia , Neoplasias/patologiaRESUMO
BACKGROUND: Although ultrahigh-throughput RNA-Sequencing has become the dominant technology for genome-wide transcriptional profiling, the vast majority of RNA-Seq studies typically profile only tens of samples, and most analytical pipelines are optimized for these smaller studies. However, projects are generating ever-larger data sets comprising RNA-Seq data from hundreds or thousands of samples, often collected at multiple centers and from diverse tissues. These complex data sets present significant analytical challenges due to batch and tissue effects, but provide the opportunity to revisit the assumptions and methods that we use to preprocess, normalize, and filter RNA-Seq data - critical first steps for any subsequent analysis. RESULTS: We find that analysis of large RNA-Seq data sets requires both careful quality control and the need to account for sparsity due to the heterogeneity intrinsic in multi-group studies. We developed Yet Another RNA Normalization software pipeline (YARN), that includes quality control and preprocessing, gene filtering, and normalization steps designed to facilitate downstream analysis of large, heterogeneous RNA-Seq data sets and we demonstrate its use with data from the Genotype-Tissue Expression (GTEx) project. CONCLUSIONS: An R package instantiating YARN is available at http://bioconductor.org/packages/yarn .
Assuntos
Bases de Dados Genéticas , Especificidade de Órgãos/genética , Análise de Sequência de RNA/métodos , Análise de Sequência de RNA/normas , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Humanos , Anotação de Sequência Molecular , Análise de Componente Principal , Controle de Qualidade , Padrões de Referência , Tamanho da Amostra , SoftwareRESUMO
BACKGROUND: Cell lines are an indispensable tool in biomedical research and often used as surrogates for tissues. Although there are recognized important cellular and transcriptomic differences between cell lines and tissues, a systematic overview of the differences between the regulatory processes of a cell line and those of its tissue of origin has not been conducted. The RNA-Seq data generated by the GTEx project is the first available data resource in which it is possible to perform a large-scale transcriptional and regulatory network analysis comparing cell lines with their tissues of origin. RESULTS: We compared 127 paired Epstein-Barr virus transformed lymphoblastoid cell lines (LCLs) and whole blood samples, and 244 paired primary fibroblast cell lines and skin samples. While gene expression analysis confirms that these cell lines carry the expression signatures of their primary tissues, albeit at reduced levels, network analysis indicates that expression changes are the cumulative result of many previously unreported alterations in transcription factor (TF) regulation. More specifically, cell cycle genes are over-expressed in cell lines compared to primary tissues, and this alteration in expression is a result of less repressive TF targeting. We confirmed these regulatory changes for four TFs, including SMAD5, using independent ChIP-seq data from ENCODE. CONCLUSIONS: Our results provide novel insights into the regulatory mechanisms controlling the expression differences between cell lines and tissues. The strong changes in TF regulation that we observe suggest that network changes, in addition to transcriptional levels, should be considered when using cell lines as models for tissues.
Assuntos
Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Ciclo Celular/genética , Linhagem Celular , Humanos , Especificidade de ÓrgãosRESUMO
The phenotype of primary cells in culture varies according to the donor environmental condition. We recently showed that the time of the day imposes a molecular program linked to the inflammatory response that is heritable in culture. Here we investigated whether microRNAs (miRNAs) would show differential expression according to the time when cells were obtained, namely daytime or nighttime. Cells obtained from explants of cremaster muscle and cultivated until confluence (â¼20 days) presented high CD133 expression. Global miRNA expression analysis was performed through deep sequencing in order to compare both cultured cells. A total of 504 mature miRNAs were identified, with a specific miRNA signature being associated to the light versus dark phase of a circadian cycle. miR-1249 and miR-129-2-3p were highly expressed in daytime cells, while miR-182, miR-96-5p, miR-146a-3p, miR-146a-5p, and miR-223-3p were highly expressed in nighttime cells. Nighttime cells are regulated for programs involved in cell processes and development, as well as in the inflammation, cell differentiation and maturation; while daytime cells express miRNAs that control stemness and cytoskeleton remodeling. In summary, the time of the day imposes a differential profile regarding to miRNA signature on CD133(+) cells in culture. Understanding this daily profile in the phenotype of cultured cells is highly relevant for clinical outputs, including cellular therapy approaches. J. Cell. Physiol. 231: 1953-1963, 2016. © 2016 Wiley Periodicals, Inc.
Assuntos
Diferenciação Celular/genética , Inflamação/genética , MicroRNAs/genética , Fotoperíodo , Antígeno AC133/imunologia , Animais , Células Cultivadas , Perfilação da Expressão Gênica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Masculino , Ratos WistarRESUMO
BACKGROUND: Lung adenocarcinoma (LUAD) has been observed to have significant sex differences in incidence, prognosis, and response to therapy. However, the molecular mechanisms responsible for these disparities have not been investigated extensively. METHODS: Sample-specific gene regulatory network methods were used to analyze RNA sequencing data from non-cancerous human lung samples from The Genotype Tissue Expression Project (GTEx) and lung adenocarcinoma primary tumor samples from The Cancer Genome Atlas (TCGA); results were validated on independent data. RESULTS: We found that genes associated with key biological pathways including cell proliferation, immune response and drug metabolism are differentially regulated between males and females in both healthy lung tissue and tumor, and that these regulatory differences are further perturbed by tobacco smoking. We also discovered significant sex bias in transcription factor targeting patterns of clinically actionable oncogenes and tumor suppressor genes, including AKT2 and KRAS. Using differentially regulated genes between healthy and tumor samples in conjunction with a drug repurposing tool, we identified several small-molecule drugs that might have sex-biased efficacy as cancer therapeutics and further validated this observation using an independent cell line database. CONCLUSIONS: These findings underscore the importance of including sex as a biological variable and considering gene regulatory processes in developing strategies for disease prevention and management.
Lung adenocarcinoma (LUAD) is a disease that affects males and females differently. Biological sex not only influences chances of developing the disease, but also how the disease progresses and how effective various therapies may be. We analyzed sex-specific gene regulatory networks consisting of transcription factors and the genes they regulate in both healthy lung tissue and in LUAD and identified sex-biased differences. We found that genes associated with cell proliferation, immune response, and drug metabolism are differentially targeted by transcription factors between males and females. We also found that several genes that are drug targets in LUAD, are also regulated differently between males and females. Importantly, these differences are also influenced by an individual's smoking history. Extending our analysis using a drug repurposing tool, we found candidate drugs with evidence that they might work better for one sex or the other. These results demonstrate that considering the differences in gene regulation between males and females will be essential if we are to develop precision medicine strategies for preventing and treating LUAD.
Assuntos
Adenocarcinoma de Pulmão , Redes Reguladoras de Genes , Adenocarcinoma de Pulmão/diagnóstico , Adenocarcinoma de Pulmão/genética , Adenocarcinoma de Pulmão/terapia , Fatores Sexuais , Regulação Neoplásica da Expressão Gênica/genética , Pulmão/metabolismo , Fumar Tabaco/efeitos adversos , Prognóstico , Imunoterapia , Terapia de Alvo Molecular , Linhagem Celular Tumoral , Humanos , Masculino , Feminino , Descoberta de DrogasRESUMO
Aging is the primary risk factor for many individual cancer types, including lung adenocarcinoma (LUAD). To understand how aging-related alterations in the regulation of key cellular processes might affect LUAD risk and survival outcomes, we built individual (person)-specific gene regulatory networks integrating gene expression, transcription factor protein-protein interaction, and sequence motif data, using PANDA/LIONESS algorithms, for both non-cancerous lung tissue samples from the Genotype Tissue Expression (GTEx) project and LUAD samples from The Cancer Genome Atlas (TCGA). In GTEx, we found that pathways involved in cell proliferation and immune response are increasingly targeted by regulatory transcription factors with age; these aging-associated alterations are accelerated by tobacco smoking and resemble oncogenic shifts in the regulatory landscape observed in LUAD and suggests that dysregulation of aging pathways might be associated with an increased risk of LUAD. Comparing normal adjacent samples from individuals with LUAD with healthy lung tissue samples from those without LUAD, we found that aging-associated genes show greater aging-biased targeting patterns in younger individuals with LUAD compared to their healthy counterparts of similar age, a pattern suggestive of age acceleration. This implies that an accelerated aging process may be responsible for tumor incidence in younger individuals. Using drug repurposing tool CLUEreg, we found small molecule drugs with potential geroprotective effects that may alter the accelerating aging profiles we found. We also observed that, in contrast to chronological age, a network-informed aging signature was associated with survival and response to chemotherapy in LUAD.
RESUMO
There is increasing recognition that the sex chromosomes, X and Y, play an important role in health and disease that goes beyond the determination of biological sex. Loss of the Y chromosome (LOY) in blood, which occurs naturally in aging men, has been found to be a driver of cardiac fibrosis and heart failure mortality. LOY also occurs in most solid tumors in males and is often associated with worse survival, suggesting that LOY may give tumor cells a growth or survival advantage. We analyzed LOY in lung adenocarcinoma (LUAD) using both bulk and single-cell expression data and found evidence suggesting that LOY affects the tumor immune environment by altering cancer/testis antigen expression and consequently facilitating tumor immune evasion. Analyzing immunotherapy data, we show that LOY and changes in expression of particular cancer/testis antigens are associated with response to pembrolizumab treatment and outcome, providing a new and powerful biomarker for predicting immunotherapy response in LUAD tumors in males.
RESUMO
Lung adenocarcinoma (LUAD) has been observed to have significant sex differences in incidence, prognosis, and response to therapy. However, the molecular mechanisms responsible for these disparities have not been investigated extensively. Sample-specific gene regulatory network methods were used to analyze RNA sequencing data from non-cancerous human lung samples from The Genotype Tissue Expression Project (GTEx) and lung adenocarcinoma primary tumor samples from The Cancer Genome Atlas (TCGA); results were validated on independent data. We observe that genes associated with key biological pathways including cell proliferation, immune response and drug metabolism are differentially regulated between males and females in both healthy lung tissue, as well as in tumor, and that these regulatory differences are further perturbed by tobacco smoking. We also uncovered significant sex bias in transcription factor targeting patterns of clinically actionable oncogenes and tumor suppressor genes, including AKT2 and KRAS. Using differentially regulated genes between healthy and tumor samples in conjunction with a drug repurposing tool, we identified several small-molecule drugs that might have sex-biased efficacy as cancer therapeutics and further validated this observation using an independent cell line database. These findings underscore the importance of including sex as a biological variable and considering gene regulatory processes in developing strategies for disease prevention and management.
RESUMO
Inference and analysis of gene regulatory networks (GRNs) require software that integrates multi-omic data from various sources. The Network Zoo (netZoo; netzoo.github.io) is a collection of open-source methods to infer GRNs, conduct differential network analyses, estimate community structure, and explore the transitions between biological states. The netZoo builds on our ongoing development of network methods, harmonizing the implementations in various computing languages and between methods to allow better integration of these tools into analytical pipelines. We demonstrate the utility using multi-omic data from the Cancer Cell Line Encyclopedia. We will continue to expand the netZoo to incorporate additional methods.
Assuntos
Redes Reguladoras de Genes , Neoplasias , Humanos , Algoritmos , Software , Multiômica , Biologia Computacional/métodosRESUMO
BACKGROUND: Mutations in SPG11 are the most frequent known cause of autosomal recessive hereditary spastic paraplegia. Corpus callosum thinning is a hallmark of the condition but little is known about damage to other structures in the CNS. OBJECTIVE: To evaluate in vivo cerebral damage in patients with SPG11 mutations. METHODS: 5 patients and 15 age and sex matched healthy controls underwent high resolution diffusion tensor imaging (32 directions) and a T1 volumetric (1 mm slices) acquisition protocol in a 3 T scanner (Philips Achieva). These sequences were then analysed through voxel based morphometry (VBM) and tract based spatial statistics (TBSS). RESULTS: Mean age of the patients was 23.6±4.5 years (range 14-45) and mean duration of disease was 12 years (range 5-15). All patients presented with progressive spastic paraplegia and three were already wheelchair bound when first evaluated. Mutations found were: c.529_533delATATT, c.704_705delAT, c.733_734delAT, c.118C>T and c.7256A>G. VBM identified significant grey matter atrophy in both the thalamus and lentiform nuclei. TBSS analyses revealed reduced fractional anisotropy involving symmetrically subcortical white matter of the temporal and frontal lobes, the cingulated gyrus, cuneus, striatum, corpus callosum and brainstem. CONCLUSIONS: Widespread white matter damage in patients with SPG11 mutations has been demonstrated. Grey matter atrophy was prominent in both the thalamus and basal ganglia but not in the cerebral cortex. These findings suggest that neuronal damage/dysfunction is more widespread than previously recognised in this condition.
Assuntos
Encéfalo/patologia , Proteínas/genética , Paraplegia Espástica Hereditária/patologia , Adolescente , Adulto , Encéfalo/fisiopatologia , Corpo Caloso/patologia , Corpo Estriado/patologia , Imagem de Tensor de Difusão , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Paraplegia Espástica Hereditária/genética , Tálamo/patologia , Adulto JovemRESUMO
Glioblastoma is an aggressive cancer of the brain and spine. While analysis of glioblastoma 'omics data has somewhat improved our understanding of the disease, it has not led to direct improvement in patient survival. Cancer survival is often characterized by differences in gene expression, but the mechanisms that drive these differences are generally unknown. We therefore set out to model the regulatory mechanisms associated with glioblastoma survival. We inferred individual patient gene regulatory networks using data from two different expression platforms from The Cancer Genome Atlas. We performed comparative network analysis between patients with long- and short-term survival. Seven pathways were identified as associated with survival, all of them involved in immune signaling; differential regulation of PD1 signaling was validated to correspond with outcome in an independent dataset from the German Glioma Network. In this pathway, transcriptional repression of genes for which treatment options are available was lost in short-term survivors; this was independent of mutational burden and only weakly associated with T-cell infiltration. Collectively, these results provide a new way to stratify patients with glioblastoma that uses network features as biomarkers to predict survival. They also identify new potential therapeutic interventions, underscoring the value of analyzing gene regulatory networks in individual patients with cancer. SIGNIFICANCE: Genome-wide network modeling of individual glioblastomas identifies dysregulation of PD1 signaling in patients with poor prognosis, indicating this approach can be used to understand how gene regulation influences cancer progression.
Assuntos
Biomarcadores Tumorais/metabolismo , Regulação Neoplásica da Expressão Gênica , Redes Reguladoras de Genes , Glioblastoma/patologia , Linfócitos do Interstício Tumoral/imunologia , Mutação , Receptor de Morte Celular Programada 1/metabolismo , Biomarcadores Tumorais/genética , Neoplasias Encefálicas/genética , Neoplasias Encefálicas/imunologia , Neoplasias Encefálicas/metabolismo , Neoplasias Encefálicas/patologia , Estudos de Coortes , Feminino , Perfilação da Expressão Gênica , Glioblastoma/genética , Glioblastoma/imunologia , Glioblastoma/metabolismo , Humanos , Masculino , Pessoa de Meia-Idade , Prognóstico , Receptor de Morte Celular Programada 1/genética , Taxa de SobrevidaRESUMO
Despite their known importance in clinical medicine, differences based on sex and gender are among the least studied factors affecting cancer susceptibility, progression, survival, and therapeutic response. In particular, the molecular mechanisms driving sex differences are poorly understood and so most approaches to precision medicine use mutational or other genomic data to assign therapy without considering how the sex of the individual might influence therapeutic efficacy. The mandate by the National Institutes of Health that research studies include sex as a biological variable has begun to expand our understanding on its importance. Sex differences in cancer may arise due to a combination of environmental, genetic, and epigenetic factors, as well as differences in gene regulation, and expression. Extensive sex differences occur genome-wide, and ultimately influence cancer biology and outcomes. In this review, we summarize the current state of knowledge about sex-specific genetic and genome-wide influences in cancer, describe how differences in response to environmental exposures and genetic and epigenetic alterations alter the trajectory of the disease, and provide insights into the importance of integrative analyses in understanding the interplay of sex and genomics in cancer. In particular, we will explore some of the emerging analytical approaches, such as the use of network methods, that are providing a deeper understanding of the drivers of differences based on sex and gender. Better understanding these complex factors and their interactions will improve cancer prevention, treatment, and outcomes for all individuals.
RESUMO
Sex differences manifest in many diseases and may drive sex-specific therapeutic responses. To understand the molecular basis of sex differences, we evaluated sex-biased gene regulation by constructing sample-specific gene regulatory networks in 29 human healthy tissues using 8,279 whole-genome expression profiles from the Genotype-Tissue Expression (GTEx) project. We find sex-biased regulatory network structures in each tissue. Even though most transcription factors (TFs) are not differentially expressed between males and females, many have sex-biased regulatory targeting patterns. In each tissue, genes that are differentially targeted by TFs between the sexes are enriched for tissue-related functions and diseases. In brain tissue, for example, genes associated with Parkinson's disease and Alzheimer's disease are targeted by different sets of TFs in each sex. Our systems-based analysis identifies a repertoire of TFs that play important roles in sex-specific architecture of gene regulatory networks, and it underlines sex-specific regulatory processes in both health and disease.
Assuntos
Regulação da Expressão Gênica , Redes Reguladoras de Genes , Especificidade de Órgãos/genética , Caracteres Sexuais , Cromossomos Humanos X/genética , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Feminino , Humanos , Masculino , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismoRESUMO
Understanding sex differences in colon cancer is essential to advance disease prevention, diagnosis, and treatment. Males have a higher risk of developing colon cancer and a lower survival rate than women. However, the molecular features that drive these sex differences are poorly understood. In this study, we use both transcript-based and gene regulatory network methods to analyze RNA-seq data from The Cancer Genome Atlas for 445 patients with colon cancer. We compared gene expression between tumors in men and women and observed significant sex differences in sex chromosome genes only. We then inferred patient-specific gene regulatory networks and found significant regulatory differences between males and females, with drug and xenobiotics metabolism via cytochrome P450 pathways more strongly targeted in females. This finding was validated in a dataset of 1,193 patients from five independent studies. While targeting, the drug metabolism pathway did not change overall survival for males treated with adjuvant chemotherapy, females with greater targeting showed an increase in 10-year overall survival probability, 89% [95% confidence interval (CI), 78-100] survival compared with 61% (95% CI, 45-82) for women with lower targeting, respectively (P = 0.034). Our network analysis uncovers patterns of transcriptional regulation that differentiate male and female colon cancer and identifies differences in regulatory processes involving the drug metabolism pathway associated with survival in women who receive adjuvant chemotherapy. This approach can be used to investigate the molecular features that drive sex differences in other cancers and complex diseases.Significance: A network-based approach reveals that sex-specific patterns of gene targeting by transcriptional regulators are associated with survival outcome in colon cancer. This approach can be used to understand how sex influences progression and response to therapies in other cancers. Cancer Res; 78(19); 5538-47. ©2018 AACR.