RESUMO
Determining how cells vary with their local signaling environment and organize into distinct cellular communities is critical for understanding processes as diverse as development, aging, and cancer. Here we introduce EcoTyper, a machine learning framework for large-scale identification and validation of cell states and multicellular communities from bulk, single-cell, and spatially resolved gene expression data. When applied to 12 major cell lineages across 16 types of human carcinoma, EcoTyper identified 69 transcriptionally defined cell states. Most states were specific to neoplastic tissue, ubiquitous across tumor types, and significantly prognostic. By analyzing cell-state co-occurrence patterns, we discovered ten clinically distinct multicellular communities with unexpectedly strong conservation, including three with myeloid and stromal elements linked to adverse survival, one enriched in normal tissue, and two associated with early cancer development. This study elucidates fundamental units of cellular organization in human carcinoma and provides a framework for large-scale profiling of cellular ecosystems in any tissue.
Assuntos
Neoplasias/patologia , Microambiente Tumoral , Sobrevivência Celular , Regulação Neoplásica da Expressão Gênica , Humanos , Imunoterapia , Inflamação/patologia , Ligantes , Neoplasias/genética , Fenótipo , Prognóstico , Transcrição GênicaRESUMO
Considering genetic variability in population studies focusing on the health risk assessment of exposure to environmental carcinogens may provide improved insights in individual environmental cancer risks. Therefore, the current study aims to determine the impact of genetic polymorphisms on the relationship between exposure and gene expression, by identifying exposure-dependently coregulated genes and genetic pathways. Statistical analysis based on mixed models, was performed to relate gene expression data from 134 subjects to exposure measurements of multiple carcinogens, 28 polymorphisms, age, sex and biomarkers of cancer risk. We evaluated the combined exposure to cadmium, lead, polychlorinated biphenyls, p,p'-dichlorodiphenyldichloroethylene, hexachlorobenzene and 1-OH-pyrene, and the outcome was biologically interpreted by using ConsensusPathDB, thereby focusing on carcinogenesis-related pathways. We found generic and carcinogenesis-related pathways deregulated in both sexes, but males showed a stronger transcriptome response than females. We highlighted NOTCH1, CBR1, ITGB3, ITGA4, ADI1, HES1, NCOA2 and SMARCA2 in view of their direct link with cancer development. Two of these, NOTCH1 and ITGB3, are also known to respond to PCBs and cadmium chloride exposure in rodents and to lead in humans. Subjects carrying a high number of risk alleles appear more responsive to combined carcinogen exposure with respect to the induced expression of some of these cancer-related genes, which may be indicative of increased cancer risk as a consequence of environmental factors.
Assuntos
Carcinógenos Ambientais/toxicidade , Proteínas de Neoplasias/biossíntese , Neoplasias/genética , Transcriptoma/genética , Biomarcadores Tumorais/biossíntese , Biomarcadores Tumorais/genética , Exposição Ambiental , Monitoramento Ambiental , Feminino , Regulação Neoplásica da Expressão Gênica/efeitos dos fármacos , Humanos , Masculino , Redes e Vias Metabólicas/efeitos dos fármacos , Redes e Vias Metabólicas/genética , Neoplasias/induzido quimicamente , Neoplasias/patologia , Bifenilos Policlorados/toxicidade , Medição de RiscoRESUMO
BACKGROUND: The prognosis for patients with head and neck cancer (HNC) is poor and has improved little in recent decades, partially due to lack of therapeutic options. To identify effective therapeutic targets, we sought to identify molecular pathways that drive metastasis and HNC progression, through large-scale systematic analyses of transcriptomic data. METHODS: We performed meta-analysis across 29 gene expression studies including 2074 primary HNC biopsies to identify genes and transcriptional pathways associated with survival and lymph node metastasis (LNM). To understand the biological roles of these genes in HNC, we identified their associated cancer pathways, as well as the cell types that express them within HNC tumor microenvironments, by integrating single-cell RNA-seq and bulk RNA-seq from sorted cell populations. RESULTS: Patient survival-associated genes were heterogenous and included drivers of diverse tumor biological processes: these included tumor-intrinsic processes such as epithelial dedifferentiation and epithelial to mesenchymal transition, as well as tumor microenvironmental factors such as T cell-mediated immunity and cancer-associated fibroblast activity. Unexpectedly, LNM-associated genes were almost universally associated with epithelial dedifferentiation within malignant cells. Genes negatively associated with LNM consisted of regulators of squamous epithelial differentiation that are expressed within well-differentiated malignant cells, while those positively associated with LNM represented cell cycle regulators that are normally repressed by the p53-DREAM pathway. These pro-LNM genes are overexpressed in proliferating malignant cells of TP53 mutated and HPV + ve HNCs and are strongly associated with stemness, suggesting that they represent markers of pre-metastatic cancer stem-like cells. LNM-associated genes are deregulated in high-grade oral precancerous lesions, and deregulated further in primary HNCs with advancing tumor grade and deregulated further still in lymph node metastases. CONCLUSIONS: In HNC, patient survival is affected by multiple biological processes and is strongly influenced by the tumor immune and stromal microenvironments. In contrast, LNM appears to be driven primarily by malignant cell plasticity, characterized by epithelial dedifferentiation coupled with EMT-independent proliferation and stemness. Our findings postulate that LNM is initially caused by loss of p53-DREAM-mediated repression of cell cycle genes during early tumorigenesis.
Assuntos
Genes cdc , Neoplasias de Cabeça e Pescoço , Humanos , Transição Epitelial-Mesenquimal/genética , Neoplasias de Cabeça e Pescoço/genética , Metástase Linfática , Microambiente Tumoral/genética , Proteína Supressora de Tumor p53/genéticaRESUMO
Lack of accurate methods for early lymphoma detection limits the ability to cure patients. Since patients with Non-Hodgkin lymphomas (NHL) who present with advanced disease have worse outcomes, accurate and sensitive methods for early detection are needed to improve patient care. We developed a DNA methylation-based prediction tool for NHL, based on blood samples collected prospectively from 278 apparently healthy patients who were followed for up to 16 years to monitor for NHL development. A predictive score was developed using machine learning methods in a robust training/validation framework. Our predictive score incorporates CpG DNA methylation at 135 genomic positions, with higher scores predicting higher risk. It was 85% and 78% accurate for identifying patients at risk of developing future NHL, in patients with high or low epigenetic mitotic clock respectively, in a validation cohort. It was also sensitive at detecting active NHL (96.3% accuracy) and healthy status (95.6% accuracy) in additional independent cohorts. Scores optimized for specific NHL subtypes showed significant but lower accuracy for predicting other subtypes. Our score incorporates hyper-methylation of Polycomb and HOX genes, which have roles in NHL development, as well as PAX5 - a master transcriptional regulator of B-cell fate. Subjects with higher risk scores showed higher regulatory T-cells, memory B-cells, but lower naïve T helper lymphocytes fractions in the blood. Future prospective studies will be required to confirm the utility of our signature for managing patients who are at high risk for developing future NHL.
RESUMO
Polychlorinated biphenyls (PCBs) are a class of widespread environmental pollutants, commonly found in human blood, that have been suggested to be linked to the occurrence of sporadic Parkinson's disease (PD). It has been reported that some non-coplanar PCBs accumulate in the brains of female PD patients. To improve our understanding of the association between PCB exposure and PD risk we have applied whole transcriptome gene expression analysis in blood cells from 594 PCB-exposed subjects (369 female, 225 male). Interestingly, we observe that in females, blood levels of non-coplanar PCBs appear to be associated with expression levels of PD-specific genes. However, no such association was detected in males. Among the 131 PD-specific genes affected, 39 have been shown to display similar changes in expression levels in the substantia nigra of deceased PD patients. Especially among the down-regulated genes, transcripts of genes involved in neurotransmitter vesicle-related functions were predominant.
Assuntos
Poluentes Ambientais/sangue , Expressão Gênica/efeitos dos fármacos , Doença de Parkinson/genética , Bifenilos Policlorados/sangue , Substância Negra/metabolismo , Adulto , Idoso , Feminino , Perfilação da Expressão Gênica , Humanos , Masculino , Pessoa de Meia-Idade , Doença de Parkinson/sangue , Fatores SexuaisRESUMO
PCBs are classified as xenoestrogens and carcinogens and their health risks may be sex-specific. To identify potential sex-specific responses to PCB-exposure we established gene expression profiles in a population study subdivided into females and males. Gene expression profiles were determined in a study population consisting of 512 subjects from the EnviroGenomarkers project, 217 subjects who developed lymphoma and 295 controls were selected in later life. We ran linear mixed models in order to find associations between gene expression and exposure to PCBs, while correcting for confounders, in particular distribution of white blood cells (WBC), as well as random effects. The analysis was subdivided according to sex and development of lymphoma in later life. The changes in gene expression as a result of exposure to the six studied PCB congeners were sex- and WBC type specific. The relatively large number of genes that are significantly associated with PCB-exposure in the female subpopulation already indicates different biological response mechanisms to PCBs between the two sexes. The interaction analysis between different PCBs and WBCs provides only a small overlap between sexes. In males, cancer-related pathways and in females immune system-related pathways are identified in association with PCBs and WBCs. Future lymphoma cases and controls for both sexes show different responses to the interaction of PCBs with WBCs, suggesting a role of the immune system in PCB-related cancer development.
Assuntos
Carcinógenos/toxicidade , Poluentes Ambientais/toxicidade , Neoplasias/genética , Bifenilos Policlorados/toxicidade , Transcriptoma/efeitos dos fármacos , Monitoramento Ambiental , Feminino , Humanos , Sistema Imunitário/efeitos dos fármacos , Sistema Imunitário/patologia , Leucócitos/efeitos dos fármacos , Masculino , Pessoa de Meia-Idade , Neoplasias/induzido quimicamente , Caracteres Sexuais , Transcriptoma/genética , Xenobióticos/toxicidadeRESUMO
Mitochondrial dysfunction seems to play a key role in the etiology of insulin resistance. At birth, a link has already been established between mitochondrial DNA (mtDNA) content and insulin levels in cord blood. In this study, we explore shared epigenetic mechanisms of the association between mtDNA content and insulin levels, supporting the developmental origins of this link. First, the association between cord blood insulin and mtDNA content in 882 newborns of the ENVIRONAGE birth cohort was assessed. Cord blood mtDNA content was established via qPCR, while cord blood levels of insulin were determined using electrochemiluminescence immunoassays. Then the cord blood DNA methylome and transcriptome were determined in 179 newborns, using the human 450K methylation Illumina and Agilent Whole Human Genome 8 × 60 K microarrays, respectively. Subsequently, we performed an epigenome-wide association study (EWAS) adjusted for different maternal and neonatal variables. Afterward, we focused on the 20 strongest associations based on p-values to assign transcriptomic correlates and allocate corresponding pathways employing the R packages ReactomePA and RDAVIDWebService. On the regional level, we examined differential methylation using the DMRcate and Bumphunter packages in R. Cord blood mtDNA content and insulin were significantly correlated (r = 0.074, p = 0.028), still showing a trend after additional adjustment for maternal and neonatal variables (p = 0.062). We found an overlap of 33 pathways which were in common between the association with cord blood mtDNA content and insulin levels, including pathways of neurodevelopment, histone modification, cytochromes P450 (CYP)-metabolism, and biological aging. We further identified a DMR annotated to Repulsive Guidance Molecule BMP Co-Receptor A (RGMA) linked to cord blood insulin as well as mtDNA content. Metabolic variation in early life represented by neonatal insulin levels and mtDNA content might reflect or accommodate alterations in neurodevelopment, histone modification, CYP-metabolism, and aging, indicating etiological origins in epigenetic programming. Variation in metabolic hormones at birth, reflected by molecular changes, might via these alterations predispose children to metabolic diseases later in life. The results of this study may provide important markers for following targeted studies.
RESUMO
Batch effects are technical sources of variation introduced by the necessity of conducting gene expression analyses on different dates due to the large number of biological samples in population-based studies. The aim of this study is to evaluate the performances of linear mixed models (LMM) and Combat in batch effect removal. We also assessed the utility of adding quality control samples in the study design as technical replicates. In order to do so, we simulated gene expression data by adding "treatment" and batch effects to a real gene expression dataset. The performances of LMM and Combat, with and without quality control samples, are assessed in terms of sensitivity and specificity while correcting for the batch effect using a wide range of effect sizes, statistical noise, sample sizes and level of balanced/unbalanced designs. The simulations showed small differences among LMM and Combat. LMM identifies stronger relationships between big effect sizes and gene expression than Combat, while Combat identifies in general more true and false positives than LMM. However, these small differences can still be relevant depending on the research goal. When any of these methods are applied, quality control samples did not reduce the batch effect, showing no added value for including them in the study design.
Assuntos
Interpretação Estatística de Dados , Perfilação da Expressão Gênica/métodos , Controle de Qualidade , Simulação por Computador , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Modelos Biológicos , TranscriptomaRESUMO
BACKGROUND: Swimming in a chlorinated pool results in high exposure levels to disinfection by-products (DBPs), which have been associated with an increased risk of bladder cancer. OBJECTIVES: By studying molecular responses at the blood transcriptome level we examined the biological processes associated with exposure to these compounds. METHODS: Whole-genome gene expression and microRNA analysis was performed on blood samples collected from 43 volunteers before and 2h after 40min swimming in an indoor chlorinated pool (PISCINAII study). Exposure to THMs was measured in exhaled breath. Heart rate and kcal expenditure were measured as proxies for physical activity. Associations between exposure levels and gene expression were assessed using multivariate normal models (MVN), correcting for age, body mass index and sex. A Bonferroni threshold at 5% was applied. RESULTS: MVN-models for the individual exposures identified 1778 genes and 23 microRNAs that were significantly associated with exposure to at least one DBP. Due to co-linearity it was not possible to statistically disentangle responses to DBP exposure from those related to physical activity. However, after eliminating previously reported transcripts associated with physical activity a large number of hits remained associated with DBP exposure. Among those, 9 were linked with bladder and 31 with colon cancer. Concordant microRNA/mRNA expressions were identified in association with DBP exposure for hsa-mir-22-3p and hsa-miR-146a-5p and their targets RCOR1 and TLR4, both related to colon cancer in association with DBP exposure. CONCLUSIONS: Short-term exposure to low levels of DBPs shows genomics responses that may be indicative of increased cancer risk.
Assuntos
Desinfetantes/toxicidade , MicroRNAs/sangue , Piscinas , Poluentes Químicos da Água/toxicidade , Adolescente , Adulto , Desinfetantes/análise , Feminino , Expressão Gênica/efeitos dos fármacos , Humanos , Masculino , Poluentes Químicos da Água/análise , Adulto JovemRESUMO
Diesel vehicle emissions are the major source of genotoxic compounds in ambient air from urban areas. These pollutants are linked to risks of cardiovascular diseases, lung cancer, respiratory infections and adverse neurological effects. Biological events associated with exposure to some air pollutants are widely unknown but applying omics techniques may help to identify the molecular processes that link exposure to disease risk. Most data on health risks are related to long-term exposure, so the aim of this study is to investigate the impact of short-term exposure (two hours) to air pollutants on the blood transcriptome and microRNA expression levels. We analyzed transcriptomics and microRNA expression using microarray technology on blood samples from volunteers participating in studies in London, the Oxford Street cohort, and, in Barcelona, the TAPAS cohort. Personal exposure levels measurements of particulate matter (PM10, PM2.5), ultrafine particles (UFPC), nitrogen oxides (NO2, NO and NOx), black carbon (BC) and carbon oxides (CO and CO2) were registered for each volunteer. Associations between air pollutant levels and gene/microRNA expression were evaluated using multivariate normal models (MVN). MVN-models identified compound-specific expression of blood cell genes and microRNAs associated with air pollution despite the low exposure levels, the short exposure periods and the relatively small-sized cohorts. Hsa-miR-197-3p, hsa-miR-29a-3p, hsa-miR-15a-5p, hsa-miR-16-5p and hsa-miR-92a-3p are found significantly expressed in association with exposures. These microRNAs target also relevant transcripts, indicating their potential relevance in the research of omics-biomarkers responding to air pollution. Furthermore, these microRNAs are also known to be associated with diseases previously linked to air pollution exposure including several cancers such lung cancer and Alzheimer's disease. In conclusion, we identified in this study promising compound-specific mRNA and microRNA biomarkers after two hours of exposure to low levels of air pollutants during two hours that suggest increased cancer risks.
Assuntos
Poluição do Ar/estatística & dados numéricos , Exposição Ambiental/estatística & dados numéricos , Poluentes Atmosféricos/análise , Poluição do Ar/análise , Biomarcadores , Doenças Cardiovasculares/epidemiologia , Estudos de Coortes , Exposição Ambiental/análise , Feminino , Humanos , Londres , Neoplasias Pulmonares/epidemiologia , Masculino , MicroRNAs , Óxidos de Nitrogênio , Material Particulado/análise , Projetos de Pesquisa , Infecções Respiratórias/epidemiologia , Transcriptoma , Emissões de Veículos/análiseRESUMO
MicroRNAs are small non-coding molecules that regulate gene expression and in return affect diverse biological functions, including those involved in toxicity and development of disease. Recent evidence suggests that microRNAs play an important role in liver pathologies, like viral hepatitis, alcoholic liver, hepatocellular carcinoma, or drug-induced liver injury. Furthermore, numerous studies demonstrated the high potential of microRNAs as promising non-invasive biomarkers of liver disease or as relevant targets for therapeutic treatment. This chapter describes a method for global microRNA analysis of primary hepatocytes by high-throughput sequencing. The method comprises the isolation of high-quality total RNA, analysis of microRNA sequencing data, and the validation of the findings by reverse transcriptase quantitative polymerase chain reaction analysis.