RESUMO
Promoters and enhancers regulate the initiation of gene expression and maintenance of expression levels in spatial and temporal manner. Recent findings stemming from the Cap Analysis of Gene Expression (CAGE) demonstrate that promoters and enhancers, based on their expression profiles after stimulus, belong to different transcription response subclasses. One of the most promising biological features that might explain the difference in transcriptional response between subclasses is the local chromatin environment. We introduce a novel computational framework, PEDAL, for distinguishing effectively transcriptional profiles of promoters and enhancers using solely histone modification marks, chromatin accessibility and binding sites of transcription factors and co-activators. A case study on data from MCF-7 cell-line reveals that PEDAL can identify successfully the transcription response subclasses of promoters and enhancers from two different stimulations. Moreover, we report subsets of input markers that discriminate with minimized classification error MCF-7 promoter and enhancer transcription response subclasses. Our work provides a general computational approach for identifying effectively cell-specific and stimulation-specific promoter and enhancer transcriptional profiles, and thus, contributes to improve our understanding of transcriptional activation in human.
Assuntos
Biologia Computacional/métodos , Elementos Facilitadores Genéticos , Regiões Promotoras Genéticas , Transcrição Gênica , Algoritmos , Cromatina/genética , Fator de Crescimento Epidérmico/farmacologia , Perfilação da Expressão Gênica , Regulação da Expressão Gênica/efeitos dos fármacos , Humanos , Células MCF-7 , Ligação Proteica , Fatores de Transcrição , Ativação Transcricional , Fluxo de TrabalhoRESUMO
Enhancers are cis-acting DNA elements that play critical roles in distal regulation of gene expression. Identifying enhancers is an important step for understanding distinct gene expression programs that may reflect normal and pathogenic cellular conditions. Experimental identification of enhancers is constrained by the set of conditions used in the experiment. This requires multiple experiments to identify enhancers, as they can be active under specific cellular conditions but not in different cell types/tissues or cellular states. This has opened prospects for computational prediction methods that can be used for high-throughput identification of putative enhancers to complement experimental approaches. Potential functions and properties of predicted enhancers have been catalogued and summarized in several enhancer-oriented databases. Because the current methods for the computational prediction of enhancers produce significantly different enhancer predictions, it will be beneficial for the research community to have an overview of the strategies and solutions developed in this field. In this review, we focus on the identification and analysis of enhancers by bioinformatics approaches. First, we describe a general framework for computational identification of enhancers, present relevant data types and discuss possible computational solutions. Next, we cover over 30 existing computational enhancer identification methods that were developed since 2000. Our review highlights advantages, limitations and potentials, while suggesting pragmatic guidelines for development of more efficient computational enhancer prediction methods. Finally, we discuss challenges and open problems of this topic, which require further consideration.
Assuntos
Biologia Computacional , Elementos Facilitadores Genéticos , HistonasRESUMO
BACKGROUND: Circulating free DNA sequencing (cfDNA-Seq) can portray cancer genome landscapes, but highly sensitive and specific technologies are necessary to accurately detect mutations with often low variant frequencies. METHODS: We developed a customizable hybrid-capture cfDNA-Seq technology using off-the-shelf molecular barcodes and a novel duplex DNA molecule identification tool for enhanced error correction. RESULTS: Modeling based on cfDNA yields from 58 patients showed that this technology, requiring 25 ng of cfDNA, could be applied to >95% of patients with metastatic colorectal cancer (mCRC). cfDNA-Seq of a 32-gene, 163.3-kbp target region detected 100% of single-nucleotide variants, with 0.15% variant frequency in spike-in experiments. Molecular barcode error correction reduced false-positive mutation calls by 97.5%. In 28 consecutively analyzed patients with mCRC, 80 out of 91 mutations previously detected by tumor tissue sequencing were called in the cfDNA. Call rates were similar for point mutations and indels. cfDNA-Seq identified typical mCRC driver mutations in patients in whom biopsy sequencing had failed or did not include key mCRC driver genes. Mutations only called in cfDNA but undetectable in matched biopsies included a subclonal resistance driver mutation to anti-EGFR antibodies in KRAS, parallel evolution of multiple PIK3CA mutations in 2 cases, and TP53 mutations originating from clonal hematopoiesis. Furthermore, cfDNA-Seq off-target read analysis allowed simultaneous genome-wide copy number profile reconstruction in 20 of 28 cases. Copy number profiles were validated by low-coverage whole-genome sequencing. CONCLUSIONS: This error-corrected, ultradeep cfDNA-Seq technology with a customizable target region and publicly available bioinformatics tools enables broad insights into cancer genomes and evolution. CLINICALTRIALSGOV IDENTIFIER: NCT02112357.
Assuntos
Biomarcadores Tumorais/sangue , DNA Tumoral Circulante/sangue , Variações do Número de Cópias de DNA/genética , Análise Mutacional de DNA/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Mutação , Biomarcadores Tumorais/genética , DNA Tumoral Circulante/genética , Neoplasias Colorretais/sangue , Neoplasias Colorretais/genética , Neoplasias Colorretais/patologia , Estudo de Associação Genômica Ampla , Humanos , Metástase Neoplásica , Sensibilidade e EspecificidadeRESUMO
Transcription regulation in multicellular eukaryotes is orchestrated by a number of DNA functional elements located at gene regulatory regions. Some regulatory regions (e.g. enhancers) are located far away from the gene they affect. Identification of distal regulatory elements is a challenge for the bioinformatics research. Although existing methodologies increased the number of computationally predicted enhancers, performance inconsistency of computational models across different cell-lines, class imbalance within the learning sets and ad hoc rules for selecting enhancer candidates for supervised learning, are some key questions that require further examination. In this study we developed DEEP, a novel ensemble prediction framework. DEEP integrates three components with diverse characteristics that streamline the analysis of enhancer's properties in a great variety of cellular conditions. In our method we train many individual classification models that we combine to classify DNA regions as enhancers or non-enhancers. DEEP uses features derived from histone modification marks or attributes coming from sequence characteristics. Experimental results indicate that DEEP performs better than four state-of-the-art methods on the ENCODE data. We report the first computational enhancer prediction results on FANTOM5 data where DEEP achieves 90.2% accuracy and 90% geometric mean (GM) of specificity and sensitivity across 36 different tissues. We further present results derived using in vivo-derived enhancer data from VISTA database. DEEP-VISTA, when tested on an independent test set, achieved GM of 80.1% and accuracy of 89.64%. DEEP framework is publicly available at http://cbrc.kaust.edu.sa/deep/.
Assuntos
Elementos Facilitadores Genéticos , Análise de Sequência de DNA/métodos , Imunoprecipitação da Cromatina/métodos , Genômica/métodos , Células HeLa , Histonas/metabolismo , Humanos , Células K562 , Máquina de Vetores de Suporte , Fatores de Transcrição/metabolismoRESUMO
MOTIVATION: Pathogens infect their host and hijack the host machinery to produce more progeny pathogens. Obligate intracellular pathogens, in particular, require resources of the host to replicate. Therefore, infections by these pathogens lead to alterations in the metabolism of the host, shifting in favor of pathogen protein production. Some computational identification of mechanisms of host-pathogen interactions have been proposed, but it seems the problem has yet to be approached from the metabolite-hijacking angle. RESULTS: We propose a novel computational framework, Hi-Jack, for inferring pathway-based interactions between a host and a pathogen that relies on the idea of metabolite hijacking. Hi-Jack searches metabolic network data from hosts and pathogens, and identifies candidate reactions where hijacking occurs. A novel scoring function ranks candidate hijacked reactions and identifies pathways in the host that interact with pathways in the pathogen, as well as the associated frequent hijacked metabolites. We also describe host-pathogen interaction principles that can be used in the future for subsequent studies. Our case study on Mycobacterium tuberculosis (Mtb) revealed pathways in human-e.g. carbohydrate metabolism, lipids metabolism and pathways related to amino acids metabolism-that are likely to be hijacked by the pathogen. In addition, we report interesting potential pathway interconnections between human and Mtb such as linkage of human fatty acid biosynthesis with Mtb biosynthesis of unsaturated fatty acids, or linkage of human pentose phosphate pathway with lipopolysaccharide biosynthesis in Mtb. AVAILABILITY AND IMPLEMENTATION: Datasets and codes are available at http://cloud.kaust.edu.sa/Pages/Hi-Jack.aspx
Assuntos
Biologia Computacional/métodos , Interações Hospedeiro-Patógeno , Redes e Vias Metabólicas , Metabolômica/métodos , Mycobacterium tuberculosis/metabolismo , Proteínas/metabolismo , Tuberculose/metabolismo , Algoritmos , Humanos , Tuberculose/microbiologiaRESUMO
MOTIVATION: Single nucleotide polymorphisms (SNPs) are considered the most frequently occurring DNA sequence variations. Several computational methods have been proposed for the classification of missense SNPs to neutral and disease associated. However, existing computational approaches fail to select relevant features by choosing them arbitrarily without sufficient documentation. Moreover, they are limited to the problem of missing values, imbalance between the learning datasets and most of them do not support their predictions with confidence scores. RESULTS: To overcome these limitations, a novel ensemble computational methodology is proposed. EnsembleGASVR facilitates a two-step algorithm, which in its first step applies a novel evolutionary embedded algorithm to locate close to optimal Support Vector Regression models. In its second step, these models are combined to extract a universal predictor, which is less prone to overfitting issues, systematizes the rebalancing of the learning sets and uses an internal approach for solving the missing values problem without loss of information. Confidence scores support all the predictions and the model becomes tunable by modifying the classification thresholds. An extensive study was performed for collecting the most relevant features for the problem of classifying SNPs, and a superset of 88 features was constructed. Experimental results show that the proposed framework outperforms well-known algorithms in terms of classification performance in the examined datasets. Finally, the proposed algorithmic framework was able to uncover the significant role of certain features such as the solvent accessibility feature, and the top-scored predictions were further validated by linking them with disease phenotypes. AVAILABILITY AND IMPLEMENTATION: Datasets and codes are freely available on the Web at http://prlab.ceid.upatras.gr/EnsembleGASVR/dataset-codes.zip. All the required information about the article is available through http://prlab.ceid.upatras.gr/EnsembleGASVR/site.html.
Assuntos
Algoritmos , Mutação de Sentido Incorreto , Polimorfismo de Nucleotídeo Único , Substituição de Aminoácidos , HumanosRESUMO
To decipher the interactions between various components of the tumor microenvironment (TME) and tumor cells in a preserved spatial context, a multiparametric approach is essential. In this pursuit, imaging mass cytometry (IMC) emerges as a valuable tool, capable of concurrently analyzing up to 40 parameters at subcellular resolution. In this study, a set of antibodies was selected to spatially resolve multiple cell types and TME elements, including a comprehensive panel targeted at dissecting the heterogeneity of cancer-associated fibroblasts (CAF), a pivotal TME component. This antibody panel was standardized and optimized using formalin-fixed paraffin-embedded tissue (FFPE) samples from different organs/lesions known to express the markers of interest. The final composition of the antibody panel was determined based on the performance of conjugated antibodies in both immunohistochemistry (IHC) and IMC. Tissue images were segmented employing the Steinbock framework. Unsupervised clustering of single-cell data was carried out using a bioinformatics pipeline developed in R program. This paper provides a detailed description of the staining procedure and analysis workflow. Subsequently, the panel underwent validation on clinical FFPE samples from head and neck squamous cell carcinoma (HNSCC). The panel and bioinformatics pipeline established here proved to be robust in characterizing different TME components of HNSCC while maintaining a high degree of spatial detail. The platform we describe shows promise for understanding the clinical implications of TMA heterogeneity in large patient cohorts with FFPE tissues available in diagnostic biobanks worldwide.
RESUMO
BACKGROUND: Presence of nerves in tumours, by axonogenesis and neurogenesis, is gaining increased attention for its impact on cancer initiation and development, and the new field of cancer neuroscience is emerging. A recent study in prostate cancer suggested that the tumour microenvironment may influence cancer progression by recruitment of Doublecortin (DCX)-expressing neural progenitor cells (NPCs). However, the presence of such cells in human breast tumours has not been comprehensively explored. METHODS: Here, we investigate the presence of DCX-expressing cells in breast cancer stromal tissue from patients using Imaging Mass Cytometry. Single-cell analysis of 372,468 cells across histopathological images of 107 breast cancers enabled spatial resolution of neural elements in the stromal compartment in correlation with clinicopathological features of these tumours. In parallel, we established a 3D in vitro model mimicking breast cancer neural progenitor-innervation and examined the two cell types as they co-evolved in co-culture by using mass spectrometry-based global proteomics. FINDINGS: Stromal presence of DCX + cells is associated with tumours of higher histological grade, a basal-like phenotype, and shorter patient survival in tumour tissue from patients with breast cancer. Global proteomics analysis revealed significant changes in the proteomic landscape of both breast cancer cells and neural progenitors in co-culture. INTERPRETATION: These results support that neural involvement plays an active role in breast cancer and warrants further studies on the relevance of nerve elements for tumour progression. FUNDING: This work was supported by the Research Council of Norway through its Centre of Excellence funding scheme, project number 223250 (to L.A.A), the Norwegian Cancer Society (to L.A.A. and H.V.), the Regional Health Trust Western Norway (Helse Vest) (to L.A.A.), the Meltzer Research Fund (to H.V.) and the National Institutes of Health (NIH)/NIGMS grant R01 GM132129 (to J.A.P.).
Assuntos
Neoplasias da Mama , Técnicas de Cocultura , Proteína Duplacortina , Células-Tronco Neurais , Proteômica , Análise de Célula Única , Humanos , Células-Tronco Neurais/metabolismo , Neoplasias da Mama/metabolismo , Neoplasias da Mama/patologia , Feminino , Proteômica/métodos , Análise de Célula Única/métodos , Proteoma/metabolismo , Microambiente Tumoral , Linhagem Celular Tumoral , Proteínas Associadas aos Microtúbulos/metabolismo , Proteínas Associadas aos Microtúbulos/genéticaRESUMO
Traditional biology was forced to restate some of its principles when the microRNA (miRNA) genes and their regulatory role were firstly discovered. Typically, miRNAs are small non-coding RNA molecules which have the ability to bind to the 3'untraslated region (UTR) of their mRNA target genes for cleavage or translational repression. Existing experimental techniques for their identification and the prediction of the target genes share some important limitations such as low coverage, time consuming experiments and high cost reagents. Hence, many computational methods have been proposed for these tasks to overcome these limitations. Recently, many researchers emphasized on the development of computational approaches to predict the participation of miRNA genes in regulatory networks and to analyze their transcription mechanisms. All these approaches have certain advantages and disadvantages which are going to be described in the present survey. Our work is differentiated from existing review papers by updating the methodologies list and emphasizing on the computational issues that arise from the miRNA data analysis. Furthermore, in the present survey, the various miRNA data analysis steps are treated as an integrated procedure whose aims and scope is to uncover the regulatory role and mechanisms of the miRNA genes. This integrated view of the miRNA data analysis steps may be extremely useful for all researchers even if they work on just a single step.
Assuntos
Biologia Computacional , MicroRNAs/genética , Redes Reguladoras de Genes , Máquina de Vetores de SuporteRESUMO
Motivation: Recent advances in highly multiplexed imaging have provided unprecedented insights into the complex cellular organization of tissues, with many applications in translational medicine. However, downstream analyses of multiplexed imaging data face several technical limitations, and although some computational methods and bioinformatics tools are available, deciphering the complex spatial organization of cellular ecosystems remains a challenging problem. Results: To mitigate this problem, we develop a novel computational tool, LOCATOR (anaLysis Of CAncer Tissue micrOenviRonment), for spatial analysis of cancer tissue microenvironments using data acquired from mass cytometry imaging technologies. LOCATOR introduces a graph-based representation of tissue images to describe features of the cellular organization and deploys downstream analysis and visualization utilities that can be used for data-driven patient-risk stratification. Our case studies using mass cytometry imaging data from two well-annotated breast cancer cohorts re-confirmed that the spatial organization of the tumour-immune microenvironment is strongly associated with the clinical outcome in breast cancer. In addition, we report interesting potential associations between the spatial organization of macrophages and patients' survival. Our work introduces an automated and versatile analysis tool for mass cytometry imaging data with many applications in future cancer research projects. Availability and implementation: Datasets and codes of LOCATOR are publicly available at https://github.com/RezvanEhsani/LOCATOR.
RESUMO
Cancers are often associated with hypoxia and metabolic reprogramming, resulting in enhanced tumor progression. Here, we aim to study breast cancer hypoxia responses, focusing on secreted proteins from low-grade (luminal-like) and high-grade (basal-like) cell lines before and after hypoxia. We examine the overlap between proteomics data from secretome analysis and laser microdissected human breast cancer stroma, and we identify a 33-protein stromal-based hypoxia profile (33P) capturing differences between luminal-like and basal-like tumors. The 33P signature is associated with metabolic differences and other adaptations following hypoxia. We observe that mRNA values for 33P predict patient survival independently of molecular subtypes and basic prognostic factors, also among low-grade luminal-like tumors. We find a significant prognostic interaction between 33P and radiation therapy.
Assuntos
Neoplasias da Mama , Humanos , Feminino , Neoplasias da Mama/patologia , Proteoma/metabolismo , Perfilação da Expressão Gênica , Linhagem Celular Tumoral , Hipóxia/genética , Regulação Neoplásica da Expressão GênicaRESUMO
Tumor neurogenesis, a process by which new nerves invade tumors, is a growing area of interest in cancer research. Nerve presence has been linked to aggressive features of various solid tumors, including breast and prostate cancer. A recent study suggested that the tumor microenvironment may influence cancer progression through recruitment of neural progenitor cells from the central nervous system. However, the presence of neural progenitors in human breast tumors has not been reported. Here, we investigate the presence of Doublecortin (DCX) and Neurofilament-Light (NFL) co-expressing (DCX+/NFL+) cells in patient breast cancer tissue using Imaging Mass Cytometry. To map the interaction between breast cancer cells and neural progenitor cells further, we created an in vitro model mimicking breast cancer innervation, and characterized using mass spectrometry-based proteomics on the two cell types as they co- evolved in co-culture. Our results indicate stromal presence of DCX+/NFL+ cells in breast tumor tissue from a cohort of 107 patient cases, and that neural interaction contribute to drive a more aggressive breast cancer phenotype in our co-culture models. Our results support that neural involvement plays an active role in breast cancer and warrants further studies on the interaction between nervous system and breast cancer progression.
RESUMO
Aberrant pro-survival signaling is a hallmark of cancer cells, but the response to chemotherapy is poorly understood. In this study, we investigate the initial signaling response to standard induction chemotherapy in a cohort of 32 acute myeloid leukemia (AML) patients, using 36-dimensional mass cytometry. Through supervised and unsupervised machine learning approaches, we find that reduction of extracellular-signal-regulated kinase (ERK) 1/2 and p38 mitogen-activated protein kinase (MAPK) phosphorylation in the myeloid cell compartment 24 h post-chemotherapy is a significant predictor of patient 5-year overall survival in this cohort. Validation by RNA sequencing shows induction of MAPK target gene expression in patients with high phospho-ERK1/2 24 h post-chemotherapy, while proteomics confirm an increase of the p38 prime target MAPK activated protein kinase 2 (MAPKAPK2). In this study, we demonstrate that mass cytometry can be a valuable tool for early response evaluation in AML and elucidate the potential of functional signaling analyses in precision oncology diagnostics.
Assuntos
Leucemia Mieloide Aguda , Medicina de Precisão , Humanos , Transdução de Sinais , Leucemia Mieloide Aguda/tratamento farmacológico , Leucemia Mieloide Aguda/genética , Leucemia Mieloide Aguda/metabolismo , Fosforilação , Proteínas Quinases p38 Ativadas por Mitógeno/metabolismo , Sistema de Sinalização das MAP Quinases/fisiologiaRESUMO
Profiling of circulating tumor DNA (ctDNA) may offer a non-invasive approach to monitor disease progression. Here, we develop a quantitative method, exploiting local tissue-specific cell-free DNA (cfDNA) degradation patterns, that accurately estimates ctDNA burden independent of genomic aberrations. Nucleosome-dependent cfDNA degradation at promoters and first exon-intron junctions is strongly associated with differential transcriptional activity in tumors and blood. A quantitative model, based on just 6 regulatory regions, could accurately predict ctDNA levels in colorectal cancer patients. Strikingly, a model restricted to blood-specific regulatory regions could predict ctDNA levels across both colorectal and breast cancer patients. Using compact targeted sequencing (<25 kb) of predictive regions, we demonstrate how the approach could enable quantitative low-cost tracking of ctDNA dynamics and disease progression.
Assuntos
Ácidos Nucleicos Livres/metabolismo , DNA Tumoral Circulante/metabolismo , Fragmentação do DNA , Carga Tumoral/fisiologia , Ácidos Nucleicos Livres/sangue , Ácidos Nucleicos Livres/genética , DNA Tumoral Circulante/genética , Neoplasias do Colo/genética , Neoplasias Colorretais/genética , Progressão da Doença , Regulação Neoplásica da Expressão Gênica , Genômica , Humanos , MutaçãoRESUMO
Epidermal growth factor receptor antibodies (EGFR-Abs) confer a survival benefit in patients with RAS wild-type metastatic colorectal cancer (mCRC), but resistance invariably occurs. Previous data showed that only a minority of cancer cells harboured known genetic resistance drivers when clinical resistance to single-agent EGFR-Abs had evolved, supporting the activity of non-genetic resistance mechanisms. Here, we used error-corrected ctDNA-sequencing (ctDNA-Seq) of 40 cancer genes to identify drivers of resistance and whether a genetic resistance-gap (a lack of detectable genetic resistance mechanisms in a large fraction of the cancer cell population) also occurs in RAS wild-type mCRCs treated with a combination of EGFR-Abs and chemotherapy. We detected one MAP2K1/MEK1 mutation and one ERBB2 amplification in 2/3 patients with primary resistance and KRAS, NRAS, MAP2K1/MEK1 mutations and ERBB2 aberrations in 6/7 patients with acquired resistance. In vitro testing identified MAP2K1/MEK1 P124S as a novel driver of EGFR-Ab resistance. Mutation subclonality analyses confirmed a genetic resistance-gap in mCRCs treated with EGFR-Abs and chemotherapy, with only 13.42% of cancer cells harboring identifiable resistance drivers. Our results support the utility of ctDNA-Seq to guide treatment allocation for patients with resistance and the importance of investigating further non-canonical EGFR-Ab resistance mechanisms, such as microenvironmentally-mediated resistance. The detection of MAP2K1 mutations could inform trials of MEK-inhibitors in these tumours.
RESUMO
Analysis of circulating cell-free DNA (cfDNA) has opened new opportunities for characterizing tumour mutational landscapes with many applications in genomic-driven oncology. We developed a customized targeted cfDNA sequencing approach for breast cancer (BC) using unique molecular identifiers (UMIs) for error correction. Our assay, spanning a 284.5 kb target region, is combined with a novel freely-licensed bioinformatics pipeline that provides detection of low-frequency variants, and reliable identification of copy number variations (CNVs) directly from plasma DNA. We first evaluated our pipeline on reference samples. Then in a cohort of 35 BC patients our approach detected actionable driver and clonal variants at low variant frequency levels in cfDNA that were concordant (77%) with sequencing of primary and/or metastatic solid tumour sites. We also detected ERRB2 gene CNVs used for HER2 subtype classification with 80% precision compared to immunohistochemistry. Further, we evaluated fragmentation profiles of cfDNA in BC and observed distinct differences compared to data from healthy individuals. Our results show that the developed assay addresses the majority of tumour associated aberrations directly from plasma DNA, and thus may be used to elucidate genomic alterations in liquid biopsy studies.
Assuntos
Neoplasias da Mama/genética , DNA Tumoral Circulante/genética , Variações do Número de Cópias de DNA , Adulto , Idoso , Biomarcadores Tumorais/genética , Neoplasias da Mama/patologia , Feminino , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Pessoa de Meia-Idade , Mutação , Análise de Sequência de DNARESUMO
Tumor DNA circulates in the plasma of cancer patients admixed with DNA from noncancerous cells. The genomic landscape of plasma DNA has been characterized in metastatic castration-resistant prostate cancer (mCRPC) but the plasma methylome has not been extensively explored. Here, we performed next-generation sequencing (NGS) on plasma DNA with and without bisulfite treatment from mCRPC patients receiving either abiraterone or enzalutamide in the pre- or post-chemotherapy setting. Principal component analysis on the mCRPC plasma methylome indicated that the main contributor to methylation variance (principal component one, or PC1) was strongly correlated with genomically determined tumor fraction (r = -0.96; P < 10-8) and characterized by hypermethylation of targets of the polycomb repressor complex 2 components. Further deconvolution of the PC1 top-correlated segments revealed that these segments are comprised of methylation patterns specific to either prostate cancer or prostate normal epithelium. To extract information specific to an individual's cancer, we then focused on an orthogonal methylation signature, which revealed enrichment for androgen receptor binding sequences and hypomethylation of these segments associated with AR copy number gain. Individuals harboring this methylation pattern had a more aggressive clinical course. Plasma methylome analysis can accurately quantitate tumor fraction and identify distinct biologically relevant mCRPC phenotypes.
Assuntos
DNA Tumoral Circulante , Metilação de DNA , Epigênese Genética , Regulação Neoplásica da Expressão Gênica , Neoplasias da Próstata , Adulto , Idoso , Idoso de 80 Anos ou mais , DNA Tumoral Circulante/sangue , DNA Tumoral Circulante/genética , Estudo de Associação Genômica Ampla , Humanos , Masculino , Pessoa de Meia-Idade , Metástase Neoplásica , Neoplasias da Próstata/sangue , Neoplasias da Próstata/genética , Neoplasias da Próstata/patologiaRESUMO
A 1-D full-life-cycle, Individual-based model (IBM), two-way coupled with a hydrodynamic/biogeochemical model, is demonstrated for anchovy and sardine in the N. Aegean Sea (Eastern Mediterranean). The model is stage-specific and includes a 'Wisconsin' type bioenergetics, a diel vertical migration and a population dynamics module, with the incorporation of known differences in biological attributes between the anchovy and sardine stocks. A new energy allocation/egg production algorithm was developed, allowing for breeding pattern to move along the capital-income breeding continuum. Fish growth was calibrated against available size-at-age data by tuning food consumption (the half saturation coefficients) using a genetic algorithm. After a ten-years spin up, the model reproduced well the magnitude of population biomasses and spawning periods of the two species in the N. Aegean Sea. Surprisingly, model simulations revealed that anchovy depends primarily on stored energy for egg production (mostly capital breeder) whereas sardine depends heavily on direct food intake (income breeder). This is related to the peculiar phenology of plankton production in the area, with mesozooplankton concentration exhibiting a sharp decrease from early summer to autumn and a subsequent increase from winter to early summer. Monthly changes in somatic condition of fish collected on board the commercial purse seine fleet followed closely the simulated mesozooplankton concentration. Finally, model simulations showed that, when both the anchovy and sardine stocks are overexploited, the mesozooplankton concentration increases, which may open up ecological space for competing species. The importance of protecting the recruit spawners was highlighted with model simulations testing the effect of changing the timing of the existing 2.5-months closed period. Optimum timing for fishery closure is different for anchovy and sardine because of their opposite spawning and recruitment periods.
Assuntos
Peixes/crescimento & desenvolvimento , Estágios do Ciclo de Vida , Modelos Teóricos , Animais , Calibragem , Pesqueiros , Cadeia Alimentar , Mar MediterrâneoRESUMO
BACKGROUND: Targeted deep sequencing is a highly effective technology to identify known and novel single nucleotide variants (SNVs) with many applications in translational medicine, disease monitoring and cancer profiling. However, identification of SNVs using deep sequencing data is a challenging computational problem as different sequencing artifacts limit the analytical sensitivity of SNV detection, especially at low variant allele frequencies (VAFs). METHODS: To address the problem of relatively high noise levels in amplicon-based deep sequencing data (e.g. with the Ion AmpliSeq technology) in the context of SNV calling, we have developed a new bioinformatics tool called AmpliSolve. AmpliSolve uses a set of normal samples to model position-specific, strand-specific and nucleotide-specific background artifacts (noise), and deploys a Poisson model-based statistical framework for SNV detection. RESULTS: Our tests on both synthetic and real data indicate that AmpliSolve achieves a good trade-off between precision and sensitivity, even at VAF below 5% and as low as 1%. We further validate AmpliSolve by applying it to the detection of SNVs in 96 circulating tumor DNA samples at three clinically relevant genomic positions and compare the results to digital droplet PCR experiments. CONCLUSIONS: AmpliSolve is a new tool for in-silico estimation of background noise and for detection of low frequency SNVs in targeted deep sequencing data. Although AmpliSolve has been specifically designed for and tested on amplicon-based libraries sequenced with the Ion Torrent platform it can, in principle, be applied to other sequencing platforms as well. AmpliSolve is freely available at https://github.com/dkleftogi/AmpliSolve .
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Polimorfismo de Nucleotídeo Único/genética , Benchmarking , DNA Tumoral Circulante/genética , Humanos , Reprodutibilidade dos TestesRESUMO
Neuroblastoma is a pediatric cancer that is frequently metastatic and resistant to conventional treatment. In part, a lack of natively metastatic, chemoresistant in vivo models has limited our insight into the development of aggressive disease. The Th-MYCN genetically engineered mouse model develops rapidly progressive chemosensitive neuroblastoma and lacks clinically relevant metastases. To study tumor progression in a context more reflective of clinical therapy, we delivered multicycle treatment with cyclophosphamide to Th-MYCN mice, individualizing therapy using MRI, to generate the Th-MYCN CPM32 model. These mice developed chemoresistance and spontaneous bone marrow metastases. Tumors exhibited an altered immune microenvironment with increased stroma and tumor-associated fibroblasts. Analysis of copy number aberrations revealed genomic changes characteristic of human MYCN-amplified neuroblastoma, specifically copy number gains at mouse chromosome 11, syntenic with gains on human chromosome 17q. RNA sequencing revealed enriched expression of genes associated with 17q gain and upregulation of genes associated with high-risk neuroblastoma, such as the cell-cycle regulator cyclin B1-interacting protein 1 (Ccnb1ip1) and thymidine kinase (TK1). The antiapoptotic, prometastatic JAK-STAT3 pathway was activated in chemoresistant tumors, and treatment with the JAK1/JAK2 inhibitor CYT387 reduced progression of chemoresistant tumors and increased survival. Our results highlight that under treatment conditions that mimic chemotherapy in human patients, Th-MYCN mice develop genomic, microenvironmental, and clinical features reminiscent of human chemorefractory disease. The Th-MYCN CPM32 model therefore is a useful tool to dissect in detail mechanisms that drive metastasis and chemoresistance, and highlights dysregulation of signaling pathways such as JAK-STAT3 that could be targeted to improve treatment of aggressive disease. SIGNIFICANCE: An in vivo mouse model of high-risk treatment-resistant neuroblastoma exhibits changes in the tumor microenvironment, widespread metastases, and sensitivity to JAK1/2 inhibition.