RESUMO
BACKGROUND: Somatic copy number alterations (SCNAs) are an important class of genomic alteration in cancer. They are frequently observed in cancer samples, with studies showing that, on average, SCNAs affect 34% of a cancer cell's genome. Furthermore, SCNAs have been shown to be major drivers of tumour development and have been associated with response to therapy and prognosis. Large-scale cancer genome studies suggest that tumours are driven by somatic copy number alterations (SCNAs) or single-nucleotide variants (SNVs). Despite the frequency of SCNAs and their clinical relevance, the use of genomics assays in the clinic is biased towards targeted gene panels, which identify SNVs but provide limited scope to detect SCNAs throughout the genome. There is a need for a comparably low-cost and simple method for high-resolution SCNA profiling. RESULTS: We present conliga, a fully probabilistic method that infers SCNA profiles from a low-cost, simple, and clinically-relevant assay (FAST-SeqS). When applied to 11 high-purity oesophageal adenocarcinoma samples, we obtain good agreement (Spearman's rank correlation coefficient, rs=0.94) between conliga's inferred SCNA profiles using FAST-SeqS data (approximately £14 per sample) and those inferred by ASCAT using high-coverage WGS (gold-standard). We find that conliga outperforms CNVkit (rs=0.89), also applied to FAST-SeqS data, and is comparable to QDNAseq (rs=0.96) applied to low-coverage WGS, which is approximately four-fold more expensive, more laborious and less clinically-relevant. By performing an in silico dilution series experiment, we find that conliga is particularly suited to detecting SCNAs in low tumour purity samples. At two million reads per sample, conliga is able to detect SCNAs in all nine samples at 3% tumour purity and as low as 0.5% purity in one sample. Crucially, we show that conliga's hidden state information can be used to decide when a sample is abnormal or normal, whereas CNVkit and QDNAseq cannot provide this critical information. CONCLUSIONS: We show that conliga provides high-resolution SCNA profiles using a convenient, low-cost assay. We believe conliga makes FAST-SeqS a more clinically valuable assay as well as a useful research tool, enabling inexpensive and fast copy number profiling of pre-malignant and cancer samples.
Assuntos
Variações do Número de Cópias de DNA , Neoplasias , Sequência de Bases , DNA , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Neoplasias/genéticaRESUMO
Bacteria in the genus Streptomyces are soil-dwelling oligotrophs and important producers of secondary metabolites. Previously, we showed that global messenger RNA expression was subject to a series of metabolic and regulatory switches during the lifetime of a fermentor batch culture of Streptomyces coelicolor M145. Here we analyze the proteome from eight time points from the same fermentor culture and, because phosphate availability is an important regulator of secondary metabolite production, compare this to the proteome of a similar time course from an S. coelicolor mutant, INB201 (ΔphoP), defective in the control of phosphate utilization. The proteomes provide a detailed view of enzymes involved in central carbon and nitrogen metabolism. Trends in protein expression over the time courses were deduced from a protein abundance index, which also revealed the importance of stress pathway proteins in both cultures. As expected, the ΔphoP mutant was deficient in expression of PhoP-dependent genes, and several putatively compensatory metabolic and regulatory pathways for phosphate scavenging were detected. Notably there is a succession of switches that coordinately induce the production of enzymes for five different secondary metabolite biosynthesis pathways over the course of the batch cultures.
Assuntos
Aclimatação , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Mutação/genética , Fosfatos/metabolismo , Streptomyces coelicolor/metabolismo , Técnicas de Cultura Celular por Lotes , Biomarcadores/metabolismo , Células Cultivadas , Cromatografia Líquida , Perfilação da Expressão Gênica , Regulação Bacteriana da Expressão Gênica , Análise de Sequência com Séries de Oligonucleotídeos , Proteoma/análise , Proteômica , RNA Bacteriano/genética , RNA Mensageiro/genética , Espectrometria de Massas por Ionização e Dessorção a Laser Assistida por Matriz , Streptomyces coelicolor/crescimento & desenvolvimentoAssuntos
Proteínas de Drosophila/genética , Proteínas de Drosophila/metabolismo , Proteínas Hedgehog/metabolismo , Peptídeos e Proteínas de Sinalização Intracelular/metabolismo , Proteínas Serina-Treonina Quinases/metabolismo , Receptores de Superfície Celular/genética , Nicho de Células-Tronco/fisiologia , Células-Tronco/fisiologia , Animais , MasculinoRESUMO
We propose a semiparametric Bayesian model, based on penalized splines, for the recovery of the time-invariant topology of a causal interaction network from longitudinal data. Our motivation is inference of gene regulatory networks from low-resolution microarray time series, where existence of nonlinear interactions is well known. Parenthood relations are mapped by augmenting the model with kinship indicators and providing these with either an overall or gene-wise hierarchical structure. Appropriate specification of the prior is crucial to control the flexibility of the splines, especially under circumstances of scarce data; thus, we provide an informative, proper prior. Substantive improvement in network inference over a linear model is demonstrated using synthetic data drawn from ordinary differential equation models and gene expression from an experimental data set of the Arabidopsis thaliana circadian rhythm.
Assuntos
Teorema de Bayes , Redes Reguladoras de Genes , Modelos Genéticos , Modelos Estatísticos , Algoritmos , Arabidopsis/genética , Bioestatística , Ritmo Circadiano/genética , Genoma de Planta , Modelos Lineares , Cadeias de Markov , Dinâmica não Linear , Análise de Sequência com Séries de Oligonucleotídeos/estatística & dados numéricosRESUMO
Pomelo II (http://pomelo2.bioinfo.cnio.es) is an open-source, web-based, freely available tool for the analysis of gene (and protein) expression and tissue array data. Pomelo II implements: permutation-based tests for class comparisons (t-test, ANOVA) and regression; survival analysis using Cox model; contingency table analysis with Fisher's exact test; linear models (of which t-test and ANOVA are especial cases) that allow additional covariates for complex experimental designs and use empirical Bayes moderated statistics. Permutation-based and Cox model analysis use parallel computing, which permits taking advantage of multicore CPUs and computing clusters. Access to, and further analysis of, additional biological information and annotations (PubMed references, Gene Ontology terms, KEGG and Reactome pathways) are available either for individual genes (from clickable links in tables and figures) or sets of genes. The source code is available, allowing for extending and reusing the software. A comprehensive test suite is also available, and covers both the user interface and the numerical results. The possibility of including additional covariates, parallelization of computation, open-source availability of the code and comprehensive testing suite make Pomelo II a unique tool.
Assuntos
Perfilação da Expressão Gênica , Software , Interpretação Estatística de Dados , Proteínas/genética , Reprodutibilidade dos Testes , Análise Serial de Tecidos , Interface Usuário-ComputadorRESUMO
BACKGROUND: During the lifetime of a fermenter culture, the soil bacterium S. coelicolor undergoes a major metabolic switch from exponential growth to antibiotic production. We have studied gene expression patterns during this switch, using a specifically designed Affymetrix genechip and a high-resolution time-series of fermenter-grown samples. RESULTS: Surprisingly, we find that the metabolic switch actually consists of multiple finely orchestrated switching events. Strongly coherent clusters of genes show drastic changes in gene expression already many hours before the classically defined transition phase where the switch from primary to secondary metabolism was expected. The main switch in gene expression takes only 2 hours, and changes in antibiotic biosynthesis genes are delayed relative to the metabolic rearrangements. Furthermore, global variation in morphogenesis genes indicates an involvement of cell differentiation pathways in the decision phase leading up to the commitment to antibiotic biosynthesis. CONCLUSIONS: Our study provides the first detailed insights into the complex sequence of early regulatory events during and preceding the major metabolic switch in S. coelicolor, which will form the starting point for future attempts at engineering antibiotic production in a biotechnological setting.
Assuntos
Perfilação da Expressão Gênica , Streptomyces coelicolor/genética , Streptomyces coelicolor/metabolismo , Antibacterianos/biossíntese , Análise por Conglomerados , Fermentação , Regulação Bacteriana da Expressão Gênica , Genes Bacterianos , Família Multigênica , RNA Bacteriano/genética , Streptomyces coelicolor/crescimento & desenvolvimentoRESUMO
Asterias (http://www.asterias.info) is an open-source, web-based, suite for the analysis of gene expression and aCGH data. Asterias implements validated statistical methods, and most of the applications use parallel computing, which permits taking advantage of multicore CPUs and computing clusters. Access to, and further analysis of, additional biological information and annotations (PubMed references, Gene Ontology terms, KEGG and Reactome pathways) are available either for individual genes (from clickable links in tables and figures) or sets of genes. These applications cover from array normalization to imputation and preprocessing, differential gene expression analysis, class and survival prediction and aCGH analysis. The source code is available, allowing for extention and reuse of the software. The links and analysis of additional functional information, parallelization of computation and open-source availability of the code make Asterias a unique suite that can exploit features specific to web-based environments.
Assuntos
Biologia Computacional/métodos , Perfilação da Expressão Gênica , Internet , Hibridização de Ácido Nucleico , Análise de Sequência com Séries de Oligonucleotídeos , Animais , Automação , Genômica , Humanos , Linguagens de Programação , Software , Interface Usuário-ComputadorRESUMO
The proliferative and functional heterogeneity among seemingly uniform cells is a universal phenomenon. Identifying the underlying factors requires single-cell analysis of function and proliferation. Here we show that the pancreatic beta-cells in zebrafish exhibit different growth-promoting and functional properties, which in part reflect differences in the time elapsed since birth of the cells. Calcium imaging shows that the beta-cells in the embryonic islet become functional during early zebrafish development. At later stages, younger beta-cells join the islet following differentiation from post-embryonic progenitors. Notably, the older and younger beta-cells occupy different regions within the islet, which generates topological asymmetries in glucose responsiveness and proliferation. Specifically, the older beta-cells exhibit robust glucose responsiveness, whereas younger beta-cells are more proliferative but less functional. As the islet approaches its mature state, heterogeneity diminishes and beta-cells synchronize function and proliferation. Our work illustrates a dynamic model of heterogeneity based on evolving proliferative and functional beta-cell states.Βeta-cells have recently been shown to be heterogeneous with regard to morphology and function. Here, the authors show that ß-cells in zebrafish switch from proliferative to functional states with increasing time since ß-cell birth, leading to functional and proliferative heterogeneity.
Assuntos
Células Secretoras de Insulina/citologia , Ilhotas Pancreáticas/citologia , Peixe-Zebra/embriologia , Animais , Animais Geneticamente Modificados , Linhagem da Célula , Proliferação de Células , Técnicas Citológicas/métodos , Embrião não Mamífero/citologia , Embrião não Mamífero/efeitos dos fármacos , Glucose/metabolismo , Células Secretoras de Insulina/efeitos dos fármacos , Células Secretoras de Insulina/fisiologia , Ilhotas Pancreáticas/embriologia , Tamoxifeno/análogos & derivados , Tamoxifeno/farmacologia , Urocortinas/metabolismo , Peixe-Zebra/genéticaRESUMO
The analysis of expression and CGH arrays plays a central role in the study of complex diseases, especially cancer, including finding markers for early diagnosis and prognosis, choosing an optimal therapy, or increasing our understanding of cancer development and metastasis. Asterias (http://www.asterias.info) is an integrated collection of freely-accessible web tools for the analysis of gene expression and aCGH data. Most of the tools use parallel computing (via MPI) and run on a server with 60 CPUs for computation; compared to a desktop or server-based but not parallelized application, parallelization provides speed ups of factors up to 50. Most of our applications allow the user to obtain additional information for user-selected genes (chromosomal location, PubMed ids, Gene Ontology terms, etc.) by using clickable links in tables and/or figures. Our tools include: normalization of expression and aCGH data (DNMAD); converting between different types of gene/clone and protein identifiers (IDconverter/IDClight); filtering and imputation (preP); finding differentially expressed genes related to patient class and survival data (Pomelo II); searching for models of class prediction (Tnasas); using random forests to search for minimal models for class prediction or for large subsets of genes with predictive capacity (GeneSrF); searching for molecular signatures and predictive genes with survival data (SignS); detecting regions of genomic DNA gain or loss (ADaCGH). The capability to send results between different applications, access to additional functional information, and parallelized computation make our suite unique and exploit features only available to web-based applications.