Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 7 de 7
Filter
1.
Cell ; 173(7): 1755-1769.e22, 2018 06 14.
Article in English | MEDLINE | ID: mdl-29754820

ABSTRACT

High-grade serous ovarian cancer (HGSC) exhibits extensive malignant clonal diversity with widespread but non-random patterns of disease dissemination. We investigated whether local immune microenvironment factors shape tumor progression properties at the interface of tumor-infiltrating lymphocytes (TILs) and cancer cells. Through multi-region study of 212 samples from 38 patients with whole-genome sequencing, immunohistochemistry, histologic image analysis, gene expression profiling, and T and B cell receptor sequencing, we identified three immunologic subtypes across samples and extensive within-patient diversity. Epithelial CD8+ TILs negatively associated with malignant diversity, reflecting immunological pruning of tumor clones inferred by neoantigen depletion, HLA I loss of heterozygosity, and spatial tracking between T cell and tumor clones. In addition, combinatorial prognostic effects of mutational processes and immune properties were observed, illuminating how specific genomic aberration types associate with immune response and impact survival. We conclude that within-patient spatial immune microenvironment variation shapes intraperitoneal malignant spread, provoking new evolutionary perspectives on HGSC clonal dispersion.


Subject(s)
Lymphocytes, Tumor-Infiltrating/immunology , Ovarian Neoplasms/pathology , Adult , Aged , Aged, 80 and over , Antigens, Neoplasm/genetics , Antigens, Neoplasm/metabolism , BRCA1 Protein/genetics , BRCA1 Protein/metabolism , BRCA2 Protein/genetics , BRCA2 Protein/metabolism , CD8 Antigens/metabolism , Cluster Analysis , Female , HLA Antigens/genetics , HLA Antigens/metabolism , Humans , Loss of Heterozygosity , Lymphocytes, Tumor-Infiltrating/cytology , Lymphocytes, Tumor-Infiltrating/metabolism , Middle Aged , Neoplasm Grading , Ovarian Neoplasms/classification , Ovarian Neoplasms/immunology , Polymorphism, Single Nucleotide , Receptors, Antigen, T-Cell/genetics , Receptors, Antigen, T-Cell/metabolism , Whole Genome Sequencing , Young Adult
2.
Bioinformatics ; 38(9): 2619-2620, 2022 04 28.
Article in English | MEDLINE | ID: mdl-35258549

ABSTRACT

SUMMARY: SomaticSiMu is an in silico simulator of single and double base substitutions, and single base insertions and deletions in an input genomic sequence to mimic mutational signatures. SomaticSiMu outputs simulated DNA sequences and mutational catalogues with imposed mutational signatures. The tool is the first mutational signature simulator featuring a graphical user interface, control of mutation rates and built-in visualization tools of the simulated mutations. Simulated datasets are useful as a ground truth to test the accuracy and sensitivity of DNA sequence classification tools and mutational signature extraction tools under different experimental scenarios. The reliability of SomaticSiMu was affirmed by (i) supervised machine learning classification of simulated sequences with different mutation types and burdens, and (ii) mutational signature extraction from simulated mutational catalogues. AVAILABILITY AND IMPLEMENTATION: SomaticSiMu is written in Python 3.8.3. The open-source code, documentation and tutorials are available at https://github.com/HillLab/SomaticSiMu under the terms of the CreativeCommonsAttribution4.0InternationalLicense. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Genomics , Software , Reproducibility of Results , Mutation , Genome
3.
J Appl Stat ; 51(5): 958-992, 2024.
Article in English | MEDLINE | ID: mdl-38524799

ABSTRACT

Considering the context of functional data analysis, we developed and applied a new Bayesian approach via the Gibbs sampler to select basis functions for a finite representation of functional data. The proposed methodology uses Bernoulli latent variables to assign zero to some of the basis function coefficients with a positive probability. This procedure allows for an adaptive basis selection since it can determine the number of bases and which ones should be selected to represent functional data. Moreover, the proposed procedure measures the uncertainty of the selection process and can be applied to multiple curves simultaneously. The methodology developed can deal with observed curves that may differ due to experimental error and random individual differences between subjects, which one can observe in a real dataset application involving daily numbers of COVID-19 cases in Brazil. Simulation studies show the main properties of the proposed method, such as its accuracy in estimating the coefficients and the strength of the procedure to find the true set of basis functions. Despite having been developed in the context of functional data analysis, we also compared the proposed model via simulation with the well-established LASSO and Bayesian LASSO, which are methods developed for non-functional data.

4.
PLoS One ; 15(4): e0232391, 2020.
Article in English | MEDLINE | ID: mdl-32330208

ABSTRACT

The 2019 novel coronavirus (renamed SARS-CoV-2, and generally referred to as the COVID-19 virus) has spread to 184 countries with over 1.5 million confirmed cases. Such major viral outbreaks demand early elucidation of taxonomic classification and origin of the virus genomic sequence, for strategic planning, containment, and treatment. This paper identifies an intrinsic COVID-19 virus genomic signature and uses it together with a machine learning-based alignment-free approach for an ultra-fast, scalable, and highly accurate classification of whole COVID-19 virus genomes. The proposed method combines supervised machine learning with digital signal processing (MLDSP) for genome analyses, augmented by a decision tree approach to the machine learning component, and a Spearman's rank correlation coefficient analysis for result validation. These tools are used to analyze a large dataset of over 5000 unique viral genomic sequences, totalling 61.8 million bp, including the 29 COVID-19 virus sequences available on January 27, 2020. Our results support a hypothesis of a bat origin and classify the COVID-19 virus as Sarbecovirus, within Betacoronavirus. Our method achieves 100% accurate classification of the COVID-19 virus sequences, and discovers the most relevant relationships among over 5000 viral genomes within a few minutes, ab initio, using raw DNA sequence data alone, and without any specialized biological knowledge, training, gene or genome annotations. This suggests that, for novel viral and pathogen genome sequences, this alignment-free whole-genome machine-learning approach can provide a reliable real-time option for taxonomic classification.


Subject(s)
Betacoronavirus/genetics , Coronavirus Infections/virology , Genome, Viral , Machine Learning , Pneumonia, Viral/virology , Betacoronavirus/classification , COVID-19 , Coronavirus Infections/epidemiology , Genomics , Humans , Pandemics , Pneumonia, Viral/epidemiology , SARS-CoV-2
5.
Sci Rep ; 7(1): 13467, 2017 10 18.
Article in English | MEDLINE | ID: mdl-29044127

ABSTRACT

Characterization and quantification of tumour clonal populations over time via longitudinal sampling are essential components in understanding and predicting the response to therapeutic interventions. Computational methods for inferring tumour clonal composition from deep-targeted sequencing data are ubiquitous, however due to the lack of a ground truth biological data, evaluating their performance is difficult. In this work, we generate a benchmark data set that simulates tumour longitudinal growth and heterogeneity by in vitro mixing of cancer cell lines with known proportions. We apply four different algorithms to our ground truth data set and assess their performance in inferring clonal composition using different metrics. We also analyse the performance of these algorithms on breast tumour xenograft samples. We conclude that methods that can simultaneously analyse multiple samples while accounting for copy number alterations as a factor in allelic measurements exhibit the most accurate predictions. These results will inform future functional genomics oriented studies of model systems where time series measurements in the context of therapeutic interventions are becoming increasingly common. These studies will need computational models which accurately reflect the multi-factorial nature of allele measurement in cancer including, as we show here, segmental aneuploidies.


Subject(s)
Computer Simulation , Models, Biological , Neoplasms/etiology , Neoplasms/pathology , Algorithms , Animals , Breast Neoplasms/etiology , Breast Neoplasms/pathology , Cell Line, Tumor , Computational Biology/methods , DNA Copy Number Variations , Disease Models, Animal , Female , Heterografts , Humans , Mice , Polymorphism, Single Nucleotide , Reproducibility of Results , Exome Sequencing
6.
Genome Biol ; 18(1): 140, 2017 07 27.
Article in English | MEDLINE | ID: mdl-28750660

ABSTRACT

Somatic evolution of malignant cells produces tumors composed of multiple clonal populations, distinguished in part by rearrangements and copy number changes affecting chromosomal segments. Whole genome sequencing mixes the signals of sampled populations, diluting the signals of clone-specific aberrations, and complicating estimation of clone-specific genotypes. We introduce ReMixT, a method to unmix tumor and contaminating normal signals and jointly predict mixture proportions, clone-specific segment copy number, and clone specificity of breakpoints. ReMixT is free, open-source software and is available at http://bitbucket.org/dranew/remixt .


Subject(s)
Breast Neoplasms/genetics , Cystadenocarcinoma, Serous/genetics , Genome, Human , Models, Statistical , Ovarian Neoplasms/genetics , Software , Algorithms , Animals , Breast Neoplasms/metabolism , Breast Neoplasms/pathology , Cell Count , Clone Cells , Cystadenocarcinoma, Serous/metabolism , Cystadenocarcinoma, Serous/pathology , DNA Copy Number Variations , Female , Genotype , Heterografts/metabolism , Heterografts/pathology , Humans , Internet , Mice , Mice, SCID , Neoplastic Cells, Circulating , Ovarian Neoplasms/metabolism , Ovarian Neoplasms/pathology , Translocation, Genetic , Whole Genome Sequencing
SELECTION OF CITATIONS
SEARCH DETAIL