Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 19 de 19
Filtrar
1.
Cell ; 184(8): 2239-2254.e39, 2021 04 15.
Artículo en Inglés | MEDLINE | ID: mdl-33831375

RESUMEN

Intra-tumor heterogeneity (ITH) is a mechanism of therapeutic resistance and therefore an important clinical challenge. However, the extent, origin, and drivers of ITH across cancer types are poorly understood. To address this, we extensively characterize ITH across whole-genome sequences of 2,658 cancer samples spanning 38 cancer types. Nearly all informative samples (95.1%) contain evidence of distinct subclonal expansions with frequent branching relationships between subclones. We observe positive selection of subclonal driver mutations across most cancer types and identify cancer type-specific subclonal patterns of driver gene mutations, fusions, structural variants, and copy number alterations as well as dynamic changes in mutational processes between subclonal expansions. Our results underline the importance of ITH and its drivers in tumor evolution and provide a pan-cancer resource of comprehensively annotated subclonal events from whole-genome sequencing data.


Asunto(s)
Heterogeneidad Genética , Neoplasias/genética , Variaciones en el Número de Copia de ADN , ADN de Neoplasias/química , ADN de Neoplasias/metabolismo , Bases de Datos Genéticas , Resistencia a Antineoplásicos/genética , Humanos , Neoplasias/patología , Polimorfismo de Nucleótido Simple , Secuenciación Completa del Genoma
2.
Nature ; 578(7793): 122-128, 2020 02.
Artículo en Inglés | MEDLINE | ID: mdl-32025013

RESUMEN

Cancer develops through a process of somatic evolution1,2. Sequencing data from a single biopsy represent a snapshot of this process that can reveal the timing of specific genomic aberrations and the changing influence of mutational processes3. Here, by whole-genome sequencing analysis of 2,658 cancers as part of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA)4, we reconstruct the life history and evolution of mutational processes and driver mutation sequences of 38 types of cancer. Early oncogenesis is characterized by mutations in a constrained set of driver genes, and specific copy number gains, such as trisomy 7 in glioblastoma and isochromosome 17q in medulloblastoma. The mutational spectrum changes significantly throughout tumour evolution in 40% of samples. A nearly fourfold diversification of driver genes and increased genomic instability are features of later stages. Copy number alterations often occur in mitotic crises, and lead to simultaneous gains of chromosomal segments. Timing analyses suggest that driver mutations often precede diagnosis by many years, if not decades. Together, these results determine the evolutionary trajectories of cancer, and highlight opportunities for early cancer detection.


Asunto(s)
Evolución Molecular , Genoma Humano/genética , Neoplasias/genética , Reparación del ADN/genética , Dosificación de Gen , Genes Supresores de Tumor , Variación Genética , Humanos , Mutagénesis Insercional/genética
3.
Nat Methods ; 18(2): 144-155, 2021 02.
Artículo en Inglés | MEDLINE | ID: mdl-33398189

RESUMEN

Subclonal reconstruction from bulk tumor DNA sequencing has become a pillar of cancer evolution studies, providing insight into the clonality and relative ordering of mutations and mutational processes. We provide an outline of the complex computational approaches used for subclonal reconstruction from single and multiple tumor samples. We identify the underlying assumptions and uncertainties in each step and suggest best practices for analysis and quality assessment. This guide provides a pragmatic resource for the growing user community of subclonal reconstruction methods.


Asunto(s)
ADN de Neoplasias/genética , Neoplasias/genética , Análisis de Secuencia de ADN/métodos , Algoritmos , Humanos , Polimorfismo de Nucleótido Simple
5.
Bioinformatics ; 34(13): i429-i437, 2018 07 01.
Artículo en Inglés | MEDLINE | ID: mdl-29949959

RESUMEN

Motivation: Alternative splice site selection is inherently competitive and the probability of a given splice site to be used also depends on the strength of neighboring sites. Here, we present a new model named the competitive splice site model (COSSMO), which explicitly accounts for these competitive effects and predicts the percent selected index (PSI) distribution over any number of putative splice sites. We model an alternative splicing event as the choice of a 3' acceptor site conditional on a fixed upstream 5' donor site or the choice of a 5' donor site conditional on a fixed 3' acceptor site. We build four different architectures that use convolutional layers, communication layers, long short-term memory and residual networks, respectively, to learn relevant motifs from sequence alone. We also construct a new dataset from genome annotations and RNA-Seq read data that we use to train our model. Results: COSSMO is able to predict the most frequently used splice site with an accuracy of 70% on unseen test data, and achieve an R2 of 0.6 in modeling the PSI distribution. We visualize the motifs that COSSMO learns from sequence and show that COSSMO recognizes the consensus splice site sequences and many known splicing factors with high specificity. Availability and implementation: Model predictions, our training dataset, and code are available from http://cossmo.genes.toronto.edu. Supplementary information: Supplementary data are available at Bioinformatics online.


Asunto(s)
Empalme Alternativo , Aprendizaje Profundo , Sitios de Empalme de ARN , Análisis de Secuencia de ARN/métodos , Biología Computacional/métodos , Humanos , Modelos Genéticos , Probabilidad , Programas Informáticos
6.
BMC Bioinformatics ; 16: 156, 2015 May 14.
Artículo en Inglés | MEDLINE | ID: mdl-25972088

RESUMEN

BACKGROUND: Tumour samples containing distinct sub-populations of cancer and normal cells present challenges in the development of reproducible biomarkers, as these biomarkers are based on bulk signals from mixed tumour profiles. ISOpure is the only mRNA computational purification method to date that does not require a paired tumour-normal sample, provides a personalized cancer profile for each patient, and has been tested on clinical data. Replacing mixed tumour profiles with ISOpure-preprocessed cancer profiles led to better prognostic gene signatures for lung and prostate cancer. RESULTS: To simplify the integration of ISOpure into standard R-based bioinformatics analysis pipelines, the algorithm has been implemented as an R package. The ISOpureR package performs analogously to the original code in estimating the fraction of cancer cells and the patient cancer mRNA abundance profile from tumour samples in four cancer datasets. CONCLUSIONS: The ISOpureR package estimates the fraction of cancer cells and personalized patient cancer mRNA abundance profile from a mixed tumour profile. This open-source R implementation enables integration into existing computational pipelines, as well as easy testing, modification and extension of the model.


Asunto(s)
Algoritmos , Biología Computacional/métodos , Perfilación de la Expresión Génica , Neoplasias de la Próstata/diagnóstico , Neoplasias de la Próstata/genética , Programas Informáticos , Humanos , Masculino , Modelos Teóricos , Pronóstico
7.
Bioinformatics ; 30(7): 956-61, 2014 Apr 01.
Artículo en Inglés | MEDLINE | ID: mdl-24123674

RESUMEN

MOTIVATION: Gene expression data are currently collected on a wide range of platforms. Differences between platforms make it challenging to combine and compare data collected on different platforms. We propose a new method of cross-platform normalization that uses topic models to summarize the expression patterns in each dataset before normalizing the topics learned from each dataset using per-gene multiplicative weights. RESULTS: This method allows for cross-platform normalization even when samples profiled on different platforms have systematic differences, allows the simultaneous normalization of data from an arbitrary number of platforms and, after suitable training, allows for online normalization of expression data collected individually or in small batches. In addition, our method outperforms existing state-of-the-art platform normalization tools. AVAILABILITY AND IMPLEMENTATION: MATLAB code is available at http://morrislab.med.utoronto.ca/plida/.


Asunto(s)
Perfilación de la Expresión Génica/métodos , Expresión Génica , Neoplasias de la Mama/genética , Regulación Neoplásica de la Expresión Génica , Humanos , Modelos Genéticos , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Programas Informáticos
8.
BMC Bioinformatics ; 15: 35, 2014 Feb 01.
Artículo en Inglés | MEDLINE | ID: mdl-24484323

RESUMEN

BACKGROUND: High-throughput sequencing allows the detection and quantification of frequencies of somatic single nucleotide variants (SNV) in heterogeneous tumor cell populations. In some cases, the evolutionary history and population frequency of the subclonal lineages of tumor cells present in the sample can be reconstructed from these SNV frequency measurements. But automated methods to do this reconstruction are not available and the conditions under which reconstruction is possible have not been described. RESULTS: We describe the conditions under which the evolutionary history can be uniquely reconstructed from SNV frequencies from single or multiple samples from the tumor population and we introduce a new statistical model, PhyloSub, that infers the phylogeny and genotype of the major subclonal lineages represented in the population of cancer cells. It uses a Bayesian nonparametric prior over trees that groups SNVs into major subclonal lineages and automatically estimates the number of lineages and their ancestry. We sample from the joint posterior distribution over trees to identify evolutionary histories and cell population frequencies that have the highest probability of generating the observed SNV frequency data. When multiple phylogenies are consistent with a given set of SNV frequencies, PhyloSub represents the uncertainty in the tumor phylogeny using a "partial order plot". Experiments on a simulated dataset and two real datasets comprising tumor samples from acute myeloid leukemia and chronic lymphocytic leukemia patients demonstrate that PhyloSub can infer both linear (or chain) and branching lineages and its inferences are in good agreement with ground truth, where it is available. CONCLUSIONS: PhyloSub can be applied to frequencies of any "binary" somatic mutation, including SNVs as well as small insertions and deletions. The PhyloSub and partial order plot software is available from https://github.com/morrislab/phylosub/.


Asunto(s)
Evolución Clonal/genética , Biología Computacional/métodos , Neoplasias/genética , Polimorfismo de Nucleótido Simple/genética , Algoritmos , Teorema de Bayes , Técnicas Citológicas , Evolución Molecular , Genotipo , Humanos , Mutación , Neoplasias/clasificación , Filogenia , Programas Informáticos
9.
Nat Biotechnol ; 2024 Jun 11.
Artículo en Inglés | MEDLINE | ID: mdl-38862616

RESUMEN

Subclonal reconstruction algorithms use bulk DNA sequencing data to quantify parameters of tumor evolution, allowing an assessment of how cancers initiate, progress and respond to selective pressures. We launched the ICGC-TCGA (International Cancer Genome Consortium-The Cancer Genome Atlas) DREAM Somatic Mutation Calling Tumor Heterogeneity and Evolution Challenge to benchmark existing subclonal reconstruction algorithms. This 7-year community effort used cloud computing to benchmark 31 subclonal reconstruction algorithms on 51 simulated tumors. Algorithms were scored on seven independent tasks, leading to 12,061 total runs. Algorithm choice influenced performance substantially more than tumor features but purity-adjusted read depth, copy-number state and read mappability were associated with the performance of most algorithms on most tasks. No single algorithm was a top performer for all seven tasks and existing ensemble strategies were unable to outperform the best individual methods, highlighting a key research need. All containerized methods, evaluation code and datasets are available to support further assessment of the determinants of subclonal reconstruction accuracy and development of improved methods to understand tumor evolution.

10.
Nucleic Acid Ther ; 31(6): 392-403, 2021 12.
Artículo en Inglés | MEDLINE | ID: mdl-34388351

RESUMEN

Steric-blocking oligonucleotides (SBOs) are short, single-stranded nucleic acids designed to modulate gene expression by binding to RNA transcripts and blocking access from cellular machinery such as splicing factors. SBOs have the potential to bind to near-complementary sites in the transcriptome, causing off-target effects. In this study, we used RNA-seq to evaluate the off-target differential splicing events of 81 SBOs and differential expression events of 46 SBOs. Our results suggest that differential splicing events are predominantly hybridization driven, whereas differential expression events are more common and driven by other mechanisms (including spurious experimental variation). We further evaluated the performance of in silico screens for off-target splicing events, and found an edit distance cutoff of three to result in a sensitivity of 14% and false discovery rate (FDR) of 99%. A machine learning model incorporating splicing predictions substantially improved the ability to prioritize low edit distance hits, increasing sensitivity from 4% to 26% at a fixed FDR of 90%. Despite these large improvements in performance, this approach does not detect the majority of events at an FDR <99%. Our results suggest that in silico methods are currently of limited use for predicting the off-target effects of SBOs, and experimental screening by RNA-seq should be the preferred approach.


Asunto(s)
Oligonucleótidos , Transcriptoma , Empalme Alternativo , Oligonucleótidos/genética , Oligonucleótidos Antisentido , ARN/genética , ARN/metabolismo , Empalme del ARN/genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA