Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 27
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
bioRxiv ; 2024 Mar 31.
Artigo em Inglês | MEDLINE | ID: mdl-38585946

RESUMO

Gene expression is a multi-step transformation of biological information from its storage form (DNA) into functional forms (protein and some RNAs). Regulatory activities at each step of this transformation multiply a single gene into a myriad of proteoforms. Proteogenomics is the study of how genomic and transcriptomic variation creates this proteoform diversity, and is limited by the challenges of modeling the complexities of gene-expression. We therefore created moPepGen, a graph-based algorithm that comprehensively enumerates proteoforms in linear time. moPepGen works with multiple technologies, in multiple species and on all types of genetic and transcriptomic data. In human cancer proteomes, it detects and quantifies previously unobserved noncanonical peptides arising from germline and somatic genomic variants, noncoding open reading frames, RNA fusions and RNA circularization. By enabling efficient identification and quantitation of previously hidden proteins in both existing and new proteomic data, moPepGen facilitates all proteogenomics applications. It is available at: https://github.com/uclahs-cds/package-moPepGen.

2.
Bioinformatics ; 40(2)2024 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-38341660

RESUMO

MOTIVATION: The ongoing expansion in the volume of biomedical data has contributed to a growing complexity in the tools and technologies used in research with an increased reliance on complex workflows written in orchestration languages such as Nextflow to integrate algorithms into processing pipelines. The growing use of workflows involving various tools and algorithms has led to increased scrutiny of software development practices to avoid errors in individual tools and in the connections between them. RESULTS: To facilitate test-driven development of Nextflow pipelines, we created NFTest, a framework for automated pipeline testing and validation with customizability options for Nextflow features. It is open-source, easy to initialize and use, and customizable to allow for testing of complex workflows with test success configurable through a broad range of assertions. NFTest simplifies the testing burden on developers by automating tests once defined and providing a flexible interface for running tests to validate workflows. This reduces the barrier to rigorous biomedical workflow testing and paves the way toward reducing computational errors in biomedicine. AVAILABILITY AND IMPLEMENTATION: NFTest is an open-source Python framework under the GPLv2 license and is freely available at https://github.com/uclahs-cds/tool-NFTest. The call-sSNV Nextflow pipeline is available at: https://github.com/uclahs-cds/pipeline-call-sSNV.


Assuntos
Biologia Computacional , Software , Algoritmos , Idioma , Fluxo de Trabalho
3.
Bioinformatics ; 40(2)2024 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-38341658

RESUMO

MOTIVATION: The volume of biomedical data generated each year is growing exponentially as high-throughput molecular, imaging and mHealth technologies expand. This rise in data volume has contributed to an increasing reliance on and demand for computational methods, and consequently to increased attention to software quality and data integrity. RESULTS: To simplify data verification in diverse data-processing pipelines, we created PipeVal, a light-weight, easy-to-use, extensible tool for file validation. It is open-source, easy to integrate with complex workflows, and modularized for extensibility for new file formats. PipeVal can be rapidly inserted into existing methods and pipelines to automatically validate and verify inputs and outputs. This can reduce wasted compute time attributed to file corruption or invalid file paths, and significantly improve the quality of data-intensive software. AVAILABILITY AND IMPLEMENTATION: PipeVal is an open-source Python package under the GPLv2 license and it is freely available at https://github.com/uclahs-cds/package-PipeVal. The docker image is available at: https://github.com/uclahs-cds/package-PipeVal/pkgs/container/pipeval.


Assuntos
Software , Fluxo de Trabalho
4.
Cell Rep ; 43(3): 113826, 2024 Mar 26.
Artigo em Inglês | MEDLINE | ID: mdl-38412093

RESUMO

Anaplastic thyroid carcinoma is arguably the most lethal human malignancy. It often co-occurs with differentiated thyroid cancers, yet the molecular origins of its aggressivity are unknown. We sequenced tumor DNA from 329 regions of thyroid cancer, including 213 from patients with primary anaplastic thyroid carcinomas. We also whole genome sequenced 9 patients using multi-region sequencing of both differentiated and anaplastic thyroid cancer components. Using these data, we demonstrate thatanaplastic thyroid carcinomas have a higher burden of mutations than other thyroid cancers, with distinct mutational signatures and molecular subtypes. Further, different cancer driver genes are mutated in anaplastic and differentiated thyroid carcinomas, even those arising in a single patient. Finally, we unambiguously demonstrate that anaplastic thyroid carcinomas share a genomic origin with co-occurring differentiated carcinomas and emerge from a common malignant field through acquisition of characteristic clonal driver mutations.


Assuntos
Adenocarcinoma , Carcinoma Anaplásico da Tireoide , Neoplasias da Glândula Tireoide , Humanos , Carcinoma Anaplásico da Tireoide/genética , Carcinoma Anaplásico da Tireoide/patologia , Neoplasias da Glândula Tireoide/genética , Neoplasias da Glândula Tireoide/patologia , Mutação/genética , Genômica
5.
Cell ; 187(2): 446-463.e16, 2024 01 18.
Artigo em Inglês | MEDLINE | ID: mdl-38242087

RESUMO

Treatment failure for the lethal brain tumor glioblastoma (GBM) is attributed to intratumoral heterogeneity and tumor evolution. We utilized 3D neuronavigation during surgical resection to acquire samples representing the whole tumor mapped by 3D spatial coordinates. Integrative tissue and single-cell analysis revealed sources of genomic, epigenomic, and microenvironmental intratumoral heterogeneity and their spatial patterning. By distinguishing tumor-wide molecular features from those with regional specificity, we inferred GBM evolutionary trajectories from neurodevelopmental lineage origins and initiating events such as chromothripsis to emergence of genetic subclones and spatially restricted activation of differential tumor and microenvironmental programs in the core, periphery, and contrast-enhancing regions. Our work depicts GBM evolution and heterogeneity from a 3D whole-tumor perspective, highlights potential therapeutic targets that might circumvent heterogeneity-related failures, and establishes an interactive platform enabling 360° visualization and analysis of 3D spatial patterns for user-selected genes, programs, and other features across whole GBM tumors.


Assuntos
Neoplasias Encefálicas , Glioblastoma , Modelos Biológicos , Humanos , Neoplasias Encefálicas/genética , Neoplasias Encefálicas/patologia , Epigenômica , Genômica , Glioblastoma/genética , Glioblastoma/patologia , Análise de Célula Única , Microambiente Tumoral , Heterogeneidade Genética
6.
Nat Med ; 29(6): 1370-1378, 2023 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-37188783

RESUMO

Immune-mediated anti-tumoral responses, elicited by oncolytic viruses and augmented with checkpoint inhibition, may be an effective treatment approach for glioblastoma. Here in this multicenter phase 1/2 study we evaluated the combination of intratumoral delivery of oncolytic virus DNX-2401 followed by intravenous anti-PD-1 antibody pembrolizumab in recurrent glioblastoma, first in a dose-escalation and then in a dose-expansion phase, in 49 patients. The primary endpoints were overall safety and objective response rate. The primary safety endpoint was met, whereas the primary efficacy endpoint was not met. There were no dose-limiting toxicities, and full dose combined treatment was well tolerated. The objective response rate was 10.4% (90% confidence interval (CI) 4.2-20.7%), which was not statistically greater than the prespecified control rate of 5%. The secondary endpoint of overall survival at 12 months was 52.7% (95% CI 40.1-69.2%), which was statistically greater than the prespecified control rate of 20%. Median overall survival was 12.5 months (10.7-13.5 months). Objective responses led to longer survival (hazard ratio 0.20, 95% CI 0.05-0.87). A total of 56.2% (95% CI 41.1-70.5%) of patients had a clinical benefit defined as stable disease or better. Three patients completed treatment with durable responses and remain alive at 45, 48 and 60 months. Exploratory mutational, gene-expression and immunophenotypic analyses revealed that the balance between immune cell infiltration and expression of checkpoint inhibitors may potentially inform on response to treatment and mechanisms of resistance. Overall, the combination of intratumoral DNX-2401 followed by pembrolizumab was safe with notable survival benefit in select patients (ClinicalTrials.gov registration: NCT02798406).


Assuntos
Glioblastoma , Terapia Viral Oncolítica , Vírus Oncolíticos , Humanos , Glioblastoma/tratamento farmacológico , Recidiva Local de Neoplasia/tratamento farmacológico , Anticorpos Monoclonais Humanizados , Terapia Viral Oncolítica/efeitos adversos , Vírus Oncolíticos/genética , Protocolos de Quimioterapia Combinada Antineoplásica/efeitos adversos
7.
J Natl Cancer Inst ; 115(4): 468-472, 2023 04 11.
Artigo em Inglês | MEDLINE | ID: mdl-36610996

RESUMO

Prostate cancer is one of the most heritable cancers. Hundreds of germline polymorphisms have been linked to prostate cancer diagnosis and prognosis. Polygenic risk scores can predict genetic risk of a prostate cancer diagnosis. Although these scores inform the probability of developing a tumor, it remains unknown how germline risk influences the tumor molecular evolution. We cultivated a cohort of 1250 localized European-descent patients with germline and somatic DNA profiling. Men of European descent with higher genetic risk were diagnosed earlier and had less genomic instability and fewer driver genes mutated. Higher genetic risk was associated with better outcome. These data imply a polygenic "two-hit" model where germline risk reduces the number of somatic alterations required for tumorigenesis. These findings support further clinical studies of polygenic risk scores as inexpensive and minimally invasive adjuncts to standard risk stratification. Further studies are required to interrogate generalizability to more ancestrally and clinically diverse populations.


Assuntos
Neoplasias da Próstata , Masculino , Humanos , Neoplasias da Próstata/genética , Neoplasias da Próstata/patologia , Fatores de Risco , Prognóstico , Predisposição Genética para Doença
8.
Nat Commun ; 13(1): 1898, 2022 04 07.
Artigo em Inglês | MEDLINE | ID: mdl-35393414

RESUMO

Recent advances in cancer therapeutics clearly demonstrate the need for innovative multiplex therapies that attack the tumour on multiple fronts. Oncolytic or "cancer-killing" viruses (OVs) represent up-and-coming multi-mechanistic immunotherapeutic drugs for the treatment of cancer. In this study, we perform an in-vitro screen based on virus-encoded artificial microRNAs (amiRNAs) and find that a unique amiRNA, herein termed amiR-4, confers a replicative advantage to the VSVΔ51 OV platform. Target validation of amiR-4 reveals ARID1A, a protein involved in chromatin remodelling, as an important player in resistance to OV replication. Virus-directed targeting of ARID1A coupled with small-molecule inhibition of the methyltransferase EZH2 leads to the synthetic lethal killing of both infected and uninfected tumour cells. The bystander killing of uninfected cells is mediated by intercellular transfer of extracellular vesicles carrying amiR-4 cargo. Altogether, our findings establish that OVs can serve as replicating vehicles for amiRNA therapeutics with the potential for combination with small molecule and immune checkpoint inhibitor therapy.


Assuntos
Vesículas Extracelulares , MicroRNAs , Neoplasias , Terapia Viral Oncolítica , Vírus Oncolíticos , Humanos , MicroRNAs/genética , Neoplasias/terapia , Vírus Oncolíticos/genética
9.
Nat Commun ; 12(1): 6893, 2021 11 25.
Artigo em Inglês | MEDLINE | ID: mdl-34824250

RESUMO

Replicative immortality is a hallmark of cancer, and can be achieved through telomere lengthening and maintenance. Although the role of telomere length in cancer has been well studied, its association to genomic features is less well known. Here, we report the telomere lengths of 392 localized prostate cancer tumours and characterize their relationship to genomic, transcriptomic and proteomic features. Shorter tumour telomere lengths are associated with elevated genomic instability, including single-nucleotide variants, indels and structural variants. Genes involved in cell proliferation and signaling are correlated with tumour telomere length at all levels of the central dogma. Telomere length is also associated with multiple clinical features of a tumour. Longer telomere lengths in non-tumour samples are associated with a lower rate of biochemical relapse. In summary, we describe the multi-level integration of telomere length, genomics, transcriptomics and proteomics in localized prostate cancer.


Assuntos
Neoplasias da Próstata/genética , Telômero/genética , Variações do Número de Cópias de DNA , Epigenoma , Fusão Gênica , Genômica , Humanos , Masculino , Neoplasias da Próstata/metabolismo , Neoplasias da Próstata/patologia , Proteoma , Telomerase/genética , Telomerase/metabolismo , Transcriptoma
10.
J Natl Cancer Inst ; 113(6): 742-751, 2021 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-33429428

RESUMO

BACKGROUND: Patients with human papillomavirus-related oropharyngeal cancers have excellent outcomes but experience clinically significant toxicities when treated with standard chemoradiotherapy (70 Gy). We hypothesized that functional imaging could identify patients who could be safely deescalated to 30 Gy of radiotherapy. METHODS: In 19 patients, pre- and intratreatment dynamic fluorine-18-labeled fluoromisonidazole positron emission tomography (PET) was used to assess tumor hypoxia. Patients without hypoxia at baseline or intratreatment received 30 Gy; patients with persistent hypoxia received 70 Gy. Neck dissection was performed at 4 months in deescalated patients to assess pathologic response. Magnetic resonance imaging (weekly), circulating plasma cell-free DNA, RNA-sequencing, and whole-genome sequencing (WGS) were performed to identify potential molecular determinants of response. Samples from an independent prospective study were obtained to reproduce molecular findings. All statistical tests were 2-sided. RESULTS: Fifteen of 19 patients had no hypoxia on baseline PET or resolution on intratreatment PET and were deescalated to 30 Gy. Of these 15 patients, 11 had a pathologic complete response. Two-year locoregional control and overall survival were 94.4% (95% confidence interval = 84.4% to 100%) and 94.7% (95% confidence interval = 85.2% to 100%), respectively. No acute grade 3 radiation-related toxicities were observed. Microenvironmental features on serial imaging correlated better with pathologic response than tumor burden metrics or circulating plasma cell-free DNA. A WGS-based DNA repair defect was associated with response (P = .02) and was reproduced in an independent cohort (P = .03). CONCLUSIONS: Deescalation of radiotherapy to 30 Gy on the basis of intratreatment hypoxia imaging was feasible, safe, and associated with minimal toxicity. A DNA repair defect identified by WGS was predictive of response. Intratherapy personalization of chemoradiotherapy may facilitate marked deescalation of radiotherapy.


Assuntos
Neoplasias Orofaríngeas , Quimiorradioterapia/métodos , Humanos , Neoplasias Orofaríngeas/radioterapia , Tomografia por Emissão de Pósitrons , Estudos Prospectivos , Dosagem Radioterapêutica , Hipóxia Tumoral
11.
Nat Commun ; 11(1): 441, 2020 01 23.
Artigo em Inglês | MEDLINE | ID: mdl-31974375

RESUMO

Prostate cancer is the second most commonly diagnosed malignancy among men worldwide. Recurrently mutated in primary and metastatic prostate tumors, FOXA1 encodes a pioneer transcription factor involved in disease onset and progression through both androgen receptor-dependent and androgen receptor-independent mechanisms. Despite its oncogenic properties however, the regulation of FOXA1 expression remains unknown. Here, we identify a set of six cis-regulatory elements in the FOXA1 regulatory plexus harboring somatic single-nucleotide variants in primary prostate tumors. We find that deletion and repression of these cis-regulatory elements significantly decreases FOXA1 expression and prostate cancer cell growth. Six of the ten single-nucleotide variants mapping to FOXA1 regulatory plexus significantly alter the transactivation potential of cis-regulatory elements by modulating the binding of transcription factors. Collectively, our results identify cis-regulatory elements within the FOXA1 plexus mutated in primary prostate tumors as potential targets for therapeutic intervention.


Assuntos
Fator 3-alfa Nuclear de Hepatócito/genética , Mutação , Neoplasias da Próstata/genética , Sequências Reguladoras de Ácido Nucleico , Sítios de Ligação , Linhagem Celular Tumoral , Proliferação de Células/genética , Regulação Neoplásica da Expressão Gênica , Humanos , Masculino , Fatores de Transcrição/metabolismo
12.
Cancer Cell ; 36(6): 674-689.e6, 2019 12 09.
Artigo em Inglês | MEDLINE | ID: mdl-31735626

RESUMO

Thousands of noncoding somatic single-nucleotide variants (SNVs) of unknown function are reported in tumors. Partitioning the genome according to cistromes reveals the enrichment of somatic SNVs in prostate tumors as opposed to adjacent normal tissue cistromes of master transcription regulators, including AR, FOXA1, and HOXB13. This parallels enrichment of prostate cancer genetic predispositions over these transcription regulators' tumor cistromes, exemplified at the 8q24 locus harboring both risk variants and somatic SNVs in cis-regulatory elements upregulating MYC expression. However, Massively Parallel Reporter Assays reveal that few SNVs can alter the transactivation potential of individual cis-regulatory elements. Instead, similar to inherited risk variants, SNVs accumulate in cistromes of master transcription regulators required for prostate cancer development.


Assuntos
Regulação Neoplásica da Expressão Gênica/genética , Fator 3-alfa Nuclear de Hepatócito/metabolismo , Proteínas de Homeodomínio/metabolismo , Neoplasias da Próstata/metabolismo , Linhagem Celular Tumoral , Proteínas de Homeodomínio/genética , Humanos , Masculino , Mutação/genética , Neoplasias da Próstata/patologia , Regulação para Cima/genética
13.
Nat Commun ; 10(1): 5251, 2019 11 20.
Artigo em Inglês | MEDLINE | ID: mdl-31748536

RESUMO

Metastatic castration-resistant prostate cancer (mCRPC) has a highly complex genomic landscape. With the recent development of novel treatments, accurate stratification strategies are needed. Here we present the whole-genome sequencing (WGS) analysis of fresh-frozen metastatic biopsies from 197 mCRPC patients. Using unsupervised clustering based on genomic features, we define eight distinct genomic clusters. We observe potentially clinically relevant genotypes, including microsatellite instability (MSI), homologous recombination deficiency (HRD) enriched with genomic deletions and BRCA2 aberrations, a tandem duplication genotype associated with CDK12-/- and a chromothripsis-enriched subgroup. Our data suggests that stratification on WGS characteristics may improve identification of MSI, CDK12-/- and HRD patients. From WGS and ChIP-seq data, we show the potential relevance of recurrent alterations in non-coding regions identified with WGS and highlight the central role of AR signaling in tumor progression. These data underline the potential value of using WGS to accurately stratify mCRPC patients into clinically actionable subgroups.


Assuntos
Neoplasias Ósseas/genética , Neoplasias Hepáticas/genética , Neoplasias de Próstata Resistentes à Castração/genética , Proteína BRCA2/genética , Neoplasias Ósseas/secundário , Sequenciamento de Cromatina por Imunoprecipitação , Quinases Ciclina-Dependentes/genética , Variações do Número de Cópias de DNA , Elementos Facilitadores Genéticos/genética , Genótipo , Humanos , Neoplasias Hepáticas/secundário , Linfonodos , Masculino , Instabilidade de Microssatélites , Metástase Neoplásica , Proteínas de Fusão Oncogênica/genética , Neoplasias da Próstata/genética , Neoplasias da Próstata/patologia , Neoplasias de Próstata Resistentes à Castração/classificação , Neoplasias de Próstata Resistentes à Castração/secundário , Receptores Androgênicos/genética , Reparo de DNA por Recombinação/genética , Neoplasias de Tecidos Moles/genética , Neoplasias de Tecidos Moles/secundário , Sequenciamento Completo do Genoma
14.
Nat Med ; 25(10): 1615-1626, 2019 10.
Artigo em Inglês | MEDLINE | ID: mdl-31591588

RESUMO

Oncogenesis is driven by germline, environmental and stochastic factors. It is unknown how these interact to produce the molecular phenotypes of tumors. We therefore quantified the influence of germline polymorphisms on the somatic epigenome of 589 localized prostate tumors. Predisposition risk loci influence a tumor's epigenome, uncovering a mechanism for cancer susceptibility. We identified and validated 1,178 loci associated with altered methylation in tumoral but not nonmalignant tissue. These tumor methylation quantitative trait loci influence chromatin structure, as well as RNA and protein abundance. One prominent tumor methylation quantitative trait locus is associated with AKT1 expression and is predictive of relapse after definitive local therapy in both discovery and validation cohorts. These data reveal intricate crosstalk between the germ line and the epigenome of primary tumors, which may help identify germline biomarkers of aggressive disease to aid patient triage and optimize the use of more invasive or expensive diagnostic assays.


Assuntos
Metilação de DNA/genética , Epigenoma/genética , Mutação em Linhagem Germinativa/genética , Neoplasias da Próstata/genética , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Predisposição Genética para Doença , Genoma Humano/genética , Humanos , Masculino , Recidiva Local de Neoplasia/genética , Recidiva Local de Neoplasia/patologia , Neoplasias da Próstata/patologia , Proteínas Proto-Oncogênicas c-akt/genética , Locos de Características Quantitativas/genética
15.
Nat Genet ; 51(2): 308-318, 2019 02.
Artigo em Inglês | MEDLINE | ID: mdl-30643250

RESUMO

Many primary-tumor subregions have low levels of molecular oxygen, termed hypoxia. Hypoxic tumors are at elevated risk for local failure and distant metastasis, but the molecular hallmarks of tumor hypoxia remain poorly defined. To fill this gap, we quantified hypoxia in 8,006 tumors across 19 tumor types. In ten tumor types, hypoxia was associated with elevated genomic instability. In all 19 tumor types, hypoxic tumors exhibited characteristic driver-mutation signatures. We observed widespread hypoxia-associated dysregulation of microRNAs (miRNAs) across cancers and functionally validated miR-133a-3p as a hypoxia-modulated miRNA. In localized prostate cancer, hypoxia was associated with elevated rates of chromothripsis, allelic loss of PTEN and shorter telomeres. These associations are particularly enriched in polyclonal tumors, representing a constellation of features resembling tumor nimbosus, an aggressive cellular phenotype. Overall, this work establishes that tumor hypoxia may drive aggressive molecular features across cancers and shape the clinical trajectory of individual tumors.


Assuntos
Hipóxia/genética , Neoplasias da Próstata/genética , Hipóxia Tumoral/genética , Alelos , Linhagem Celular Tumoral , Cromotripsia , Perfilação da Expressão Gênica/métodos , Regulação Neoplásica da Expressão Gênica/genética , Instabilidade Genômica/genética , Humanos , Masculino , MicroRNAs/genética , Células PC-3 , PTEN Fosfo-Hidrolase/genética , Telômero/genética
16.
Cancer Cell ; 34(6): 996-1011.e8, 2018 12 10.
Artigo em Inglês | MEDLINE | ID: mdl-30537516

RESUMO

Identifying the earliest somatic changes in prostate cancer can give important insights into tumor evolution and aids in stratifying high- from low-risk disease. We integrated whole genome, transcriptome and methylome analysis of early-onset prostate cancers (diagnosis ≤55 years). Characterization across 292 prostate cancer genomes revealed age-related genomic alterations and a clock-like enzymatic-driven mutational process contributing to the earliest mutations in prostate cancer patients. Our integrative analysis identified four molecular subgroups, including a particularly aggressive subgroup with recurrent duplications associated with increased expression of ESRP1, which we validate in 12,000 tissue microarray tumors. Finally, we combined the patterns of molecular co-occurrence and risk-based subgroup information to deconvolve the molecular and clinical trajectories of prostate cancer from single patient samples.


Assuntos
Biomarcadores Tumorais/genética , Metilação de DNA , Regulação Neoplásica da Expressão Gênica , Neoplasias da Próstata/genética , Transcriptoma , Adulto , Biomarcadores Tumorais/metabolismo , Evolução Molecular , Humanos , Masculino , Pessoa de Meia-Idade , Mutação , Neoplasias da Próstata/metabolismo , Neoplasias da Próstata/patologia , Proteínas de Ligação a RNA/genética , Proteínas de Ligação a RNA/metabolismo , Fatores de Risco , Sequenciamento Completo do Genoma/métodos
17.
Genome Biol ; 19(1): 188, 2018 11 06.
Artigo em Inglês | MEDLINE | ID: mdl-30400818

RESUMO

BACKGROUND: The phenotypes of cancer cells are driven in part by somatic structural variants. Structural variants can initiate tumors, enhance their aggressiveness, and provide unique therapeutic opportunities. Whole-genome sequencing of tumors can allow exhaustive identification of the specific structural variants present in an individual cancer, facilitating both clinical diagnostics and the discovery of novel mutagenic mechanisms. A plethora of somatic structural variant detection algorithms have been created to enable these discoveries; however, there are no systematic benchmarks of them. Rigorous performance evaluation of somatic structural variant detection methods has been challenged by the lack of gold standards, extensive resource requirements, and difficulties arising from the need to share personal genomic information. RESULTS: To facilitate structural variant detection algorithm evaluations, we create a robust simulation framework for somatic structural variants by extending the BAMSurgeon algorithm. We then organize and enable a crowdsourced benchmarking within the ICGC-TCGA DREAM Somatic Mutation Calling Challenge (SMC-DNA). We report here the results of structural variant benchmarking on three different tumors, comprising 204 submissions from 15 teams. In addition to ranking methods, we identify characteristic error profiles of individual algorithms and general trends across them. Surprisingly, we find that ensembles of analysis pipelines do not always outperform the best individual method, indicating a need for new ways to aggregate somatic structural variant detection approaches. CONCLUSIONS: The synthetic tumors and somatic structural variant detection leaderboards remain available as a community benchmarking resource, and BAMSurgeon is available at https://github.com/adamewing/bamsurgeon .


Assuntos
Benchmarking , Simulação por Computador , Crowdsourcing , Variação Genética , Genoma Humano , Genômica/métodos , Neoplasias/genética , Algoritmos , Bases de Dados Genéticas , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Software
18.
BMC Bioinformatics ; 19(1): 339, 2018 Sep 25.
Artigo em Inglês | MEDLINE | ID: mdl-30253747

RESUMO

BACKGROUND: Platform-specific error profiles necessitate confirmatory studies where predictions made on data generated using one technology are additionally verified by processing the same samples on an orthogonal technology. However, verifying all predictions can be costly and redundant, and testing a subset of findings is often used to estimate the true error profile. RESULTS: To determine how to create subsets of predictions for validation that maximize accuracy of global error profile inference, we developed Valection, a software program that implements multiple strategies for the selection of verification candidates. We evaluated these selection strategies on one simulated and two experimental datasets. CONCLUSIONS: Valection is implemented in multiple programming languages, available at: http://labs.oicr.on.ca/boutros-lab/software/valection.


Assuntos
Análise de Sequência de DNA/métodos , Validação de Programas de Computador
19.
Cell ; 173(4): 1003-1013.e15, 2018 05 03.
Artigo em Inglês | MEDLINE | ID: mdl-29681457

RESUMO

The majority of newly diagnosed prostate cancers are slow growing, with a long natural life history. Yet a subset can metastasize with lethal consequences. We reconstructed the phylogenies of 293 localized prostate tumors linked to clinical outcome data. Multiple subclones were detected in 59% of patients, and specific subclonal architectures associate with adverse clinicopathological features. Early tumor development is characterized by point mutations and deletions followed by later subclonal amplifications and changes in trinucleotide mutational signatures. Specific genes are selectively mutated prior to or following subclonal diversification, including MTOR, NKX3-1, and RB1. Patients with low-risk monoclonal tumors rarely relapse after primary therapy (7%), while those with high-risk polyclonal tumors frequently do (61%). The presence of multiple subclones in an index biopsy may be necessary, but not sufficient, for relapse of localized prostate cancer, suggesting that evolution-aware biomarkers should be studied in prospective studies of low-risk tumors suitable for active surveillance.


Assuntos
Neoplasias da Próstata/patologia , Biomarcadores Tumorais/sangue , Sequenciamento de Nucleotídeos em Larga Escala , Proteínas de Homeodomínio/genética , Proteínas de Homeodomínio/metabolismo , Humanos , Masculino , Gradação de Tumores , Recidiva Local de Neoplasia , Polimorfismo de Nucleotídeo Único , Modelos de Riscos Proporcionais , Estudos Prospectivos , Neoplasias da Próstata/classificação , Neoplasias da Próstata/genética , Proteínas de Ligação a Retinoblastoma/genética , Proteínas de Ligação a Retinoblastoma/metabolismo , Serina-Treonina Quinases TOR/genética , Serina-Treonina Quinases TOR/metabolismo , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Ubiquitina-Proteína Ligases/genética , Ubiquitina-Proteína Ligases/metabolismo
20.
BMC Bioinformatics ; 19(1): 28, 2018 01 31.
Artigo em Inglês | MEDLINE | ID: mdl-29385983

RESUMO

BACKGROUND: The clinical sequencing of cancer genomes to personalize therapy is becoming routine across the world. However, concerns over patient re-identification from these data lead to questions about how tightly access should be controlled. It is not thought to be possible to re-identify patients from somatic variant data. However, somatic variant detection pipelines can mistakenly identify germline variants as somatic ones, a process called "germline leakage". The rate of germline leakage across different somatic variant detection pipelines is not well-understood, and it is uncertain whether or not somatic variant calls should be considered re-identifiable. To fill this gap, we quantified germline leakage across 259 sets of whole-genome somatic single nucleotide variant (SNVs) predictions made by 21 teams as part of the ICGC-TCGA DREAM Somatic Mutation Calling Challenge. RESULTS: The median somatic SNV prediction set contained 4325 somatic SNVs and leaked one germline polymorphism. The level of germline leakage was inversely correlated with somatic SNV prediction accuracy and positively correlated with the amount of infiltrating normal cells. The specific germline variants leaked differed by tumour and algorithm. To aid in quantitation and correction of leakage, we created a tool, called GermlineFilter, for use in public-facing somatic SNV databases. CONCLUSIONS: The potential for patient re-identification from leaked germline variants in somatic SNV predictions has led to divergent open data access policies, based on different assessments of the risks. Indeed, a single, well-publicized re-identification event could reshape public perceptions of the values of genomic data sharing. We find that modern somatic SNV prediction pipelines have low germline-leakage rates, which can be further reduced, especially for cloud-sharing, using pre-filtering software.


Assuntos
Genoma Humano , Células Germinativas/metabolismo , Polimorfismo de Nucleotídeo Único , Algoritmos , Humanos , Internet , Neoplasias/genética , Neoplasias/patologia , Interface Usuário-Computador , Sequenciamento Completo do Genoma
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...