Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Sci Rep ; 11(1): 14411, 2021 07 13.
Artículo en Inglés | MEDLINE | ID: mdl-34257393

RESUMEN

Over the past years, large consortia have been established to fuel the sequencing of whole genomes of many cancer patients. Despite the increased abundance in tools to study the impact of SNVs, non-coding SVs have been largely ignored in these data. Here, we introduce svMIL2, an improved version of our Multiple Instance Learning-based method to study the effect of somatic non-coding SVs disrupting boundaries of TADs and CTCF loops in 1646 cancer genomes. We demonstrate that svMIL2 predicts pathogenic non-coding SVs with an average AUC of 0.86 across 12 cancer types, and identifies non-coding SVs affecting well-known driver genes. The disruption of active (super) enhancers in open chromatin regions appears to be a common mechanism by which non-coding SVs exert their pathogenicity. Finally, our results reveal that the contribution of pathogenic non-coding SVs as opposed to driver SNVs may highly vary between cancers, with notably high numbers of genes being disrupted by pathogenic non-coding SVs in ovarian and pancreatic cancer. Taken together, our machine learning method offers a potent way to prioritize putatively pathogenic non-coding SVs and leverage non-coding SVs to identify driver genes. Moreover, our analysis of 1646 cancer genomes demonstrates the importance of including non-coding SVs in cancer diagnostics.


Asunto(s)
Genoma Humano , Variación Estructural del Genoma , Humanos , Aprendizaje Automático , Neoplasias/genética
2.
Bioinformatics ; 36(Suppl_2): i692-i699, 2020 12 30.
Artículo en Inglés | MEDLINE | ID: mdl-33381833

RESUMEN

MOTIVATION: Despite the fact that structural variants (SVs) play an important role in cancer, methods to predict their effect, especially for SVs in non-coding regions, are lacking, leaving them often overlooked in the clinic. Non-coding SVs may disrupt the boundaries of Topologically Associated Domains (TADs), thereby affecting interactions between genes and regulatory elements such as enhancers. However, it is not known when such alterations are pathogenic. Although machine learning techniques are a promising solution to answer this question, representing the large number of interactions that an SV can disrupt in a single feature matrix is not trivial. RESULTS: We introduce svMIL: a method to predict pathogenic TAD boundary-disrupting SV effects based on multiple instance learning, which circumvents the need for a traditional feature matrix by grouping SVs into bags that can contain any number of disruptions. We demonstrate that svMIL can predict SV pathogenicity, measured through same-sample gene expression aberration, for various cancer types. In addition, our approach reveals that somatic pathogenic SVs alter different regulatory interactions than somatic non-pathogenic SVs and germline SVs. AVAILABILITY AND IMPLEMENTATION: All code for svMIL is publicly available on GitHub: https://github.com/UMCUGenetics/svMIL. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Neoplasias , Humanos , Aprendizaje Automático , Neoplasias/genética
3.
Br J Cancer ; 120(4): 444-452, 2019 02.
Artículo en Inglés | MEDLINE | ID: mdl-30739914

RESUMEN

BACKGROUND: Testicular germ cell cancer (TGCC), being the most frequent malignancy in young Caucasian males, is initiated from an embryonic germ cell. This study determines intratumour heterogeneity to unravel tumour progression from initiation until metastasis. METHODS: In total, 42 purified samples of four treatment-resistant nonseminomatous (NS) TGCC were investigated, including the precursor germ cell neoplasia in situ (GCNIS) and metastatic specimens, using whole-genome and targeted sequencing. Their evolution was reconstructed. RESULTS: Intratumour molecular heterogeneity did not correspond to the supposed primary tumour histological evolution. Metastases after systemic treatment could be derived from cancer stem cells not identified in the primary cancer. GCNIS mostly lacked the molecular marks of the primary NS and comprised dominant clones that failed to progress. A BRCA-like mutational signature was observed without evidence for direct involvement of BRCA1 and BRCA2 genes. CONCLUSIONS: Our data strongly support the hypothesis that NS is initiated by whole-genome duplication, followed by chromosome copy number alterations in the cancer stem cell population, and accumulation of low numbers of somatic mutations, even in therapy-resistant cases. These observations of heterogeneity at all stages of tumourigenesis should be considered when treating patients with GCNIS-only disease, or with clinically overt NS.


Asunto(s)
Neoplasias de Células Germinales y Embrionarias/genética , Neoplasias Testiculares/genética , Evolución Molecular , Genes BRCA1 , Genes BRCA2 , Humanos , Pérdida de Heterocigocidad , Masculino , Mutación , Metástasis de la Neoplasia , Neoplasias de Células Germinales y Embrionarias/patología , Neoplasias Testiculares/patología , Secuenciación Completa del Genoma
4.
PLoS One ; 13(11): e0208002, 2018.
Artículo en Inglés | MEDLINE | ID: mdl-30496231

RESUMEN

Most tumors are composed of a heterogeneous population of subclones. A more detailed insight into the subclonal evolution of these tumors can be helpful to study progression and treatment response. Problematically, tumor samples are typically very heterogeneous, making deconvolving individual tumor subclones a major challenge. To overcome this limitation, reducing heterogeneity, such as by means of microdissections, coupled with targeted sequencing, is a viable approach. However, computational methods that enable reconstruction of the evolutionary relationships require unbiased read depth measurements, which are commonly challenging to obtain in this setting. We introduce TargetClone, a novel method to reconstruct the subclonal evolution tree of tumors from single-nucleotide polymorphism allele frequency and somatic single-nucleotide variant measurements. Furthermore, our method infers copy numbers, alleles and the fraction of the tumor component in each sample. TargetClone was specifically designed for targeted sequencing data obtained from microdissected samples. We demonstrate that our method obtains low error rates on simulated data. Additionally, we show that our method is able to reconstruct expected trees in a testicular germ cell cancer and ovarian cancer dataset. The TargetClone package including tree visualization is written in Python and is publicly available at https://github.com/UMCUGenetics/targetclone.


Asunto(s)
Evolución Clonal/genética , Biología Computacional/métodos , Neoplasias/genética , Algoritmos , Alelos , Variaciones en el Número de Copia de ADN/genética , Frecuencia de los Genes/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Mutación/genética , Polimorfismo de Nucleótido Simple/genética , Programas Informáticos
5.
Nat Commun ; 8(1): 1326, 2017 11 06.
Artículo en Inglés | MEDLINE | ID: mdl-29109544

RESUMEN

Despite improvements in genomics technology, the detection of structural variants (SVs) from short-read sequencing still poses challenges, particularly for complex variation. Here we analyse the genomes of two patients with congenital abnormalities using the MinION nanopore sequencer and a novel computational pipeline-NanoSV. We demonstrate that nanopore long reads are superior to short reads with regard to detection of de novo chromothripsis rearrangements. The long reads also enable efficient phasing of genetic variations, which we leveraged to determine the parental origin of all de novo chromothripsis breakpoints and to resolve the structure of these complex rearrangements. Additionally, genome-wide surveillance of inherited SVs reveals novel variants, missed in short-read data sets, a large proportion of which are retrotransposon insertions. We provide a first exploration of patient genome sequencing with a nanopore sequencer and demonstrate the value of long-read sequencing in mapping and phasing of SVs for both clinical and research applications.


Asunto(s)
Mapeo Cromosómico/métodos , Cromotripsis , Análisis Mutacional de ADN/métodos , Nanoporos , Anomalías Múltiples/genética , Algoritmos , Mapeo Cromosómico/estadística & datos numéricos , Biología Computacional , Análisis Mutacional de ADN/estadística & datos numéricos , Reordenamiento Génico , Variación Genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/estadística & datos numéricos , Humanos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA