RESUMO
Circular extrachromosomal DNA (ecDNA) is a form of oncogene amplification found across cancer types and associated with poor outcome in patients. ecDNA can be structurally complex and can contain rearranged DNA sequences derived from multiple chromosome locations. As the structure of ecDNA can impact oncogene regulation and may indicate mechanisms of its formation, disentangling it at high resolution from sequencing data is essential. Even though methods have been developed to identify and reconstruct ecDNA in cancer genome sequencing, it remains challenging to resolve complex ecDNA structures, in particular amplicons with shared genomic footprints. We here introduce Decoil, a computational method that combines a breakpoint-graph approach with LASSO regression to reconstruct complex ecDNA and deconvolve co-occurring ecDNA elements with overlapping genomic footprints from long-read nanopore sequencing. Decoil outperforms de novo assembly and alignment-based methods in simulated long-read sequencing data for both simple and complex ecDNAs. Applying Decoil on whole-genome sequencing data uncovered different ecDNA topologies and explored ecDNA structure heterogeneity in neuroblastoma tumors and cell lines, indicating that this method may improve ecDNA structural analyses in cancer.
Assuntos
DNA Circular , Humanos , DNA Circular/genética , Análise de Sequência de DNA/métodos , Sequenciamento por Nanoporos/métodos , Software , Biologia Computacional/métodos , Neuroblastoma/genética , Linhagem Celular Tumoral , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Genoma HumanoRESUMO
DNA amplifications in cancer do not only harbor oncogenes. We sought to determine whether passenger coamplifications could create collateral therapeutic vulnerabilities. Through an analysis of >3,000 cancer genomes followed by the interrogation of CRISPR-Cas9 loss-of-function screens across >700 cancer cell lines, we determined that passenger coamplifications are accompanied by distinct dependency profiles. In a proof-of-principle study, we demonstrate that the coamplification of the bona fide passenger gene DEAD-Box Helicase 1 (DDX1) creates an increased dependency on the mTOR pathway. Interaction proteomics identified tricarboxylic acid (TCA) cycle components as previously unrecognized DDX1 interaction partners. Live-cell metabolomics highlighted that this interaction could impair TCA activity, which in turn resulted in enhanced mTORC1 activity. Consequently, genetic and pharmacologic disruption of mTORC1 resulted in pronounced cell death in vitro and in vivo. Thus, structurally linked coamplification of a passenger gene and an oncogene can result in collateral vulnerabilities. SIGNIFICANCE: We demonstrate that coamplification of passenger genes, which were largely neglected in cancer biology in the past, can create distinct cancer dependencies. Because passenger coamplifications are frequent in cancer, this principle has the potential to expand target discovery in oncology. This article is featured in Selected Articles from This Issue, p. 384.
Assuntos
Neoplasias , Oncogenes , Humanos , Neoplasias/genética , Oncologia , Morte Celular , Alvo Mecanístico do Complexo 1 de Rapamicina/genéticaRESUMO
Extrachromosomal DNAs (ecDNAs) are common in cancer, but many questions about their origin, structural dynamics and impact on intratumor heterogeneity are still unresolved. Here we describe single-cell extrachromosomal circular DNA and transcriptome sequencing (scEC&T-seq), a method for parallel sequencing of circular DNAs and full-length mRNA from single cells. By applying scEC&T-seq to cancer cells, we describe intercellular differences in ecDNA content while investigating their structural heterogeneity and transcriptional impact. Oncogene-containing ecDNAs were clonally present in cancer cells and drove intercellular oncogene expression differences. In contrast, other small circular DNAs were exclusive to individual cells, indicating differences in their selection and propagation. Intercellular differences in ecDNA structure pointed to circular recombination as a mechanism of ecDNA evolution. These results demonstrate scEC&T-seq as an approach to systematically characterize both small and large circular DNA in cancer cells, which will facilitate the analysis of these DNA elements in cancer and beyond.
Assuntos
Neoplasias , Transcriptoma , Humanos , Transcriptoma/genética , DNA , Neoplasias/genética , Oncogenes , DNA Circular/genéticaRESUMO
Cancer genomes harbor a broad spectrum of structural variants (SVs) driving tumorigenesis, a relevant subset of which escape discovery using short-read sequencing. We employed Oxford Nanopore Technologies (ONT) long-read sequencing in a paired diagnostic and post-therapy medulloblastoma to unravel the haplotype-resolved somatic genetic and epigenetic landscape. We assembled complex rearrangements, including a 1.55-Mbp chromothripsis event, and we uncover a complex SV pattern termed templated insertion (TI) thread, characterized by short (mostly <1 kb) insertions showing prevalent self-concatenation into highly amplified structures of up to 50 kbp in size. TI threads occur in 3% of cancers, with a prevalence up to 74% in liposarcoma, and frequent colocalization with chromothripsis. We also perform long-read-based methylome profiling and discover allele-specific methylation (ASM) effects, complex rearrangements exhibiting differential methylation, and differential promoter methylation in cancer-driver genes. Our study shows the advantage of long-read sequencing in the discovery and characterization of complex somatic rearrangements.
RESUMO
Instability of simple DNA repeats has been known as a common cause of hereditary ataxias for over 20 years. Routine genetic diagnostics of these phenotypically similar diseases still rely on an iterative workflow for quantification of repeat units by PCR-based methods of limited precision. We established and validated clinical nanopore Cas9-targeted sequencing, an amplification-free method for simultaneous analysis of 10 repeat loci associated with clinically overlapping hereditary ataxias. The method combines target enrichment by CRISPR-Cas9, Oxford Nanopore long-read sequencing and a bioinformatics pipeline using the tools STRique and Megalodon for parallel detection of length, sequence, methylation and composition of the repeat loci. Clinical nanopore Cas9-targeted sequencing allowed for the precise and parallel analysis of 10 repeat loci associated with adult-onset ataxia and revealed additional parameter such as FMR1 promotor methylation and repeat sequence required for diagnosis at the same time. Using clinical nanopore Cas9-targeted sequencing we analysed 100 clinical samples of undiagnosed ataxia patients and identified causative repeat expansions in 28 patients. Parallel repeat analysis enabled a molecular diagnosis of ataxias independent of preconceptions on the basis of clinical presentation. Biallelic expansions within RFC1 were identified as the most frequent cause of ataxia. We characterized the RFC1 repeat composition of all patients and identified a novel repeat motif, AGGGG. Our results highlight the power of clinical nanopore Cas9-targeted sequencing as a readily expandable workflow for the in-depth analysis and diagnosis of phenotypically overlapping repeat expansion disorders.
Assuntos
Ataxia Cerebelar , Degenerações Espinocerebelares , Adulto , Humanos , Ataxia/genética , Ataxia Cerebelar/genética , Biologia Computacional , Sequenciamento de Nucleotídeos em Larga Escala , Proteína do X Frágil da Deficiência IntelectualRESUMO
The CORUM database has been providing comprehensive reference information about experimentally characterized, mammalian protein complexes and their associated biological and biomedical properties since 2007. Given that most catalytic and regulatory functions of the cell are carried out by protein complexes, their composition and characterization is of greatest importance in basic and disease biology. The new CORUM 4.0 release encompasses 5204 protein complexes offering the largest and most comprehensive publicly available dataset of manually curated mammalian protein complexes. The CORUM dataset is built from 5299 different genes, representing 26% of the protein coding genes in humans. Complex information from 3354 scientific articles is mainly obtained from human (70%), mouse (16%) and rat (9%) cells and tissues. Recent curation work includes sets of protein complexes, Functional Complex Groups, that offer comprehensive collections of published data in specific biological processes and molecular functions. In addition, a new graphical analysis tool was implemented that displays co-expression data from the subunits of protein complexes. CORUM is freely accessible at http://mips.helmholtz-muenchen.de/corum/.
Assuntos
Bases de Dados de Proteínas , Complexos Multiproteicos , Animais , Humanos , Camundongos , Ratos , Bases de Dados Factuais , Mamíferos , Complexos Multiproteicos/químicaRESUMO
SMIP004-7 is a small molecule inhibitor of mitochondrial respiration with selective in vivo anti-cancer activity through an as-yet unknown molecular target. We demonstrate here that SMIP004-7 targets drug-resistant cancer cells with stem-like features by inhibiting mitochondrial respiration complex I (NADH:ubiquinone oxidoreductase, complex I [CI]). Instead of affecting the quinone-binding site targeted by most CI inhibitors, SMIP004-7 and its cytochrome P450-dependent activated metabolite(s) have an uncompetitive mechanism of inhibition involving a distinct N-terminal region of catalytic subunit NDUFS2 that leads to rapid disassembly of CI. SMIP004-7 and an improved chemical analog selectively engage NDUFS2 in vivo to inhibit the growth of triple-negative breast cancer transplants, a response mediated at least in part by boosting CD4+ and CD8+ T cell-mediated immune surveillance. Thus, SMIP004-7 defines an emerging class of ubiquinone uncompetitive CI inhibitors for cell autonomous and microenvironmental metabolic targeting of mitochondrial respiration in cancer.
Assuntos
Neoplasias , Ubiquinona , Complexo I de Transporte de Elétrons/metabolismo , Mitocôndrias/metabolismo , Neoplasias/metabolismo , Ubiquinona/metabolismo , Ubiquinona/farmacologiaRESUMO
CORUM is a database that provides a manually curated repository of experimentally characterized protein complexes from mammalian organisms, mainly human (67%), mouse (15%) and rat (10%). Given the vital functions of these macromolecular machines, their identification and functional characterization is foundational to our understanding of normal and disease biology. The new CORUM 3.0 release encompasses 4274 protein complexes offering the largest and most comprehensive publicly available dataset of mammalian protein complexes. The CORUM dataset is built from 4473 different genes, representing 22% of the protein coding genes in humans. Protein complexes are described by a protein complex name, subunit composition, cellular functions as well as the literature references. Information about stoichiometry of subunits depends on availability of experimental data. Recent developments include a graphical tool displaying known interactions between subunits. This allows the prediction of structural interconnections within protein complexes of unknown structure. In addition, we present a set of 58 protein complexes with alternatively spliced subunits. Those were found to affect cellular functions such as regulation of apoptotic activity, protein complex assembly or define cellular localization. CORUM is freely accessible at http://mips.helmholtz-muenchen.de/corum/.