RESUMEN
Most mutations in cancer genomes are thought to be acquired after the initiating event, which may cause genomic instability and drive clonal evolution. However, for acute myeloid leukemia (AML), normal karyotypes are common, and genomic instability is unusual. To better understand clonal evolution in AML, we sequenced the genomes of M3-AML samples with a known initiating event (PML-RARA) versus the genomes of normal karyotype M1-AML samples and the exomes of hematopoietic stem/progenitor cells (HSPCs) from healthy people. Collectively, the data suggest that most of the mutations found in AML genomes are actually random events that occurred in HSPCs before they acquired the initiating mutation; the mutational history of that cell is "captured" as the clone expands. In many cases, only one or two additional, cooperating mutations are needed to generate the malignant founding clone. Cells from the founding clone can acquire additional cooperating mutations, yielding subclones that can contribute to disease progression and/or relapse.
Asunto(s)
Evolución Clonal , Leucemia Mieloide Aguda/genética , Mutación , Adulto , Anciano , Análisis Mutacional de ADN , Progresión de la Enfermedad , Femenino , Estudio de Asociación del Genoma Completo , Células Madre Hematopoyéticas/metabolismo , Humanos , Leucemia Mieloide Aguda/fisiopatología , Masculino , Persona de Mediana Edad , Proteínas de Fusión Oncogénica/genética , Recurrencia , Piel/metabolismo , Adulto JovenRESUMEN
The Drug-Gene Interaction Database (DGIdb, www.dgidb.org) is a web resource that consolidates disparate data sources describing drug-gene interactions and gene druggability. It provides an intuitive graphical user interface and a documented application programming interface (API) for querying these data. DGIdb was assembled through an extensive manual curation effort, reflecting the combined information of twenty-seven sources. For DGIdb 2.0, substantial updates have been made to increase content and improve its usefulness as a resource for mining clinically actionable drug targets. Specifically, nine new sources of drug-gene interactions have been added, including seven resources specifically focused on interactions linked to clinical trials. These additions have more than doubled the overall count of drug-gene interactions. The total number of druggable gene claims has also increased by 30%. Importantly, a majority of the unrestricted, publicly-accessible sources used in DGIdb are now automatically updated on a weekly basis, providing the most current information for these sources. Finally, a new web view and API have been developed to allow searching for interactions by drug identifiers to complement existing gene-based search functionality. With these updates, DGIdb represents a comprehensive and user friendly tool for mining the druggable genome for precision medicine hypothesis generation.
Asunto(s)
Bases de Datos Farmacéuticas , Descubrimiento de Drogas , Genes/efectos de los fármacos , Minería de Datos , LigandosRESUMEN
The Drug-Gene Interaction database (DGIdb) mines existing resources that generate hypotheses about how mutated genes might be targeted therapeutically or prioritized for drug development. It provides an interface for searching lists of genes against a compendium of drug-gene interactions and potentially 'druggable' genes. DGIdb can be accessed at http://dgidb.org/.
Asunto(s)
Minería de Datos/métodos , Bases de Datos Genéticas , Descubrimiento de Drogas/métodos , Antineoplásicos/química , Neoplasias de la Mama/tratamiento farmacológico , Neoplasias de la Mama/genética , Biología Computacional/métodos , Interacciones Farmacológicas , Regulación de la Expresión Génica/efectos de los fármacos , Variación Genética , Genoma , Genómica/métodos , Humanos , Neoplasias Pulmonares/tratamiento farmacológico , Neoplasias Pulmonares/genética , Mutación , Programas Informáticos , Tecnología Farmacéutica/métodosRESUMEN
Massively parallel RNA sequencing (RNA-seq) has rapidly become the assay of choice for interrogating RNA transcript abundance and diversity. This article provides a detailed introduction to fundamental RNA-seq molecular biology and informatics concepts. We make available open-access RNA-seq tutorials that cover cloud computing, tool installation, relevant file formats, reference genomes, transcriptome annotations, quality-control strategies, expression, differential expression, and alternative splicing analysis methods. These tutorials and additional training resources are accompanied by complete analysis pipelines and test datasets made available without encumbrance at www.rnaseq.wiki.
Asunto(s)
Biología Computacional/métodos , Internet , ARN , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Perfilación de la Expresión Génica , Humanos , ARN/análisis , ARN/química , ARN/genética , ARN/aislamiento & purificaciónRESUMEN
Background: Neoantigen targeting therapies including personalized vaccines have shown promise in the treatment of cancers, particularly when used in combination with checkpoint blockade therapy. At least 100 clinical trials involving these therapies are underway globally. Accurate identification and prioritization of neoantigens is highly relevant to designing these trials, predicting treatment response, and understanding mechanisms of resistance. With the advent of massively parallel DNA and RNA sequencing technologies, it is now possible to computationally predict neoantigens based on patient-specific variant information. However, numerous factors must be considered when prioritizing neoantigens for use in personalized therapies. Complexities such as alternative transcript annotations, various binding, presentation and immunogenicity prediction algorithms, and variable peptide lengths/registers all potentially impact the neoantigen selection process. There has been a rapid development of computational tools that attempt to account for these complexities. While these tools generate numerous algorithmic predictions for neoantigen characterization, results from these pipelines are difficult to navigate and require extensive knowledge of the underlying tools for accurate interpretation. This often leads to over-simplification of pipeline outputs to make them tractable, for example limiting prediction to a single RNA isoform or only summarizing the top ranked of many possible peptide candidates. In addition to variant detection, gene expression and predicted peptide binding affinities, recent studies have also demonstrated the importance of mutation location, allele-specific anchor locations, and variation of T-cell response to long versus short peptides. Due to the intricate nature and number of salient neoantigen features, presenting all relevant information to facilitate candidate selection for downstream applications is a difficult challenge that current tools fail to address. Results: We have created pVACview, the first interactive tool designed to aid in the prioritization and selection of neoantigen candidates for personalized neoantigen therapies including cancer vaccines. pVACview has a user-friendly and intuitive interface where users can upload, explore, select and export their neoantigen candidates. The tool allows users to visualize candidates across three different levels, including variant, transcript and peptide information. Conclusions: pVACview will allow researchers to analyze and prioritize neoantigen candidates with greater efficiency and accuracy in basic and translational settings The application is available as part of the pVACtools pipeline at pvactools.org and as an online server at pvacview.org.
RESUMEN
ABSTRACT: Personalized cancer vaccines designed to target neoantigens represent a promising new treatment paradigm in oncology. In contrast to classical idiotype vaccines, we hypothesized that "polyvalent" vaccines could be engineered for the personalized treatment of follicular lymphoma (FL) using neoantigen discovery by combined whole-exome sequencing (WES) and RNA sequencing (RNA-seq). Fifty-eight tumor samples from 57 patients with FL underwent WES and RNA-seq. Somatic and B-cell clonotype neoantigens were predicted and filtered to identify high-quality neoantigens. B-cell clonality was determined by the alignment of B-cell receptor (BCR) CDR3 regions from RNA-seq data, grouping at the protein level, and comparison with the BCR repertoire from healthy individuals using RNA-seq data. An average of 52 somatic mutations per patient (range, 2-172) were identified, and ≥2 (median, 15) high-quality neoantigens were predicted for 56 of 58 FL samples. The predicted neoantigen peptides were composed of missense mutations (77%), indels (9%), gene fusions (3%), and BCR sequences (11%). Building off of these preclinical analyses, we initiated a pilot clinical trial using personalized neoantigen vaccination combined with PD-1 blockade in patients with relapsed or refractory FL (#NCT03121677). Synthetic long peptide vaccines targeting predicted high-quality neoantigens were successfully synthesized for and administered to all 4 patients enrolled. Initial results demonstrate feasibility, safety, and potential immunologic and clinical responses. Our study suggests that a genomics-driven personalized cancer vaccine strategy is feasible for patients with FL, and this may overcome prior challenges in the field. This trial was registered at www.ClinicalTrials.gov as #NCT03121677.
Asunto(s)
Antígenos de Neoplasias , Vacunas contra el Cáncer , Linfoma Folicular , Medicina de Precisión , Humanos , Linfoma Folicular/terapia , Linfoma Folicular/inmunología , Linfoma Folicular/genética , Vacunas contra el Cáncer/inmunología , Vacunas contra el Cáncer/uso terapéutico , Antígenos de Neoplasias/inmunología , Medicina de Precisión/métodos , Persona de Mediana Edad , Femenino , Masculino , Anciano , Adulto , Secuenciación del Exoma , MutaciónRESUMEN
Aggregative multicellularity relies on cooperation among formerly independent cells to form a multicellular body. Previous work with Dictyostelium discoideum showed that experimental evolution under low relatedness profoundly decreased cooperation, as indicated by the loss of fruiting body formation in many clones and an increase of cheaters that contribute proportionally more to spores than to the dead stalk. Using whole-genome sequencing and variant analysis of these lines, we identified 38 single nucleotide polymorphisms in 29 genes. Each gene had 1 variant except for grlG (encoding a G protein-coupled receptor), which had 10 unique SNPs and 5 structural variants. Variants in the 5' half of grlG-the region encoding the signal peptide and the extracellular binding domain-were significantly associated with the loss of fruiting body formation; the association was not significant in the 3' half of the gene. These results suggest that the loss of grlG was adaptive under low relatedness and that at least the 5' half of the gene is important for cooperation and multicellular development. This is surprising given some previous evidence that grlG encodes a folate receptor involved in predation, which occurs only during the single-celled stage. However, non-fruiting mutants showed little increase in a parallel evolution experiment where the multicellular stage was prevented from happening. This shows that non-fruiting mutants are not generally selected by any predation advantage but rather by something-likely cheating-during the multicellular stage.
Asunto(s)
Amoeba , Dictyostelium , Evolución Biológica , Dictyostelium/genética , ReproducciónRESUMEN
The U2AF1 gene is a core part of mRNA splicing machinery and frequently contains somatic mutations that contribute to oncogenesis in myelodysplastic syndrome, acute myeloid leukemia, and other cancers. A change introduced in the GRCh38 version of the human reference build prevents detection of mutations in this gene, and others, by variant calling pipelines. This study describes the problem in detail and shows that a modified GRCh38 reference build with unchanged coordinates can be used to ameliorate the issue.
Asunto(s)
Leucemia Mieloide Aguda , Humanos , Leucemia Mieloide Aguda/genética , Mutación , Factor de Empalme U2AF/genéticaRESUMEN
Osteosarcoma is a rare disease in children but is one of the most common cancers in adult large breed dogs. The mutational landscape of both the primary and pulmonary metastatic tumor in two dogs with appendicular osteosarcoma (OSA) was comprehensively evaluated using an automated whole genome sequencing, exome and RNA-seq pipeline that was adapted for this study for use in dogs. Chromosomal lesions were the most common type of mutation. The mutational landscape varied substantially between dogs but the lesions within the same patient were similar. Copy number neutral loss of heterozygosity in mutant TP53 was the most significant driver mutation and involved a large region in the middle of chromosome 5. Canine and human OSA is characterized by loss of cell cycle checkpoint integrity and DNA damage response pathways. Mutational profiling of individual patients with canine OSA would be recommended prior to targeted therapy, given the heterogeneity seen in our study and previous studies.
Asunto(s)
Neoplasias Óseas , Enfermedades de los Perros/genética , Neoplasias Primarias Secundarias/veterinaria , Osteosarcoma , Animales , Neoplasias Óseas/genética , Neoplasias Óseas/veterinaria , Perros , Genes p53/genética , Masculino , Mutación , Neoplasias Primarias Secundarias/genética , Osteosarcoma/genética , Osteosarcoma/veterinariaRESUMEN
Bam-readcount is a utility for generating low-level information about sequencing data at specific nucleotide positions. Originally designed to help filter genomic mutation calls, the metrics it outputs are useful as input for variant detection tools and for resolving ambiguity between variant callers1,2. In addition, it has found broad applicability in diverse fields including tumor evolution, single-cell genomics, climate change ecology, and tracking community spread of SARS-CoV-2.3-6.
RESUMEN
Tumor heterogeneity and evolution drive treatment resistance in metastatic colorectal cancer (mCRC). Patient-derived xenografts (PDXs) can model mCRC biology; however, their ability to accurately mimic human tumor heterogeneity is unclear. Current genomic studies in mCRC have limited scope and lack matched PDXs. Therefore, the landscape of tumor heterogeneity and its impact on the evolution of metastasis and PDXs remain undefined. We performed whole-genome, deep exome, and targeted validation sequencing of multiple primary regions, matched distant metastases, and PDXs from 11 patients with mCRC. We observed intricate clonal heterogeneity and evolution affecting metastasis dissemination and PDX clonal selection. Metastasis formation followed both monoclonal and polyclonal seeding models. In four cases, metastasis-seeding clones were not identified in any primary region, consistent with a metastasis-seeding-metastasis model. PDXs underrepresented the subclonal heterogeneity of parental tumors. These suggest that single sample tumor sequencing and current PDX models may be insufficient to guide precision medicine.
Asunto(s)
Evolución Clonal , Neoplasias del Colon , Animales , Evolución Clonal/genética , Neoplasias del Colon/genética , Modelos Animales de Enfermedad , Exoma/genética , Genómica , Humanos , Metástasis de la Neoplasia , Secuenciación del ExomaRESUMEN
To detect diverse and novel RNA species comprehensively, we compared deep small RNA and RNA sequencing (RNA-seq) methods applied to a primary acute myeloid leukemia (AML) sample. We were able to discover previously unannotated small RNAs using deep sequencing of a library method using broader insert size selection. We analyzed the long noncoding RNA (lncRNA) landscape in AML by comparing deep sequencing from multiple RNA-seq library construction methods for the sample that we studied and then integrating RNA-seq data from 179 AML cases. This identified lncRNAs that are completely novel, differentially expressed, and associated with specific AML subtypes. Our study revealed the complexity of the noncoding RNA transcriptome through a combined strategy of strand-specific small RNA and total RNA-seq. This dataset will serve as an invaluable resource for future RNA-based analyses.
Asunto(s)
Regulación Leucémica de la Expresión Génica , Leucemia Mieloide/genética , ARN Largo no Codificante/genética , Transcriptoma/genética , Enfermedad Aguda , Secuencia de Bases , Análisis por Conglomerados , Perfilación de la Expresión Génica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , HumanosRESUMEN
The genomic events responsible for the pathogenesis of relapsed adult B-lymphoblastic leukemia (B-ALL) are not yet clear. We performed integrative analysis of whole-genome, whole-exome, custom capture, whole-transcriptome (RNA-seq), and locus-specific genomic assays across nine time points from a patient with primary de novo B-ALL. Comprehensive genome and transcriptome characterization revealed a dramatic tumor evolution during progression, yielding a tumor with complex clonal architecture at second relapse. We observed and validated point mutations in EP300 and NF1, a highly expressed EP300-ZNF384 gene fusion, a microdeletion in IKZF1, a focal deletion affecting SETD2, and large deletions affecting RB1, PAX5, NF1, and ETV6. Although the genome analysis revealed events of potential biological relevance, no clinically actionable treatment options were evident at the time of the second relapse. However, transcriptome analysis identified aberrant overexpression of the targetable protein kinase encoded by the FLT3 gene. Although the patient had refractory disease after salvage therapy for the second relapse, treatment with the FLT3 inhibitor sunitinib rapidly induced a near complete molecular response, permitting the patient to proceed to a matched-unrelated donor stem cell transplantation. The patient remains in complete remission more than 4 years later. Analysis of this patient's relapse genome revealed an unexpected, actionable therapeutic target that led to a specific therapy associated with a rapid clinical response. For some patients with relapsed or refractory cancers, this approach may indicate a novel therapeutic intervention that could alter outcome.
Asunto(s)
Genómica , Leucemia-Linfoma Linfoblástico de Células Precursoras B/genética , Leucemia-Linfoma Linfoblástico de Células Precursoras B/terapia , Activación Transcripcional , Tirosina Quinasa 3 Similar a fms/genética , Adulto , Protocolos de Quimioterapia Combinada Antineoplásica/uso terapéutico , Biopsia , Médula Ósea/patología , Trasplante de Médula Ósea , Ciclofosfamida/uso terapéutico , Análisis Citogenético , Dexametasona/uso terapéutico , Doxorrubicina/uso terapéutico , Citometría de Flujo , Perfilación de la Expresión Génica , Variación Genética , Genómica/métodos , Enfermedad Injerto contra Huésped/tratamiento farmacológico , Enfermedad Injerto contra Huésped/etiología , Humanos , Masculino , Leucemia-Linfoma Linfoblástico de Células Precursoras B/diagnóstico , Recurrencia , Trasplante Homólogo , Vincristina/uso terapéuticoRESUMEN
Tumors are typically sequenced to depths of 75-100× (exome) or 30-50× (whole genome). We demonstrate that current sequencing paradigms are inadequate for tumors that are impure, aneuploid or clonally heterogeneous. To reassess optimal sequencing strategies, we performed ultra-deep (up to ~312×) whole genome sequencing (WGS) and exome capture (up to ~433×) of a primary acute myeloid leukemia, its subsequent relapse, and a matched normal skin sample. We tested multiple alignment and variant calling algorithms and validated ~200,000 putative SNVs by sequencing them to depths of ~1,000×. Additional targeted sequencing provided over 10,000× coverage and ddPCR assays provided up to ~250,000× sampling of selected sites. We evaluated the effects of different library generation approaches, depth of sequencing, and analysis strategies on the ability to effectively characterize a complex tumor. This dataset, representing the most comprehensively sequenced tumor described to date, will serve as an invaluable community resource (dbGaP accession id phs000159).