ABSTRACT
The development of targeted anti-cancer therapies through the study of cancer genomes is intended to increase survival rates and decrease treatment-related toxicity. We treated a transposon-driven, functional genomic mouse model of medulloblastoma with 'humanized' in vivo therapy (microneurosurgical tumour resection followed by multi-fractionated, image-guided radiotherapy). Genetic events in recurrent murine medulloblastoma exhibit a very poor overlap with those in matched murine diagnostic samples (<5%). Whole-genome sequencing of 33 pairs of human diagnostic and post-therapy medulloblastomas demonstrated substantial genetic divergence of the dominant clone after therapy (<12% of diagnostic events were retained at recurrence). In both mice and humans, the dominant clone at recurrence arose through clonal selection of a pre-existing minor clone present at diagnosis. Targeted therapy is unlikely to be effective in the absence of the target; therefore, our results offer a simple, proximal, and remediable explanation for the failure of prior clinical trials of targeted therapy.
Subjects
Cerebellar Neoplasms/therapy , Clone Cells/drug effects , Clone Cells/metabolism , Medulloblastoma/therapy , Local Neoplasm Recurrence/genetics , Local Neoplasm Recurrence/pathology , Genetic Selection/drug effects , Animals , Cerebellar Neoplasms/genetics , Cerebellar Neoplasms/pathology , Cerebellar Neoplasms/radiotherapy , Cerebellar Neoplasms/surgery , Clone Cells/pathology , Craniospinal Irradiation , DNA Mutational Analysis , Animal Disease Models , Drosophila melanogaster/cytology , Drosophila melanogaster/genetics , Female , Human Genome/genetics , Humans , Male , Medulloblastoma/genetics , Medulloblastoma/pathology , Medulloblastoma/radiotherapy , Medulloblastoma/surgery , Mice , Molecular Targeted Therapy/methods , Local Neoplasm Recurrence/therapy , Image-Guided Radiotherapy , Signal Transduction , Xenograft Model Antitumor Assays
ABSTRACT
BACKGROUND: Many mutations that contribute to the pathogenesis of acute myeloid leukemia (AML) are undefined. The relationships between patterns of mutations and epigenetic phenotypes are not yet clear. METHODS: We analyzed the genomes of 200 clinically annotated adult cases of de novo AML, using either whole-genome sequencing (50 cases) or whole-exome sequencing (150 cases), along with RNA and microRNA sequencing and DNA-methylation analysis. RESULTS: AML genomes have fewer mutations than most other adult cancers, with an average of only 13 mutations found in genes. Of these, an average of 5 are in genes that are recurrently mutated in AML. A total of 23 genes were significantly mutated, and another 237 were mutated in two or more samples. Nearly all samples had at least 1 nonsynonymous mutation in one of nine categories of genes that are almost certainly relevant for pathogenesis, including transcription-factor fusions (18% of cases), the gene encoding nucleophosmin (NPM1) (27%), tumor-suppressor genes (16%), DNA-methylation-related genes (44%), signaling genes (59%), chromatin-modifying genes (30%), myeloid transcription-factor genes (22%), cohesin-complex genes (13%), and spliceosome-complex genes (14%). Patterns of cooperation and mutual exclusivity suggested strong biologic relationships among several of the genes and categories. CONCLUSIONS: We identified at least one potential driver mutation in nearly all AML samples and found that a complex interplay of genetic events contributes to AML pathogenesis in individual patients. The databases from this study are widely available to serve as a foundation for further investigations of AML pathogenesis, classification, and risk stratification. (Funded by the National Institutes of Health.)
Subjects
Acute Myeloid Leukemia/genetics , Mutation , Adult , CpG Islands , DNA Methylation , Epigenomics , Female , Gene Expression , Gene Fusion , Human Genome , Humans , Acute Myeloid Leukemia/classification , Male , MicroRNAs/genetics , Middle Aged , Nucleophosmin , DNA Sequence Analysis/methods
ABSTRACT
In alternative expression analysis by sequencing (ALEXA-seq), we developed a method to analyze massively parallel RNA sequence data to catalog transcripts and assess differential and alternative expression of known and predicted mRNA isoforms in cells and tissues. As proof of principle, we used the approach to compare fluorouracil-resistant and -nonresistant human colorectal cancer cell lines. We assessed the sensitivity and specificity of the approach by comparison to exon tiling and splicing microarrays and validated the results with reverse transcription-PCR, quantitative PCR and Sanger sequencing. We observed global disruption of splicing in fluorouracil-resistant cells characterized by expression of new mRNA isoforms resulting from exon skipping, alternative splice site usage and intron retention. Alternative expression annotation databases, source code, a data viewer and other resources to facilitate analysis are available at http://www.alexaplatform.org/alexa_seq/.
Subjects
Alternative Splicing , Messenger RNA/genetics , RNA Sequence Analysis/methods , Antineoplastic Antimetabolites/pharmacology , Tumor Cell Line , Colorectal Neoplasms/drug therapy , Colorectal Neoplasms/genetics , Colorectal Neoplasms/pathology , Genetic Databases , Neoplasm Drug Resistance/genetics , Expressed Sequence Tags , Fluorouracil/pharmacology , Gene Expression/drug effects , Gene Expression Profiling , Humans , Oligonucleotide Array Sequence Analysis , Protein Isoforms , Reverse Transcriptase Polymerase Chain Reaction , Sequence Alignment
ABSTRACT
BACKGROUND: To support the implementation of high-throughput pipelines suitable for SARS-CoV-2 sequencing and analysis in a clinical laboratory, we developed an automated sample preparation and analysis workflow. METHODS: We used the established ARTIC protocol with approximately 400 bp amplicons sequenced on Oxford Nanopore's MinION. Sequences were analyzed using Nextclade, assigning both a clade and quality score to each sample. RESULTS: A total of 2179 samples on twenty-five 96-well plates were sequenced. Plates of purified RNA were processed within 12 h, sequencing required up to 24 h, and analysis of each pooled plate required 1 h. The use of samples with known threshold cycle (Ct) values enabled normalization, acted as a quality control check, and revealed a strong correlation between sample Ct values and successful analysis, with 85% of samples with Ct < 30 achieving a "good" Nextclade score. Less abundant samples responded to enrichment with the fraction of Ct > 30 samples achieving a "good" classification rising by 60% after addition of a post-ARTIC PCR normalization. Serial dilutions of 3 variant of concern samples, diluted from approximately Ct = 16 to approximately Ct = 50, demonstrated successful sequencing to Ct = 37. The sample set contained a median of 24 mutations per sample and a total of 1281 unique mutations with reduced sequence read coverage noted in some regions of some samples. A total of 10 separate strains were observed in the sample set, including 3 variants of concern prevalent in British Columbia in the spring of 2021. CONCLUSIONS: We demonstrated a robust automated sequencing pipeline that takes advantage of input Ct values to improve reliability.
Subjects
COVID-19 , Nanopore Sequencing , Nanopores , COVID-19/diagnosis , COVID-19/epidemiology , Humans , Reproducibility of Results , SARS-CoV-2/genetics
ABSTRACT
BACKGROUND: Genomic deletions and duplications are important in the pathogenesis of diseases, such as cancer and mental retardation, and have recently been shown to occur frequently in unaffected individuals as polymorphisms. Affymetrix GeneChip whole genome sampling analysis (WGSA) combined with 100 K single nucleotide polymorphism (SNP) genotyping arrays is one of several microarray-based approaches that are now being used to detect such structural genomic changes. The popularity of this technology and its associated open source data format have resulted in the development of an increasing number of software packages for the analysis of copy number changes using these SNP arrays. RESULTS: We evaluated four publicly available software packages for high-throughput copy number analysis using synthetic and empirical 100 K SNP array data sets, the latter obtained from 107 mental retardation (MR) patients and their unaffected parents and siblings. We evaluated the software with regard to overall suitability for high-throughput 100 K SNP array data analysis, effectiveness of normalization, scaling with various reference sets and feature extraction, and true and false positive rates of genomic copy number variant (CNV) detection. CONCLUSION: We observed considerable variation among the numbers and types of candidate CNVs detected by different analysis approaches, and found that multiple programs were needed to find all real aberrations in our test set. The frequency of false positive deletions was substantial, but could be greatly reduced by using the SNP genotype information to confirm loss of heterozygosity.
Subjects
Algorithms , Gene Dosage/genetics , Genetic Variation/genetics , Genomics/standards , Oligonucleotide Array Sequence Analysis/standards , Software Validation , Adult , Child , Human Genome/genetics , Genomics/methods , Humans , Oligonucleotide Array Sequence Analysis/methods
ABSTRACT
The northern sea otter inhabits coastal waters of the northern Pacific Ocean and is the largest member of the Mustelidae family. DNA sequencing methods that utilize microfluidic partitioned and non-partitioned library construction were used to establish the sea otter genome. The final assembly provided 2.426 Gbp of highly contiguous assembled genomic sequences with a scaffold N50 length of over 38 Mbp. We generated transcriptome data derived from a lymphoma to aid in the determination of functional elements. The assembled genome sequence and underlying sequence data are available at the National Center for Biotechnology Information (NCBI) under the BioProject accession number PRJNA388419.
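For readers unfamiliar with the contiguity statistic quoted above, the scaffold N50 is the length L such that scaffolds of length L or longer together contain at least half of the total assembled sequence. The following is a minimal sketch of that standard definition only; the function name `scaffold_n50` and the toy lengths are illustrative assumptions and are not taken from the study's data or pipeline.

```python
def scaffold_n50(lengths):
    """Return the N50 of a collection of scaffold lengths.

    N50 is the length of the scaffold at which the running total,
    taken from the longest scaffold downwards, first reaches half
    of the summed length of all scaffolds.
    """
    if not lengths:
        raise ValueError("no scaffold lengths given")
    ordered = sorted(lengths, reverse=True)  # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy assembly (arbitrary units), not real data.
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
print(scaffold_n50(toy_lengths))
```

On the toy input the total is 300, so the running sum reaches the halfway mark of 150 at the second-longest scaffold.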
ABSTRACT
The beluga whale is a cetacean that inhabits arctic and subarctic regions, and is the only living member of the genus Delphinapterus. The genome of the beluga whale was determined using DNA sequencing approaches that employed both microfluidic partitioning library and non-partitioned library construction. The former allowed for the construction of a highly contiguous assembly with a scaffold N50 length of over 19 Mbp and total reconstruction of 2.32 Gbp. To aid our understanding of the functional elements, transcriptome data was also derived from brain, duodenum, heart, lung, spleen, and liver tissue. Assembled sequence and all of the underlying sequence data are available at the National Center for Biotechnology Information (NCBI) under the Bioproject accession number PRJNA360851A.
ABSTRACT
Spatial heterogeneity of transcriptional and genetic markers between physically isolated biopsies of a single tumor poses major barriers to the identification of biomarkers and the development of targeted therapies that will be effective against the entire tumor. We analyzed the spatial heterogeneity of multiregional biopsies from 35 patients, using a combination of transcriptomic and genomic profiles. Medulloblastomas (MBs), but not high-grade gliomas (HGGs), demonstrated spatially homogeneous transcriptomes, which allowed for accurate subgrouping of tumors from a single biopsy. Conversely, somatic mutations that affect genes suitable for targeted therapeutics demonstrated high levels of spatial heterogeneity in MB, malignant glioma, and renal cell carcinoma (RCC). Actionable targets found in a single MB biopsy were seldom clonal across the entire tumor, which brings the efficacy of monotherapies against a single target into question. Clinical trials of targeted therapies for MB should first ensure the spatially ubiquitous nature of the target mutation.
Subjects
Cerebellar Neoplasms/genetics , Neoplastic Gene Expression Regulation , Medulloblastoma/genetics , Transcriptome , Adult , Aged , Aged 80 and over , Cerebellar Neoplasms/pathology , Child , Preschool Child , Cluster Analysis , DNA Copy Number Variations , Female , Gene Expression Profiling/methods , Genetic Heterogeneity , Genome-Wide Association Study , Humans , INDEL Mutation , Male , Medulloblastoma/pathology , Middle Aged , Mutation , Single Nucleotide Polymorphism , Principal Component Analysis , Reverse Transcriptase Polymerase Chain Reaction
Subjects
Gene Expression Profiling/instrumentation , Gene Expression Profiling/methods , Oligonucleotide Array Sequence Analysis/instrumentation , Oligonucleotide Array Sequence Analysis/methods , DNA Sequence Analysis/methods , Equipment Design , Equipment Failure Analysis , Reproducibility of Results , Sensitivity and Specificity
ABSTRACT
Neuroblastoma is a malignancy of the developing sympathetic nervous system that often presents with widespread metastatic disease, resulting in survival rates of less than 50%. To determine the spectrum of somatic mutation in high-risk neuroblastoma, we studied 240 affected individuals (cases) using a combination of whole-exome, genome and transcriptome sequencing as part of the Therapeutically Applicable Research to Generate Effective Treatments (TARGET) initiative. Here we report a low median exonic mutation frequency of 0.60 per Mb (0.48 nonsilent) and notably few recurrently mutated genes in these tumors. Genes with significant somatic mutation frequencies included ALK (9.2% of cases), PTPN11 (2.9%), ATRX (2.5%, and an additional 7.1% had focal deletions), MYCN (1.7%, causing a recurrent p.Pro44Leu alteration) and NRAS (0.83%). Rare, potentially pathogenic germline variants were significantly enriched in ALK, CHEK2, PINK1 and BARD1. The relative paucity of recurrent somatic mutations in neuroblastoma challenges current therapeutic strategies that rely on frequently altered oncogenic drivers.
Subjects
Exome , Mutation , Neuroblastoma , Tumor Cell Line , Genetic Predisposition to Disease , Human Genome , Humans , Neuroblastoma/genetics , Neuroblastoma/physiopathology , Single Nucleotide Polymorphism , DNA Sequence Analysis , Transcriptome
ABSTRACT
We describe the details of a serial analysis of gene expression (SAGE) library construction and analysis platform that has enabled the generation of >298 high-quality SAGE libraries and >30 million SAGE tags primarily from sub-microgram amounts of total RNA purified from samples acquired by microdissection. Several RNA isolation methods were used to handle the diversity of samples processed, and various measures were applied to minimize ditag PCR carryover contamination. Modifications in the SAGE protocol resulted in improved cloning and DNA sequencing efficiencies. Bioinformatic measures to automatically assess DNA sequencing results were implemented to analyze the integrity of ditag structure, linker or cross-species ditag contamination, and yield of high-quality tags per sequence read. Our analysis of singleton tag errors resulted in a method for correcting such errors to statistically determine tag accuracy. From the libraries generated, we produced an essentially complete mapping of reliable 21-base-pair tags to the mouse reference genome sequence for a meta-library of approximately 5 million tags. Our analyses led us to reject the commonly held notion that duplicate ditags are artifacts. Rather than the usual practice of discarding such tags, we conclude that they should be retained to avoid introducing bias into the results and thereby maintain the quantitative nature of the data, which is a major theoretical advantage of SAGE as a tool for global transcriptional profiling.
Subjects
Gene Expression Profiling/methods , Gene Library , Animals , Caenorhabditis elegans/genetics , Cell Line , Cell Separation , Nucleic Acid Databases , Embryonic Stem Cells/chemistry , Flow Cytometry , Genome , Humans , Mice , Microdissection , DNA Sequence Analysis , Software , Zebrafish/genetics
ABSTRACT
To facilitate discovery of novel human embryonic stem cell (ESC) transcripts, we generated 2.5 million LongSAGE tags from 9 human ESC lines. Analysis of these data revealed that ESCs express proportionately more RNA binding proteins compared with terminally differentiated cells, and identified novel ESC transcripts, at least one of which may represent a marker of the pluripotent state.
Subjects
Embryonic Stem Cells/metabolism , Gene Expression Profiling , Pluripotent Stem Cells/metabolism , Base Sequence , Cell Line , Humans , RNA-Binding Proteins/genetics , Sequence Alignment
ABSTRACT
The cause of mental retardation in one-third to one-half of all affected individuals is unknown. Microscopically detectable chromosomal abnormalities are the most frequently recognized cause, but gain or loss of chromosomal segments that are too small to be seen by conventional cytogenetic analysis has been found to be another important cause. Array-based methods offer a practical means of performing a high-resolution survey of the entire genome for submicroscopic copy-number variants. We studied 100 children with idiopathic mental retardation and normal results of standard chromosomal analysis, by use of whole-genome sampling analysis with Affymetrix GeneChip Human Mapping 100K arrays. We found de novo deletions as small as 178 kb in eight cases, de novo duplications as small as 1.1 Mb in two cases, and unsuspected mosaic trisomy 9 in another case. This technology can detect at least twice as many potentially pathogenic de novo copy-number variants as conventional cytogenetic analysis can in people with mental retardation.