RESUMO
Most mutations in cancer genomes are thought to be acquired after the initiating event, which may cause genomic instability and drive clonal evolution. However, for acute myeloid leukemia (AML), normal karyotypes are common, and genomic instability is unusual. To better understand clonal evolution in AML, we sequenced the genomes of M3-AML samples with a known initiating event (PML-RARA) versus the genomes of normal karyotype M1-AML samples and the exomes of hematopoietic stem/progenitor cells (HSPCs) from healthy people. Collectively, the data suggest that most of the mutations found in AML genomes are actually random events that occurred in HSPCs before they acquired the initiating mutation; the mutational history of that cell is "captured" as the clone expands. In many cases, only one or two additional, cooperating mutations are needed to generate the malignant founding clone. Cells from the founding clone can acquire additional cooperating mutations, yielding subclones that can contribute to disease progression and/or relapse.
Assuntos
Evolução Clonal , Leucemia Mieloide Aguda/genética , Mutação , Adulto , Idoso , Análise Mutacional de DNA , Progressão da Doença , Feminino , Estudo de Associação Genômica Ampla , Células-Tronco Hematopoéticas/metabolismo , Humanos , Leucemia Mieloide Aguda/fisiopatologia , Masculino , Pessoa de Meia-Idade , Proteínas de Fusão Oncogênica/genética , Recidiva , Pele/metabolismo , Adulto JovemRESUMO
Synpolydactyly 1, also called syndactyly type II (SDTY2), is a genetic limb malformation characterized by polydactyly with syndactyly involving the webbing of the third and fourth fingers, and the fourth and fifth toes. It is caused by heterozygous alterations in HOXD13 with incomplete penetrance and phenotypic variability. In our study, a five-generation family with an SPD phenotype was enrolled in our Rare Disease Genomics Protocol. A comprehensive examination of three generations using Illumina short-read whole-genome sequencing (WGS) did not identify any causative variants. Subsequent WGS using Pacific Biosciences (PacBio) long-read HiFi Circular Consensus Sequencing (CCS) revealed a heterozygous 27-bp duplication in the polyalanine tract of HOXD13. Sanger sequencing of all available family members confirmed that the variant segregates with affected individuals. Reanalysis of an unrelated family with a similar SPD phenotype uncovered a 21-bp (7-alanine) duplication in the same region of HOXD13. Although ExpansionHunter identified these events in most individuals in a retrospective analysis, low sequence coverage due to high GC content in the HOXD13 polyalanine tract makes detection of these events challenging. Our findings highlight the value of long-read WGS in elucidating the molecular etiology of congenital limb malformation disorders.
Assuntos
Proteínas de Homeodomínio , Sindactilia , Fatores de Transcrição , Proteínas de Homeodomínio/genética , Humanos , Linhagem , Estudos Retrospectivos , Sindactilia/genética , Fatores de Transcrição/genética , Sequenciamento Completo do GenomaRESUMO
Cancer immunoediting, the process by which the immune system controls tumour outgrowth and shapes tumour immunogenicity, is comprised of three phases: elimination, equilibrium and escape. Although many immune components that participate in this process are known, its underlying mechanisms remain poorly defined. A central tenet of cancer immunoediting is that T-cell recognition of tumour antigens drives the immunological destruction or sculpting of a developing cancer. However, our current understanding of tumour antigens comes largely from analyses of cancers that develop in immunocompetent hosts and thus may have already been edited. Little is known about the antigens expressed in nascent tumour cells, whether they are sufficient to induce protective antitumour immune responses or whether their expression is modulated by the immune system. Here, using massively parallel sequencing, we characterize expressed mutations in highly immunogenic methylcholanthrene-induced sarcomas derived from immunodeficient Rag2(-/-) mice that phenotypically resemble nascent primary tumour cells. Using class I prediction algorithms, we identify mutant spectrin-ß2 as a potential rejection antigen of the d42m1 sarcoma and validate this prediction by conventional antigen expression cloning and detection. We also demonstrate that cancer immunoediting of d42m1 occurs via a T-cell-dependent immunoselection process that promotes outgrowth of pre-existing tumour cell clones lacking highly antigenic mutant spectrin-ß2 and other potential strong antigens. These results demonstrate that the strong immunogenicity of an unedited tumour can be ascribed to expression of highly antigenic mutant proteins and show that outgrowth of tumour cells that lack these strong antigens via a T-cell-dependent immunoselection process represents one mechanism of cancer immunoediting.
Assuntos
Exoma/genética , Exoma/imunologia , Vigilância Imunológica/imunologia , Neoplasias/genética , Neoplasias/imunologia , Linfócitos T/imunologia , Algoritmos , Animais , Proteínas de Transporte/genética , Proteínas de Transporte/imunologia , Proteínas de Ligação a DNA/deficiência , Proteínas de Ligação a DNA/genética , Antígenos de Histocompatibilidade Classe I/imunologia , Humanos , Masculino , Metilcolantreno , Camundongos , Proteínas dos Microfilamentos/genética , Proteínas dos Microfilamentos/imunologia , Modelos Imunológicos , Neoplasias/induzido quimicamente , Neoplasias/patologia , Reprodutibilidade dos Testes , Sarcoma/induzido quimicamente , Sarcoma/genética , Sarcoma/imunologia , Sarcoma/patologiaRESUMO
Most patients with acute myeloid leukaemia (AML) die from progressive disease after relapse, which is associated with clonal evolution at the cytogenetic level. To determine the mutational spectrum associated with relapse, we sequenced the primary tumour and relapse genomes from eight AML patients, and validated hundreds of somatic mutations using deep sequencing; this allowed us to define clonality and clonal evolution patterns precisely at relapse. In addition to discovering novel, recurrently mutated genes (for example, WAC, SMC3, DIS3, DDX41 and DAXX) in AML, we also found two major clonal evolution patterns during AML relapse: (1) the founding clone in the primary tumour gained mutations and evolved into the relapse clone, or (2) a subclone of the founding clone survived initial therapy, gained additional mutations and expanded at relapse. In all cases, chemotherapy failed to eradicate the founding clone. The comparison of relapse-specific versus primary tumour mutations in all eight cases revealed an increase in transversions, probably due to DNA damage caused by cytotoxic chemotherapy. These data demonstrate that AML relapse is associated with the addition of new mutations and clonal evolution, which is shaped, in part, by the chemotherapy that the patients receive to establish and maintain remissions.
Assuntos
Evolução Clonal/genética , Genoma Humano/genética , Leucemia Mieloide Aguda/genética , Leucemia Mieloide Aguda/patologia , Antineoplásicos/efeitos adversos , Antineoplásicos/uso terapêutico , Células Clonais/efeitos dos fármacos , Células Clonais/metabolismo , Células Clonais/patologia , Dano ao DNA/efeitos dos fármacos , Análise Mutacional de DNA , Genes Neoplásicos/genética , Genoma Humano/efeitos dos fármacos , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Leucemia Mieloide Aguda/tratamento farmacológico , Mutagênese/efeitos dos fármacos , Mutagênese/genética , Recidiva , Reprodutibilidade dos TestesRESUMO
In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines. The GMS also serves as a platform for bioinformatics development, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within its data management system. Rather than separating ad-hoc analysis from rigorous, reproducible pipelines, the GMS promotes systematic integration between the two. As a demonstration of the GMS, we performed an integrated analysis of whole genome, exome and transcriptome sequencing data from a breast cancer cell line (HCC1395) and matched lymphoblastoid line (HCC1395BL). These data are available for users to test the software, complete tutorials and develop novel GMS pipeline configurations. The GMS is available at https://github.com/genome/gms.
Assuntos
Mapeamento Cromossômico/métodos , Genoma Humano/genética , Bases de Conhecimento , Modelos Genéticos , Análise de Sequência de DNA/métodos , Interface Usuário-Computador , Algoritmos , Simulação por Computador , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Genéticas , Humanos , Alinhamento de Sequência/métodosRESUMO
Massively parallel DNA sequencing technologies provide an unprecedented ability to screen entire genomes for genetic changes associated with tumour progression. Here we describe the genomic analyses of four DNA samples from an African-American patient with basal-like breast cancer: peripheral blood, the primary tumour, a brain metastasis and a xenograft derived from the primary tumour. The metastasis contained two de novo mutations and a large deletion not present in the primary tumour, and was significantly enriched for 20 shared mutations. The xenograft retained all primary tumour mutations and displayed a mutation enrichment pattern that resembled the metastasis. Two overlapping large deletions, encompassing CTNNA1, were present in all three tumour samples. The differential mutation frequencies and structural variation patterns in metastasis and xenograft compared with the primary tumour indicate that secondary tumours may arise from a minority of cells within the primary tumour.
Assuntos
Neoplasias Encefálicas/genética , Neoplasias Encefálicas/secundário , Neoplasias da Mama/genética , Genoma Humano/genética , Mutação/genética , Transplante de Neoplasias , Adulto , Neoplasias da Mama/patologia , Variações do Número de Cópias de DNA/genética , Análise Mutacional de DNA , Progressão da Doença , Feminino , Frequência do Gene/genética , Genômica , Humanos , Translocação Genética/genética , Transplante Heterólogo , alfa Catenina/genéticaRESUMO
BACKGROUND: The genetic alterations responsible for an adverse outcome in most patients with acute myeloid leukemia (AML) are unknown. METHODS: Using massively parallel DNA sequencing, we identified a somatic mutation in DNMT3A, encoding a DNA methyltransferase, in the genome of cells from a patient with AML with a normal karyotype. We sequenced the exons of DNMT3A in 280 additional patients with de novo AML to define recurring mutations. RESULTS: A total of 62 of 281 patients (22.1%) had mutations in DNMT3A that were predicted to affect translation. We identified 18 different missense mutations, the most common of which was predicted to affect amino acid R882 (in 37 patients). We also identified six frameshift, six nonsense, and three splice-site mutations and a 1.5-Mbp deletion encompassing DNMT3A. These mutations were highly enriched in the group of patients with an intermediate-risk cytogenetic profile (56 of 166 patients, or 33.7%) but were absent in all 79 patients with a favorable-risk cytogenetic profile (P<0.001 for both comparisons). The median overall survival among patients with DNMT3A mutations was significantly shorter than that among patients without such mutations (12.3 months vs. 41.1 months, P<0.001). DNMT3A mutations were associated with adverse outcomes among patients with an intermediate-risk cytogenetic profile or FLT3 mutations, regardless of age, and were independently associated with a poor outcome in Cox proportional-hazards analysis. CONCLUSIONS: DNMT3A mutations are highly recurrent in patients with de novo AML with an intermediate-risk cytogenetic profile and are independently associated with a poor outcome. (Funded by the National Institutes of Health and others.).
Assuntos
DNA (Citosina-5-)-Metiltransferases/genética , Leucemia Mieloide Aguda/genética , Mutação , Adulto , Metilação de DNA , DNA Metiltransferase 3A , Análise Mutacional de DNA/métodos , Feminino , Mutação da Fase de Leitura , Expressão Gênica , Humanos , Cariotipagem , Leucemia Mieloide Aguda/mortalidade , Masculino , Pessoa de Meia-Idade , Técnicas de Amplificação de Ácido Nucleico , Prognóstico , Modelos de Riscos Proporcionais , Análise de SobrevidaRESUMO
Natural killer (NK) cells are innate lymphocytes important for early host defense against infectious pathogens and surveillance against malignant transformation. Resting murine NK cells regulate the translation of effector molecule mRNAs (e.g., granzyme B, GzmB) through unclear molecular mechanisms. MicroRNAs (miRNAs) are small noncoding RNAs that post-transcriptionally regulate the translation of their mRNA targets, and are therefore candidates for mediating this control process. While the expression and importance of miRNAs in T and B lymphocytes have been established, little is known about miRNAs in NK cells. Here, we used two next-generation sequencing (NGS) platforms to define the miRNA transcriptomes of resting and cytokine-activated primary murine NK cells, with confirmation by quantitative real-time PCR (qRT-PCR) and microarrays. We delineate a bioinformatics analysis pipeline that identified 302 known and 21 novel mature miRNAs from sequences obtained from NK cell small RNA libraries. These miRNAs are expressed over a broad range and exhibit isomiR complexity, and a subset is differentially expressed following cytokine activation. Using these miRNA NGS data, miR-223 was identified as a mature miRNA present in resting NK cells with decreased expression following cytokine activation. Furthermore, we demonstrate that miR-223 specifically targets the 3' untranslated region of murine GzmB in vitro, indicating that this miRNA may contribute to control of GzmB translation in resting NK cells. Thus, the sequenced NK cell miRNA transcriptome provides a valuable framework for further elucidation of miRNA expression and function in NK cell biology.
Assuntos
Perfilação da Expressão Gênica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Células Matadoras Naturais/metabolismo , MicroRNAs/genética , Animais , Sequência de Bases , Células Cultivadas , Biologia Computacional/instrumentação , Biologia Computacional/métodos , Regulação da Expressão Gênica/efeitos dos fármacos , Granzimas/genética , Sequenciamento de Nucleotídeos em Larga Escala/instrumentação , Interleucina-15/farmacologia , Ativação Linfocitária/efeitos dos fármacos , Ativação Linfocitária/genética , Camundongos , Camundongos Endogâmicos C57BL , MicroRNAs/isolamento & purificação , MicroRNAs/metabolismo , MicroRNAs/fisiologia , Dados de Sequência Molecular , Hibridização de Ácido Nucleico/métodos , Análise de Sequência de RNA/instrumentação , Análise de Sequência de RNA/métodos , Homologia de Sequência do Ácido NucleicoRESUMO
BACKGROUND: The full complement of DNA mutations that are responsible for the pathogenesis of acute myeloid leukemia (AML) is not yet known. METHODS: We used massively parallel DNA sequencing to obtain a very high level of coverage (approximately 98%) of a primary, cytogenetically normal, de novo genome for AML with minimal maturation (AML-M1) and a matched normal skin genome. RESULTS: We identified 12 acquired (somatic) mutations within the coding sequences of genes and 52 somatic point mutations in conserved or regulatory portions of the genome. All mutations appeared to be heterozygous and present in nearly all cells in the tumor sample. Four of the 64 mutations occurred in at least 1 additional AML sample in 188 samples that were tested. Mutations in NRAS and NPM1 had been identified previously in patients with AML, but two other mutations had not been identified. One of these mutations, in the IDH1 gene, was present in 15 of 187 additional AML genomes tested and was strongly associated with normal cytogenetic status; it was present in 13 of 80 cytogenetically normal samples (16%). The other was a nongenic mutation in a genomic region with regulatory potential and conservation in higher mammals; we detected it in one additional AML tumor. The AML genome that we sequenced contains approximately 750 point mutations, of which only a small fraction are likely to be relevant to pathogenesis. CONCLUSIONS: By comparing the sequences of tumor and skin genomes of a patient with AML-M1, we have identified recurring mutations that may be relevant for pathogenesis.
Assuntos
Isocitrato Desidrogenase/genética , Leucemia Mieloide Aguda/genética , Mutação , Adulto , Análise Mutacional de DNA , Feminino , Frequência do Gene , Genoma Humano , Humanos , Masculino , Pessoa de Meia-Idade , Nucleofosmina , Mutação Puntual , Análise de Sequência de DNA/métodosRESUMO
Massively parallel sequencing instruments enable rapid and inexpensive DNA sequence data production. Because these instruments are new, their data require characterization with respect to accuracy and utility. To address this, we sequenced a Caernohabditis elegans N2 Bristol strain isolate using the Solexa Sequence Analyzer, and compared the reads to the reference genome to characterize the data and to evaluate coverage and representation. Massively parallel sequencing facilitates strain-to-reference comparison for genome-wide sequence variant discovery. Owing to the short-read-length sequences produced, we developed a revised approach to determine the regions of the genome to which short reads could be uniquely mapped. We then aligned Solexa reads from C. elegans strain CB4858 to the reference, and screened for single-nucleotide polymorphisms (SNPs) and small indels. This study demonstrates the utility of massively parallel short read sequencing for whole genome resequencing and for accurate discovery of genome-wide polymorphisms.
Assuntos
Caenorhabditis elegans/genética , Mapeamento Cromossômico/métodos , Análise Mutacional de DNA/métodos , Variação Genética/genética , Polimorfismo de Nucleotídeo Único/genética , Análise de Sequência de DNA/métodos , Animais , Sequência de Bases , Dados de Sequência MolecularRESUMO
MOTIVATION: Targeted 'deep' sequencing of specific genes or regions is of great interest in clinical cancer diagnostics where some sequence variants, particularly translocations and indels, have known prognostic or diagnostic significance. In this setting, it is unnecessary to sequence an entire genome, and target capture methods can be applied to limit sequencing to important regions, thereby reducing costs and the time required to complete testing. Existing 'next-gen' sequencing analysis packages are optimized for efficiency in whole-genome studies and are unable to benefit from the particular structure of targeted sequence data. RESULTS: We developed SLOPE to detect structural variants from targeted short-DNA reads. We use both real and simulated data to demonstrate SLOPE's ability to rapidly detect insertion/deletion events of various sizes as well as translocations and viral integration sites with high sensitivity and low false discovery rate. AVAILABILITY: Binary code available at http://www-genepi.med.utah.edu/suppl/SLOPE/index.html
Assuntos
Análise de Sequência de DNA/métodos , Software , Genoma , Polimorfismo de Nucleotídeo Único , Translocação GenéticaRESUMO
The Oxytricha trifallax mitochondrial genome contains the largest sequenced ciliate mitochondrial chromosome (~70 kb) plus a ~5-kb linear plasmid bearing mitochondrial telomeres. We identify two new ciliate split genes (rps3 and nad2) as well as four new mitochondrial genes (ribosomal small subunit protein genes: rps- 2, 7, 8, 10), previously undetected in ciliates due to their extreme divergence. The increased size of the Oxytricha mitochondrial genome relative to other ciliates is primarily a consequence of terminal expansions, rather than the retention of ancestral mitochondrial genes. Successive segmental duplications, visible in one of the two Oxytricha mitochondrial subterminal regions, appear to have contributed to the genome expansion. Consistent with pseudogene formation and decay, the subtermini possess shorter, more loosely packed open reading frames than the remainder of the genome. The mitochondrial plasmid shares a 251-bp region with 82% identity to the mitochondrial chromosome, suggesting that it most likely integrated into the chromosome at least once. This region on the chromosome is also close to the end of the most terminal member of a series of duplications, hinting at a possible association between the plasmid and the duplications. The presence of mitochondrial telomeres on the mitochondrial plasmid suggests that such plasmids may be a vehicle for lateral transfer of telomeric sequences between mitochondrial genomes. We conjecture that the extreme divergence observed in ciliate mitochondrial genomes may be due, in part, to repeated invasions by relatively error-prone DNA polymerase-bearing mobile elements.
Assuntos
Genoma Mitocondrial/genética , Genoma de Protozoário/genética , Oxytricha/genética , Sequência de Aminoácidos , Sequência de Bases , Mapeamento Cromossômico , Cromossomos/genética , DNA Mitocondrial/genética , RNA Polimerases Dirigidas por DNA/genética , Genes de Protozoários/genética , Código Genético/genética , Macronúcleo/enzimologia , Macronúcleo/genética , Dados de Sequência Molecular , Fases de Leitura Aberta/genética , Plasmídeos/genética , Sinais Direcionadores de Proteínas/genética , Proteínas de Protozoários/química , Proteínas de Protozoários/genética , RNA Ribossômico/genética , RNA de Transferência/genética , Duplicações Segmentares Genômicas/genética , Sintenia/genéticaRESUMO
Exome sequencing of 343 families, each with a single child on the autism spectrum and at least one unaffected sibling, reveal de novo small indels and point substitutions, which come mostly from the paternal line in an age-dependent manner. We do not see significantly greater numbers of de novo missense mutations in affected versus unaffected children, but gene-disrupting mutations (nonsense, splice site, and frame shifts) are twice as frequent, 59 to 28. Based on this differential and the number of recurrent and total targets of gene disruption found in our and similar studies, we estimate between 350 and 400 autism susceptibility genes. Many of the disrupted genes in these studies are associated with the fragile X protein, FMRP, reinforcing links between autism and synaptic plasticity. We find FMRP-associated genes are under greater purifying selection than the remainder of genes and suggest they are especially dosage-sensitive targets of cognitive disorders.