RESUMO
The development of targeted anti-cancer therapies through the study of cancer genomes is intended to increase survival rates and decrease treatment-related toxicity. We treated a transposon-driven, functional genomic mouse model of medulloblastoma with 'humanized' in vivo therapy (microneurosurgical tumour resection followed by multi-fractionated, image-guided radiotherapy). Genetic events in recurrent murine medulloblastoma exhibit a very poor overlap with those in matched murine diagnostic samples (<5%). Whole-genome sequencing of 33 pairs of human diagnostic and post-therapy medulloblastomas demonstrated substantial genetic divergence of the dominant clone after therapy (<12% diagnostic events were retained at recurrence). In both mice and humans, the dominant clone at recurrence arose through clonal selection of a pre-existing minor clone present at diagnosis. Targeted therapy is unlikely to be effective in the absence of the target, therefore our results offer a simple, proximal, and remediable explanation for the failure of prior clinical trials of targeted therapy.
Assuntos
Neoplasias Cerebelares/terapia , Células Clonais/efeitos dos fármacos , Células Clonais/metabolismo , Meduloblastoma/terapia , Recidiva Local de Neoplasia/genética , Recidiva Local de Neoplasia/patologia , Seleção Genética/efeitos dos fármacos , Animais , Neoplasias Cerebelares/genética , Neoplasias Cerebelares/patologia , Neoplasias Cerebelares/radioterapia , Neoplasias Cerebelares/cirurgia , Células Clonais/patologia , Radiação Cranioespinal , Análise Mutacional de DNA , Modelos Animais de Doenças , Drosophila melanogaster/citologia , Drosophila melanogaster/genética , Feminino , Genoma Humano/genética , Humanos , Masculino , Meduloblastoma/genética , Meduloblastoma/patologia , Meduloblastoma/radioterapia , Meduloblastoma/cirurgia , Camundongos , Terapia de Alvo Molecular/métodos , Recidiva Local de Neoplasia/terapia , Radioterapia Guiada por Imagem , Transdução de Sinais , Ensaios Antitumorais Modelo de XenoenxertoRESUMO
Follicular lymphoma (FL) and diffuse large B-cell lymphoma (DLBCL) are the two most common non-Hodgkin lymphomas (NHLs). Here we sequenced tumour and matched normal DNA from 13 DLBCL cases and one FL case to identify genes with mutations in B-cell NHL. We analysed RNA-seq data from these and another 113 NHLs to identify genes with candidate mutations, and then re-sequenced tumour and matched normal DNA from these cases to confirm 109 genes with multiple somatic mutations. Genes with roles in histone modification were frequent targets of somatic mutation. For example, 32% of DLBCL and 89% of FL cases had somatic mutations in MLL2, which encodes a histone methyltransferase, and 11.4% and 13.4% of DLBCL and FL cases, respectively, had mutations in MEF2B, a calcium-regulated gene that cooperates with CREBBP and EP300 in acetylating histones. Our analysis suggests a previously unappreciated disruption of chromatin biology in lymphomagenesis.
Assuntos
Histonas/metabolismo , Linfoma não Hodgkin/genética , Mutação/genética , Cromatina/genética , Cromatina/metabolismo , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Genoma Humano/genética , Histona Acetiltransferases/genética , Histona Acetiltransferases/metabolismo , Histona Metiltransferases , Histona-Lisina N-Metiltransferase/genética , Histona-Lisina N-Metiltransferase/metabolismo , Humanos , Perda de Heterozigosidade/genética , Linfoma Folicular/enzimologia , Linfoma Folicular/genética , Linfoma Difuso de Grandes Células B/enzimologia , Linfoma Difuso de Grandes Células B/genética , Linfoma não Hodgkin/enzimologia , Proteínas de Domínio MADS/genética , Proteínas de Domínio MADS/metabolismo , Fatores de Transcrição MEF2 , Fatores de Regulação Miogênica/genética , Fatores de Regulação Miogênica/metabolismo , Proteínas de Neoplasias/genética , Proteínas de Neoplasias/metabolismoRESUMO
BACKGROUND: Many mutations that contribute to the pathogenesis of acute myeloid leukemia (AML) are undefined. The relationships between patterns of mutations and epigenetic phenotypes are not yet clear. METHODS: We analyzed the genomes of 200 clinically annotated adult cases of de novo AML, using either whole-genome sequencing (50 cases) or whole-exome sequencing (150 cases), along with RNA and microRNA sequencing and DNA-methylation analysis. RESULTS: AML genomes have fewer mutations than most other adult cancers, with an average of only 13 mutations found in genes. Of these, an average of 5 are in genes that are recurrently mutated in AML. A total of 23 genes were significantly mutated, and another 237 were mutated in two or more samples. Nearly all samples had at least 1 nonsynonymous mutation in one of nine categories of genes that are almost certainly relevant for pathogenesis, including transcription-factor fusions (18% of cases), the gene encoding nucleophosmin (NPM1) (27%), tumor-suppressor genes (16%), DNA-methylation-related genes (44%), signaling genes (59%), chromatin-modifying genes (30%), myeloid transcription-factor genes (22%), cohesin-complex genes (13%), and spliceosome-complex genes (14%). Patterns of cooperation and mutual exclusivity suggested strong biologic relationships among several of the genes and categories. CONCLUSIONS: We identified at least one potential driver mutation in nearly all AML samples and found that a complex interplay of genetic events contributes to AML pathogenesis in individual patients. The databases from this study are widely available to serve as a foundation for further investigations of AML pathogenesis, classification, and risk stratification. (Funded by the National Institutes of Health.).
Assuntos
Leucemia Mieloide Aguda/genética , Mutação , Adulto , Ilhas de CpG , Metilação de DNA , Epigenômica , Feminino , Expressão Gênica , Fusão Gênica , Genoma Humano , Humanos , Leucemia Mieloide Aguda/classificação , Masculino , MicroRNAs/genética , Pessoa de Meia-Idade , Nucleofosmina , Análise de Sequência de DNA/métodosRESUMO
Large datasets can be screened for sequences from a specific organism, quickly and with low memory requirements, by a data structure that supports time- and memory-efficient set membership queries. Bloom filters offer such queries but require that false positives be controlled. We present BioBloom Tools, a Bloom filter-based sequence-screening tool that is faster than BWA, Bowtie 2 (popular alignment algorithms) and FACS (a membership query algorithm). It delivers accuracies comparable with these tools, controls false positives and has low memory requirements. Availability and implementaion: www.bcgsc.ca/platform/bioinfo/software/biobloomtools.
Assuntos
Análise de Sequência de DNA/métodos , Software , Algoritmos , Animais , Humanos , CamundongosRESUMO
BACKGROUND: Chimeric transcripts, including partial and internal tandem duplications (PTDs, ITDs) and gene fusions, are important in the detection, prognosis, and treatment of human cancers. RESULTS: We describe Barnacle, a production-grade analysis tool that detects such chimeras in de novo assemblies of RNA-seq data, and supports prioritizing them for review and validation by reporting the relative coverage of co-occurring chimeric and wild-type transcripts. We demonstrate applications in large-scale disease studies, by identifying PTDs in MLL, ITDs in FLT3, and reciprocal fusions between PML and RARA, in two deeply sequenced acute myeloid leukemia (AML) RNA-seq datasets. CONCLUSIONS: Our analyses of real and simulated data sets show that, with appropriate filter settings, Barnacle makes highly specific predictions for three types of chimeric transcripts that are important in a range of cancers: PTDs, ITDs, and fusions. High specificity makes manual review and validation efficient, which is necessary in large-scale disease studies. Characterizing an extended range of chimera types will help generate insights into progression, treatment, and outcomes for complex diseases.
Assuntos
Duplicação Gênica/genética , Perfilação da Expressão Gênica/métodos , Fusão Gênica/genética , Genômica , Neoplasias da Mama/genética , Éxons/genética , Humanos , Leucemia Mieloide Aguda/genética , Anotação de Sequência Molecular , RNA Mensageiro/genética , Estatística como AssuntoRESUMO
We describe Trans-ABySS, a de novo short-read transcriptome assembly and analysis pipeline that addresses variation in local read densities by assembling read substrings with varying stringencies and then merging the resulting contigs before analysis. Analyzing 7.4 gigabases of 50-base-pair paired-end Illumina reads from an adult mouse liver poly(A) RNA library, we identified known, new and alternative structures in expressed transcripts, and achieved high sensitivity and specificity relative to reference-based assembly methods.
Assuntos
Biologia Computacional/métodos , Perfilação da Expressão Gênica , Análise de Sequência de DNA/métodos , Animais , CamundongosRESUMO
Next-generation sequencing is making sequence-based molecular pathology and personalized oncology viable. We selected an individual initially diagnosed with conventional but aggressive prostate adenocarcinoma and sequenced the genome and transcriptome from primary and metastatic tissues collected prior to hormone therapy. The histology-pathology and copy number profiles were remarkably homogeneous, yet it was possible to propose the quadrant of the prostate tumour that likely seeded the metastatic diaspora. Despite a homogeneous cell type, our transcriptome analysis revealed signatures of both luminal and neuroendocrine cell types. Remarkably, the repertoire of expressed but apparently private gene fusions, including C15orf21:MYC, recapitulated this biology. We hypothesize that the amplification and over-expression of the stem cell gene MSI2 may have contributed to the stable hybrid cellular identity. This hybrid luminal-neuroendocrine tumour appears to represent a novel and highly aggressive case of prostate cancer with unique biological features and, conceivably, a propensity for rapid progression to castrate-resistance. Overall, this work highlights the importance of integrated analyses of genome, exome and transcriptome sequences for basic tumour biology, sequence-based molecular pathology and personalized oncology.
Assuntos
Adenocarcinoma/genética , Regulação Neoplásica da Expressão Gênica , Genômica , Neoplasias da Próstata/genética , Adenocarcinoma/secundário , Adenocarcinoma/terapia , Terapia Combinada , DNA de Neoplasias/análise , Amplificação de Genes , Dosagem de Genes , Perfilação da Expressão Gênica , Fusão Gênica , Humanos , Masculino , Pessoa de Meia-Idade , Células Neuroendócrinas/metabolismo , Células Neuroendócrinas/patologia , Prognóstico , Prostatectomia , Neoplasias da Próstata/patologia , Neoplasias da Próstata/terapia , Proteínas de Ligação a RNA/genética , Proteínas de Ligação a RNA/metabolismo , Análise de Sequência de DNA , Análise de Sequência de RNARESUMO
Oligodendroglioma is characterized by unique clinical, pathological, and genetic features. Recurrent losses of chromosomes 1p and 19q are strongly associated with this brain cancer but knowledge of the identity and function of the genes affected by these alterations is limited. We performed exome sequencing on a discovery set of 16 oligodendrogliomas with 1p/19q co-deletion to identify new molecular features at base-pair resolution. As anticipated, there was a high rate of IDH mutations: all cases had mutations in either IDH1 (14/16) or IDH2 (2/16). In addition, we discovered somatic mutations and insertions/deletions in the CIC gene on chromosome 19q13.2 in 13/16 tumours. These discovery set mutations were validated by deep sequencing of 13 additional tumours, which revealed seven others with CIC mutations, thus bringing the overall mutation rate in oligodendrogliomas in this study to 20/29 (69%). In contrast, deep sequencing of astrocytomas and oligoastrocytomas without 1p/19q loss revealed that CIC alterations were otherwise rare (1/60; 2%). Of the 21 non-synonymous somatic mutations in 20 CIC-mutant oligodendrogliomas, nine were in exon 5 within an annotated DNA-interacting domain and three were in exon 20 within an annotated protein-interacting domain. The remaining nine were found in other exons and frequently included truncations. CIC mutations were highly associated with oligodendroglioma histology, 1p/19q co-deletion, and IDH1/2 mutation (p < 0.001). Although we observed no differences in the clinical outcomes of CIC mutant versus wild-type tumours, in a background of 1p/19q co-deletion, hemizygous CIC mutations are likely important. We hypothesize that the mutant CIC on the single retained 19q allele is linked to the pathogenesis of oligodendrogliomas with IDH mutation. Our detailed study of genetic aberrations in oligodendroglioma suggests a functional interaction between CIC mutation, IDH1/2 mutation, and 1p/19q co-deletion.
Assuntos
Biomarcadores Tumorais/genética , Neoplasias Encefálicas/genética , Isocitrato Desidrogenase/genética , Oligodendroglioma/genética , Proteínas Repressoras/genética , Biomarcadores Tumorais/análise , Neoplasias Encefálicas/mortalidade , Neoplasias Encefálicas/patologia , Cromossomos Humanos Par 1/genética , Cromossomos Humanos Par 19/genética , Intervalo Livre de Doença , Humanos , Estimativa de Kaplan-Meier , Mutação , Gradação de Tumores , Oligodendroglioma/mortalidade , Oligodendroglioma/patologiaRESUMO
BACKGROUND: The original sequencing and annotation of the Caenorhabditis elegans genome along with recent advances in sequencing technology provide an exceptional opportunity for the genomic analysis of wild-type and mutant strains. Using the Illumina Genome Analyzer, we sequenced the entire genome of Rec-1, a strain that alters the distribution of meiotic crossovers without changing the overall frequency. Rec-1 was derived from ethylmethane sulfonate (EMS)-treated strains, one of which had a high level of transposable element mobility. Sequencing of this strain provides an opportunity to examine the consequences on the genome of altering the distribution of meiotic recombination events. RESULTS: Using Illumina sequencing and MAQ software, 83% of the base pair sequence reads were aligned to the reference genome available at Wormbase, providing a 21-fold coverage of the genome. Using the software programs MAQ and Slider, we observed 1124 base pair differences between Rec-1 and the reference genome in Wormbase (WS190), and 441 between the mutagenized Rec-1 (BC313) and the wild-type N2 strain (VC2010). The most frequent base-substitution was G:C to A:T, 141 for the entire genome most of which were on chromosomes I or X, 55 and 31 respectively. With this data removed, no obvious pattern in the distribution of the base differences along the chromosomes was apparent. No major chromosomal rearrangements were observed, but additional insertions of transposable elements were detected. There are 11 extra copies of Tc1, and 8 of Tc2 in the Rec-1 genome, most likely the remains of past high-hopper activity in a progenitor strain. CONCLUSION: Our analysis of high-throughput sequencing was able to detect regions of direct repeat sequences, deletions, insertions of transposable elements, and base pair differences. A subset of sequence alterations affecting coding regions were confirmed by an independent approach using oligo array comparative genome hybridization. The major phenotype of the Rec-1 strain is an alteration in the preferred position of the meiotic recombination event with no other significant phenotypic consequences. In this study, we observed no evidence of a mutator effect at the nucleotide level attributable to the Rec-1 mutation.
Assuntos
Caenorhabditis elegans/genética , Genoma Helmíntico , Recombinação Genética , Animais , Sequência de Bases , Hibridização Genômica Comparativa , Elementos de DNA Transponíveis , DNA de Helmintos/genética , Meiose , Dados de Sequência Molecular , Mutagênese Insercional , Sequências Repetitivas de Ácido Nucleico , Análise de Sequência de DNA , SoftwareRESUMO
MOTIVATION: A plethora of alignment tools have been created that are designed to best fit different types of alignment conditions. While some of these are made for aligning Illumina Sequence Analyzer reads, none of these are fully utilizing its probability (prb) output. In this article, we will introduce a new alignment approach (Slider) that reduces the alignment problem space by utilizing each read base's probabilities given in the prb files. RESULTS: Compared with other aligners, Slider has higher alignment accuracy and efficiency. In addition, given that Slider matches bases with probabilities other than the most probable, it significantly reduces the percentage of base mismatches. The result is that its SNP predictions are more accurate than other SNP prediction approaches used today that start from the most probable sequence, including those using base quality.
Assuntos
Algoritmos , Polimorfismo de Nucleotídeo Único/genética , Probabilidade , Alinhamento de Sequência/métodos , Pareamento Incorreto de Bases , Sequência de Bases , Biologia Computacional , Bases de Dados de Ácidos Nucleicos , Humanos , Fatores de TempoRESUMO
Sample tracking and identity are essential when processing multiple samples in parallel. Sequencing applications often involve high sample numbers, and the data are frequently used in a clinical setting. As such, a simple and accurate intrinsic sample tracking process through a sequencing pipeline is essential. Various solutions have been implemented to verify sample identity, including variant detection at the start and end of the pipeline using arrays or genotyping, bioinformatic comparisons, and optical barcoding of samples. None of these approaches are optimal. To establish a more effective approach using genetic barcoding, we developed a panel of unique DNA sequences cloned into a common vector. A unique DNA sequence is added to the sample when it is first received and can be detected by PCR and/or sequencing at any stage of the process. The control sequences are approximately 200 bases long with low identity to any sequence in the National Center for Biotechnology Information nonredundant database (<30 bases) and contain no long homopolymer (>7) stretches. When a spiked next-generation sequencing library is sequenced, sequence reads derived from this control sequence are generated along with the standard sequencing run and are used to confirm sample identity and determine cross-contamination levels. This approach is used in our targeted clinical diagnostic whole-genome and RNA-sequencing pipelines and is an inexpensive, flexible, and platform-agnostic solution.
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento de Nucleotídeos em Larga Escala/normas , Biologia Computacional , Contaminação por DNA , Bases de Dados de Ácidos Nucleicos , Biblioteca Gênica , Humanos , Padrões de Referência , Reprodutibilidade dos Testes , Análise de Sequência de DNARESUMO
Temozolomide (TMZ) is a chemotherapy agent used to treat Grade IV astrocytoma, also known as glioblastoma (GBM). TMZ treatment causes DNA damage that results in tumor cell apoptosis and increases the survival rate of GBM patients. However, chemoresistance as a result of TMZ-induced autophagy significantly reduces this anticancer effects over time. Statins are competitive inhibitors of HMG-CoA reductase, the rate-limiting enzyme of the mevalonate (MEV) cascade. Statins are best known for their cholesterol (CH)-lowering effect. Long-term consumption of statins, prior to and in parallel with other cancer therapeutic approaches, has been reported to increase the survival rate of patients with various forms of cancers. In this study, we investigated the potentiation of TMZ-induced apoptosis by simvastatin (Simva) in human GBM cell lines and patient GBM cells, using cell monolayers and three-dimensional cell culture systems. The incubation of cells with a combination of Simva and TMZ resulted in a significant increase in apoptotic cells compared to cells treated with TMZ alone. Incubation of cells with CH or MEV cascade intermediates failed to compensate the decrease in cell viability induced by the combined Simva and TMZ treatment. Simva treatment inhibited the autophagy flux induced by TMZ by blocking autophago-lysosome formation. Our results suggest that Simva sensitizes GBM cells to TMZ-induced cell death in a MEV cascade-independent manner and identifies the inhibition of autophagosome-lysosome fusion as a promising therapeutic strategy in the treatment of GBM.
Assuntos
Autofagossomos/efeitos dos fármacos , Autofagossomos/metabolismo , Morte Celular/efeitos dos fármacos , Lisossomos/efeitos dos fármacos , Lisossomos/metabolismo , Sinvastatina/farmacologia , Temozolomida/farmacologia , Animais , Linhagem Celular Tumoral , Feminino , Glioblastoma/metabolismo , Humanos , Macrolídeos/farmacologia , Camundongos , Ensaios Antitumorais Modelo de XenoenxertoRESUMO
OBJECTIVES: To identify patient-generated priority topics for future primary care research in British Columbia (BC), Canada within a diverse patient population. DESIGN: Mixed-methods priority setting exercises framed by the dialogue model, using the nominal group technique (rank-ordered scoring) and province-wide online surveys capturing importance ratings of the top 10 primary healthcare topics from patients and primary care providers. SETTING: BC, Canada. PARTICIPANTS: Topic identification was completed by 10 patient partners (7 female, 3 male) from the BC Primary Health Care Research Network Patient Advisory; online surveys were completed by 464 patients and 173 primary care providers. RESULTS: The 10 members recruited to the patient advisory provided over 80 experiences of what stood out for them in BC primary care, which were grouped thematically into 18 topics, 10 of which were retained in province-wide surveys. Top-rated survey topics for both patients (n=464) and providers (n=173) included being unable to find a regular family doctor/other primary healthcare provider, support for living with chronic conditions, mental health resources and information sharing, including electronic medical records. However, all 10 topics were rated important, on average, by both groups. CONCLUSIONS: The current project activities demonstrate the feasibility of including patients in priority setting exercises for primary healthcare in general, rather than focusing on a condition-specific population or disease area. There was considerable overlap between patient-generated topics and topics previously identified by other stakeholders, but patients identified two additional topics (mental health resources, improve and strengthen patient-provider communication). More similarities than differences in topic importance between patients and providers emerged in the online surveys. The project activities that follow (rapid literature reviews, multistakeholder dialogue) will highlight under-researched topics and inform the development of specific research questions.
Assuntos
Prioridades em Saúde , Pesquisa sobre Serviços de Saúde , Participação do Paciente/estatística & dados numéricos , Satisfação do Paciente/estatística & dados numéricos , Atenção Primária à Saúde , Adulto , Comitês Consultivos , Idoso , Colúmbia Britânica/epidemiologia , Educação , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Pesquisa QualitativaRESUMO
In parallel with the production of genomic sequence data, attention is being focused on the generation of comprehensive cDNA-sequence resources. Such efforts are increasingly emphasizing the production of high-accuracy sequence corresponding to the entire insert of cDNA clones, especially those presumed to reflect the full-length mRNA. The complete sequencing of cDNA clones on a large scale presents unique challenges because of the generally small, yet heterogeneous, sizes of the cloned inserts. We have developed a strategy for high-throughput sequencing of cDNA clones using the transposon Tn5. This approach has been tailored for implementation within an existing large-scale 'shotgun-style' sequencing program, although it could be readily adapted for use in virtually any sequencing environment. In addition, we have developed a modified version of our strategy that can be applied to cDNA clones with large cloning vectors, thereby overcoming a potential limitation of transposon-based approaches. Here we describe the details of our cDNA-sequencing pipeline, including a summary of the experience in sequencing more than 4200 cDNA clones to produce more than 8 million base pairs of high-accuracy cDNA sequence. These data provide both convincing evidence that the insertion of Tn5 into cDNA clones is sufficiently random for its effective use in large-scale cDNA sequencing as well as interesting insight about the sequence context preferred for insertion by Tn5.
Assuntos
Elementos de DNA Transponíveis/genética , DNA Complementar/genética , Análise de Sequência de DNA/métodos , Composição de Bases , Distribuição Binomial , Clonagem Molecular , Vetores Genéticos/genética , Mutagênese Insercional/genética , Mapeamento Físico do Cromossomo/métodos , Recombinação Genética/genética , Sensibilidade e EspecificidadeRESUMO
We describe an efficient high-throughput method for accurate DNA sequencing of entire cDNA clones. Developed as part of our involvement in the Mammalian Gene Collection full-length cDNA sequencing initiative, the method has been used and refined in our laboratory since September 2000. Amenable to large scale projects, we have used the method to generate >7 Mb of accurate sequence from 3695 candidate full-length cDNAs. Sequencing is accomplished through the insertion of Mu transposon into cDNAs, followed by sequencing reactions primed with Mu-specific sequencing primers. Transposon insertion reactions are not performed with individual cDNAs but rather on pools of up to 96 clones. This pooling strategy reduces the number of transposon insertion sequencing libraries that would otherwise be required, reducing the costs and enhancing the efficiency of the transposon library construction procedure. Sequences generated using transposon-specific sequencing primers are assembled to yield the full-length cDNA sequence, with sequence editing and other sequence finishing activities performed as required to resolve sequence ambiguities. Although analysis of the many thousands (22 785) of sequenced Mu transposon insertion events revealed a weak sequence preference for Mu insertion, we observed insertion of the Mu transposon into 1015 of the possible 1024 5mer candidate insertion sites.
Assuntos
Bacteriófago mu/genética , Elementos de DNA Transponíveis/genética , DNA Complementar/genética , Mutagênese Insercional/genética , Recombinação Genética/genética , Análise de Sequência de DNA/métodos , Composição de Bases , Clonagem Molecular , Primers do DNA/genética , Biblioteca Gênica , Vetores Genéticos/genética , Método de Monte Carlo , Mapeamento Físico do Cromossomo/métodos , Sensibilidade e Especificidade , Análise de Sequência de DNA/economia , Especificidade por Substrato , Fatores de TempoRESUMO
BACKGROUND: The sequencing and analysis of ESTs is for now the only practical approach for large-scale gene discovery and annotation in conifers because their very large genomes are unlikely to be sequenced in the near future. Our objective was to produce extensive collections of ESTs and cDNA clones to support manufacture of cDNA microarrays and gene discovery in white spruce (Picea glauca [Moench] Voss). RESULTS: We produced 16 cDNA libraries from different tissues and a variety of treatments, and partially sequenced 50,000 cDNA clones. High quality 3' and 5' reads were assembled into 16,578 consensus sequences, 45% of which represented full length inserts. Consensus sequences derived from 5' and 3' reads of the same cDNA clone were linked to define 14,471 transcripts. A large proportion (84%) of the spruce sequences matched a pine sequence, but only 68% of the spruce transcripts had homologs in Arabidopsis or rice. Nearly all the sequences that matched the Populus trichocarpa genome (the only sequenced tree genome) also matched rice or Arabidopsis genomes. We used several sequence similarity search approaches for assignment of putative functions, including blast searches against general and specialized databases (transcription factors, cell wall related proteins), Gene Ontology term assignation and Hidden Markov Model searches against PFAM protein families and domains. In total, 70% of the spruce transcripts displayed matches to proteins of known or unknown function in the Uniref100 database (blastx e-value < 1e-10). We identified multigenic families that appeared larger in spruce than in the Arabidopsis or rice genomes. Detailed analysis of translationally controlled tumour proteins and S-adenosylmethionine synthetase families confirmed a twofold size difference. Sequences and annotations were organized in a dedicated database, SpruceDB. Several search tools were developed to mine the data either based on their occurrence in the cDNA libraries or on functional annotations. CONCLUSION: This report illustrates specific approaches for large-scale gene discovery and annotation in an organism that is very distantly related to any of the fully sequenced genomes. The ArboreaSet sequences and cDNA clones represent a valuable resource for investigations ranging from plant comparative genomics to applied conifer genetics.
Assuntos
Etiquetas de Sequências Expressas , Regulação da Expressão Gênica de Plantas , Genes de Plantas , Picea/genética , Arabidopsis/genética , Parede Celular/metabolismo , Análise por Conglomerados , Mapeamento de Sequências Contíguas , Citoesqueleto/metabolismo , DNA Complementar/metabolismo , Bases de Dados como Assunto , Bases de Dados Genéticas , Biblioteca Gênica , Genoma de Planta , Genômica , Família Multigênica , Oryza/genética , RNA Mensageiro/metabolismo , SoftwareRESUMO
We have designed and implemented a system to manage whole genome shotgun sequences and whole genome sequence assembly data flow. The Sequence Assembly Manager (SAM) consists primarily of a MySQL relational database and Perl applications designed to easily manipulate and coordinate the analysis of sequence information and to view and report genome assembly progress through its Common Gateway Interface (CGI) web interface. The application includes a tool to compare sequence assemblies to fingerprint maps that has been used successfully to improve and validate both maps and sequence assemblies of the Rhodococcus sp.RHAI and Cryptococcus neoformans WM276 genomes.
Assuntos
Algoritmos , Mapeamento Cromossômico/métodos , Impressões Digitais de DNA/métodos , Bases de Dados de Ácidos Nucleicos , Análise de Sequência de DNA/métodos , Software , Interface Usuário-ComputadorRESUMO
JAGuaR is an alignment protocol for RNA-seq reads that uses an extended reference to increase alignment sensitivity. It uses BWA to align reads to the genome and reference transcript models (including annotated exon-exon junctions) specifically allowing for the possibility of a single read spanning multiple exons. Reads aligned to the transcript models are then re-mapped on to genomic coordinates, transforming alignments that span multiple exons into large-gapped alignments on the genome. While JAGuaR does not detect novel junctions, we demonstrate how JAGuaR generates fast and accurate transcriptome alignments, which allows for both sensitive and specific SNV calling.
Assuntos
RNA/genética , Alinhamento de Sequência/métodos , Análise de Sequência de RNA/métodos , Software , Algoritmos , Animais , Sequência de Bases , Éxons , Perfilação da Expressão Gênica/métodos , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Splicing de RNA/genéticaRESUMO
Diffuse intrinsic pontine gliomas (DIPGs) are highly infiltrative malignant glial neoplasms of the ventral pons that, due to their location within the brain, are unsuitable for surgical resection and consequently have a universally dismal clinical outcome. The median survival time is 9-12 months, with neither chemotherapeutic nor targeted agents showing substantial survival benefit in clinical trials in children with these tumors. We report the identification of recurrent activating mutations in the ACVR1 gene, which encodes a type I activin receptor serine/threonine kinase, in 21% of DIPG samples. Strikingly, these somatic mutations (encoding p.Arg206His, p.Arg258Gly, p.Gly328Glu, p.Gly328Val, p.Gly328Trp and p.Gly356Asp substitutions) have not been reported previously in cancer but are identical to mutations found in the germ line of individuals with the congenital childhood developmental disorder fibrodysplasia ossificans progressiva (FOP) and have been shown to constitutively activate the BMP-TGF-ß signaling pathway. These mutations represent new targets for therapeutic intervention in this otherwise incurable disease.
Assuntos
Receptores de Ativinas Tipo I/genética , Neoplasias do Tronco Encefálico/genética , Regulação Neoplásica da Expressão Gênica/genética , Genoma Humano/genética , Glioma/genética , Mutação de Sentido Incorreto/genética , Sequência de Bases , Neoplasias do Tronco Encefálico/classificação , Criança , Estudos de Coortes , Exoma/genética , Glioma/classificação , Humanos , Dados de Sequência Molecular , Miosite Ossificante/genética , Análise de Sequência de DNA , Transdução de Sinais/genéticaRESUMO
During meiosis, evolutionarily conserved mechanisms regulate chromosome remodeling, leading to the formation of a tight bivalent structure. This bivalent, a linked pair of homologous chromosomes, is essential for proper chromosome segregation in meiosis. The formation of a tight bivalent involves chromosome condensation and restructuring around the crossover. The synaptonemal complex (SC), which mediates homologous chromosome association before crossover formation, disassembles concurrently with increased condensation during bivalent remodeling. Both chromosome condensation and SC disassembly are likely critical steps in acquiring functional bivalent structure. The mechanisms controlling SC disassembly, however, remain unclear. Here we identify akir-1 as a gene involved in key events of meiotic prophase I in Caenorhabditis elegans. AKIR-1 is a protein conserved among metazoans that lacks any previously known function in meiosis. We show that akir-1 mutants exhibit severe meiotic defects in late prophase I, including improper disassembly of the SC and aberrant chromosome condensation, independently of the condensin complexes. These late-prophase defects then lead to aberrant reconfiguring of the bivalent. The meiotic divisions are delayed in akir-1 mutants and are accompanied by lagging chromosomes. Our analysis therefore provides evidence for an important role of proper SC disassembly in configuring a functional bivalent structure.