Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Genome Res ; 21(5): 756-67, 2011 May.
Artigo em Inglês | MEDLINE | ID: mdl-21460061

RESUMO

Recent advances in proteomic mass spectrometry (MS) offer the chance to marry high-throughput peptide sequencing to transcript models, allowing the validation, refinement, and identification of new protein-coding loci. We present a novel pipeline that integrates highly sensitive and statistically robust peptide spectrum matching with genome-wide protein-coding predictions to perform large-scale gene validation and discovery in the mouse genome for the first time. In searching an excess of 10 million spectra, we have been able to validate 32%, 17%, and 7% of all protein-coding genes, exons, and splice boundaries, respectively. Moreover, we present strong evidence for the identification of multiple alternatively spliced translations from 53 genes and have uncovered 10 entirely novel protein-coding genes, which are not covered in any mouse annotation data sources. One such novel protein-coding gene is a fusion protein that spans the Ins2 and Igf2 loci to produce a transcript encoding the insulin II and the insulin-like growth factor 2-derived peptides. We also report nine processed pseudogenes that have unique peptide hits, demonstrating, for the first time, that they are not just transcribed but are translated and are therefore resurrected into new coding loci. This work not only highlights an important utility for MS data in genome annotation but also provides unique insights into the gene structure and propagation in the mouse genome. All these data have been subsequently used to improve the publicly available mouse annotation available in both the Vega and Ensembl genome browsers (http://vega.sanger.ac.uk).


Assuntos
Processamento Alternativo , Genes , Peptídeos/genética , Proteômica/métodos , Pseudogenes/genética , Espectrometria de Massas em Tandem/métodos , Animais , Genoma , Genômica/métodos , Camundongos , Peptídeos/química
2.
mSystems ; 9(7): e0050524, 2024 Jul 23.
Artigo em Inglês | MEDLINE | ID: mdl-38953320

RESUMO

Nanopore direct RNA sequencing (DRS) enables the capture and full-length sequencing of native RNAs, without recoding or amplification bias. Resulting data sets may be interrogated to define the identity and location of chemically modified ribonucleotides, as well as the length of poly(A) tails, on individual RNA molecules. The success of these analyses is highly dependent on the provision of high-resolution transcriptome annotations in combination with workflows that minimize misalignments and other analysis artifacts. Existing software solutions for generating high-resolution transcriptome annotations are poorly suited to small gene-dense genomes of viruses due to the challenge of identifying distinct transcript isoforms where alternative splicing and overlapping RNAs are prevalent. To resolve this, we identified key characteristics of DRS data sets that inform resulting read alignments and developed the nanopore guided annotation of transcriptome architectures (NAGATA) software package (https://github.com/DepledgeLab/NAGATA). We demonstrate, using a combination of synthetic and original DRS data sets derived from adenoviruses, herpesviruses, coronaviruses, and human cells, that NAGATA outperforms existing transcriptome annotation software and yields a consistently high level of precision and recall when reconstructing both gene sparse and gene-dense transcriptomes. Finally, we apply NAGATA to generate the first high-resolution transcriptome annotation of the neglected pathogen human adenovirus type F41 (HAdV-41) for which we identify 77 distinct transcripts encoding at least 23 different proteins. IMPORTANCE: The transcriptome of an organism denotes the full repertoire of encoded RNAs that may be expressed. This is critical to understanding the biology of an organism and for accurate transcriptomic and epitranscriptomic-based analyses. Annotating transcriptomes remains a complex task, particularly in small gene-dense organisms such as viruses which maximize their coding capacity through overlapping RNAs. To resolve this, we have developed a new software nanopore guided annotation of transcriptome architectures (NAGATA) which utilizes nanopore direct RNA sequencing (DRS) datasets to rapidly produce high-resolution transcriptome annotations for diverse viruses and other organisms.


Assuntos
Anotação de Sequência Molecular , Software , Transcriptoma , Humanos , Transcriptoma/genética , Anotação de Sequência Molecular/métodos , Análise de Sequência de RNA/métodos , Herpesviridae/genética , Coronavirus/genética , Sequenciamento por Nanoporos/métodos , Nanoporos , Adenoviridae/genética
3.
bioRxiv ; 2024 Apr 03.
Artigo em Inglês | MEDLINE | ID: mdl-38617228

RESUMO

High-resolution annotations of transcriptomes from all domains of life are essential for many sequencing-based RNA analyses, including Nanopore direct RNA sequencing (DRS), which would otherwise be hindered by misalignments and other analysis artefacts. DRS allows the capture and full-length sequencing of native RNAs, without recoding or amplification bias, and resulting data may be interrogated to define the identity and location of chemically modified ribonucleotides, as well as the length of poly(A) tails on individual RNA molecules. Existing software solutions for generating high-resolution transcriptome annotations are poorly suited to small gene dense organisms such as viruses due to the challenge of identifying distinct transcript isoforms where alternative splicing and overlapping RNAs are prevalent. To resolve this, we identified key characteristics of DRS datasets and developed a novel approach to transcriptome. We demonstrate, using a combination of synthetic and original datasets, that our novel approach yields a high level of precision and recall when reconstructing both gene sparse and gene dense transcriptomes from DRS datasets. We further apply this approach to generate a new high resolution transcriptome annotation of the neglected pathogen human adenovirus type F 41 for which we identify 77 distinct transcripts encoding at least 23 different proteins.

4.
mBio ; 13(3): e0370221, 2022 06 28.
Artigo em Inglês | MEDLINE | ID: mdl-35579392

RESUMO

The chemical modification of ribonucleotides plays an integral role in the biology of diverse viruses and their eukaryotic host cells. Mapping the precise identity, location, and abundance of modified ribonucleotides remains a key goal of many studies aimed at characterizing the function and importance of a given modification. While mapping of specific RNA modifications through short-read sequencing approaches has powered a wealth of new discoveries in the past decade, this approach is limited by inherent biases and an absence of linkage information. Moreover, in viral contexts, the challenge is increased due to the compact nature of viral genomes giving rise to many overlapping transcript isoforms that cannot be adequately resolved using short-read sequencing approaches. The recent emergence of nanopore sequencing, specifically the ability to directly sequence native RNAs from virus-infected host cells, provides not just a new methodology for mapping modified ribonucleotides but also a new conceptual framework for what can be derived from the resulting sequencing data. In this minireview, we provide a detailed overview of how nanopore direct RNA sequencing works, the computational approaches applied to identify modified ribonucleotides, and the core concepts underlying both. We further highlight recent studies that have applied this approach to interrogating viral biology and finish by discussing key experimental considerations and how we predict that these methodologies will continue to evolve.


Assuntos
Nanoporos , Vírus , Sequenciamento de Nucleotídeos em Larga Escala/métodos , RNA/genética , RNA Viral/genética , Ribonucleotídeos , Análise de Sequência de RNA/métodos , Vírus/genética
5.
Clin Cancer Res ; 25(16): 5156-5166, 2019 08 15.
Artigo em Inglês | MEDLINE | ID: mdl-31227497

RESUMO

PURPOSE: Uveal melanoma is a primary malignancy of the eye with oncogenic mutations in GNAQ, GNA11, or CYSLTR2, and additional mutations in BAP1 (usually associated with LOH of Chr 3), SF3B1, or EIF1AX. There are other characteristic chromosomal alterations, but their significance is not clear. EXPERIMENTAL DESIGN: To investigate genes driving chromosomal alterations, we integrated copy number, transcriptome, and mutation data from three cohorts and followed up key findings. RESULTS: We observed significant enrichment of transcripts on chromosomes 1p, 3, 6, 8, and 16q and identified seven shared focal copy number alterations (FCNAs) on Chr 1p36, 2q37, 3, 6q25, 6q27, and 8q24. Integrated analyses revealed clusters of genes in focal copy number regions whose expression was associated with metastasis and worse overall survival. This included genes from Chr 1p36, 3p21, and 8q24.3. At Chr 6q27, we identified two tumors with homozygous deletion of PHF10/BAF45a and one with a frameshift mutation with concomitant loss of the wild-type allele. Downregulation of PHF10 in uveal melanoma cell lines and tumors altered a number of biological pathways including development and adhesion. These findings provide support for a role for PHF10 as a novel tumor suppressor at Chr 6q27. CONCLUSIONS: Integration of copy number, transcriptome, and mutation data revealed novel candidate genes playing a role in uveal melanoma pathogenesis and a potential tumor suppressor role for PHF10.


Assuntos
Transformação Celular Neoplásica/genética , Variações do Número de Cópias de DNA , Genes Supressores de Tumor , Proteínas de Homeodomínio/genética , Melanoma/genética , Proteínas de Neoplasias/genética , Fatores de Transcrição/genética , Neoplasias Uveais/genética , Metilação de DNA , Perfilação da Expressão Gênica , Técnicas de Silenciamento de Genes , Homozigoto , Humanos , Melanoma/mortalidade , Melanoma/patologia , Mutação , Polimorfismo de Nucleotídeo Único , Prognóstico , Transcriptoma , Neoplasias Uveais/mortalidade , Neoplasias Uveais/patologia
6.
G3 (Bethesda) ; 6(8): 2343-54, 2016 08 09.
Artigo em Inglês | MEDLINE | ID: mdl-27233670

RESUMO

In mammals the regulation of genomic instability plays a key role in tumor suppression and also controls genome plasticity, which is important for recombination during the processes of immunity and meiosis. Most studies to identify regulators of genomic instability have been performed in cells in culture or in systems that report on gross rearrangements of the genome, yet subtle differences in the level of genomic instability can contribute to whole organism phenotypes such as tumor predisposition. Here we performed a genome-wide association study in a population of 1379 outbred Crl:CFW(SW)-US_P08 mice to dissect the genetic landscape of micronucleus formation, a biomarker of chromosomal breaks, whole chromosome loss, and extranuclear DNA. Variation in micronucleus levels is a complex trait with a genome-wide heritability of 53.1%. We identify seven loci influencing micronucleus formation (false discovery rate <5%), and define candidate genes at each locus. Intriguingly at several loci we find evidence for sexual dimorphism in micronucleus formation, with a locus on chromosome 11 being specific to males.


Assuntos
Quebra Cromossômica , Estudo de Associação Genômica Ampla , Micronúcleos com Defeito Cromossômico , Locos de Características Quantitativas/genética , Animais , Mapeamento Cromossômico , Feminino , Instabilidade Genômica , Genótipo , Masculino , Camundongos , Fenótipo , Polimorfismo de Nucleotídeo Único , Caracteres Sexuais
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa