Your browser doesn't support javascript.
loading
The multiassembly problem: reconstructing multiple transcript isoforms from EST fragment mixtures.
Xing, Yi; Resch, Alissa; Lee, Christopher.
Afiliação
  • Xing Y; UCLA-DOE Center for Genomics and Proteomics, Molecular Biology Institute and Department of Chemistry & Biochemistry, University of California, Los Angeles, Los Angeles, California 90095-1570, USA.
Genome Res ; 14(3): 426-41, 2004 Mar.
Article em En | MEDLINE | ID: mdl-14962984
Recent evidence of abundant transcript variation (e.g., alternative splicing, alternative initiation, alternative polyadenylation) in complex genomes indicates that cataloging the complete set of transcripts from an organism is an important project. One challenge is the fact that most high-throughput experimental methods for characterizing transcripts (such as EST sequencing) give highly detailed information about short fragments of transcripts or protein products, instead of a complete characterization of a full-length form. We analyze this "multiassembly problem"-reconstructing the most likely set of full-length isoform sequences from a mixture of EST fragment data-and present a graph-based algorithm for solving it. In a variety of tests, we demonstrate that this algorithm deals appropriately with coupling of distinct alternative splicing events, increasing fragmentation of the input data and different types of transcript variation (such as alternative splicing, initiation, polyadenylation, and intron retention). To test the method's performance on pure fragment (EST) data, we removed all mRNA sequences, and found it produced no errors in 40 cases tested. Using this algorithm, we have constructed an Alternatively Spliced Proteins database (ASP) from analysis of human expressed and genomic sequences, consisting of 13,384 protein isoforms of 4422 genes, yielding an average of 3.0 protein isoforms per gene.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Transcrição Gênica / Transcobalaminas / Etiquetas de Sequências Expressas Limite: Humans Idioma: En Revista: Genome Res Assunto da revista: BIOLOGIA MOLECULAR / GENETICA Ano de publicação: 2004 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Transcrição Gênica / Transcobalaminas / Etiquetas de Sequências Expressas Limite: Humans Idioma: En Revista: Genome Res Assunto da revista: BIOLOGIA MOLECULAR / GENETICA Ano de publicação: 2004 Tipo de documento: Article País de afiliação: Estados Unidos
...