Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
1.
Nucleic Acids Res ; 31(1): 82-6, 2003 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-12519953

RESUMEN

NetAffx (http://www.affymetrix.com) details and annotates probesets on Affymetrix GeneChip microarrays. These annotations include (i) static information specific to the probeset composition; (ii) sequence annotations extracted from public databases; and (iii) protein sequence-level annotations derived from public domain programs, as well as libraries of hidden Markov models (HMMs) developed at Affymetrix. For each probeset, NetAffx lists the probe sequences, and the consensus sequence interrogated by the probes; for the larger chip sets, interactive maps display this sequence data in genomic context. Sequence annotations include Gene Ontology (GO) terms and depiction of GO graph relationships; predicted protein domains and motifs; orthologous sequences; links to relevant pathways; and links to public databases including UniGene, LocusLink, SWISS-PROT and OMIM.


Asunto(s)
Bases de Datos Genéticas , Perfilación de la Expresión Génica , Análisis de Secuencia por Matrices de Oligonucleótidos , Animales , Secuencia de Consenso , Almacenamiento y Recuperación de la Información , Cadenas de Markov , Proteínas/química , Análisis de Secuencia de Proteína , Programas Informáticos
2.
J Bioinform Comput Biol ; 1(2): 289-306, 2003 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-15290774

RESUMEN

Understanding how alternative splicing affects gene function is an important challenge facing modern-day molecular biology. Using homology-based, protein sequence analysis methods, it should be possible to investigate how transcript diversity impacts protein function. To test this, high-quality exon-intron structures were deduced for over 8000 human genes, including over 1300 (17 percent) that produce multiple transcript variants. A data mining technique (DiffMotif) was developed to identify genes in which transcript variation coincides with changes in conserved motifs between variants. Applying this method, we found that 30 percent of the multi-variant genes in our test set exhibited a differential profile of conserved InterPro and/or BLOCKS motifs across different mRNA variants. To investigate these, a visualization tool (ProtAnnot) that displays amino acid motifs in the context of genomic sequence was developed. Using this tool, genes revealed by the DiffMotif method were analyzed, and when possible, hypotheses regarding the potential role of alternative transcript structure in modulating gene function were developed. Examples of these, including: MEOX1, a homeobox-containing protein; AIRE, involved in auto-immune disease; PLAT, tissue type plasminogen activator; and CD79b, a component of the B-cell receptor complex, are presented. These results demonstrate that amino acid motif databases like BLOCKS and InterPro are useful tools for investigating how alternative transcript structure affects gene function.


Asunto(s)
Empalme Alternativo/genética , Mapeo Cromosómico/métodos , Bases de Datos de Proteínas , Genoma Humano , Alineación de Secuencia/métodos , Análisis de Secuencia de Proteína/métodos , Factores de Transcripción/genética , Algoritmos , Secuencias de Aminoácidos/genética , Secuencia Conservada , Regulación de la Expresión Génica/genética , Variación Genética , Humanos , Proteínas/química , Proteínas/genética , Relación Estructura-Actividad
3.
Bioinformatics ; 19(5): 667-8, 2003 Mar 22.
Artículo en Inglés | MEDLINE | ID: mdl-12651732

RESUMEN

GPCR-GRAPA-LIB is a library of HMMs describing G protein coupled receptor families. These families are initially defined by class of receptor ligand, with divergent families divided into subfamilies using phylogenic analysis and knowledge of GPCR function. Protein sequences are applied to the models with the GRAPA curve-based selection criteria. RefSeq sequences for Homo sapiens, Drosophila melanogaster, and Caenorhabditis elegans have been annotated using this approach.


Asunto(s)
Bases de Datos de Proteínas , Proteínas de Unión al GTP/química , Proteínas de Unión al GTP/genética , Modelos Genéticos , Modelos Estadísticos , Alineación de Secuencia/métodos , Análisis de Secuencia de Proteína/métodos , Algoritmos , Secuencia de Aminoácidos , Documentación , Evolución Molecular , Proteínas de Unión al GTP/clasificación , Datos de Secuencia Molecular , Relación Estructura-Actividad
4.
Artículo en Inglés | MEDLINE | ID: mdl-15838129

RESUMEN

Understanding the functional significance of alternative splicing and other mechanisms that generate RNA transcript diversity is an important challenge facing modern-day molecular biology. Using homology-based, protein sequence analysis methods, it should be possible to investigate how transcript diversity impacts protein structure and function. To test this, a data mining technique ("DiffHit") was developed to identify and catalog genes producing protein isoforms which exhibit distinct profiles of conserved protein motifs. We found that out of a test set of over 1,300 alternatively spliced genes with solved genomic structure, over 30% exhibited a differential profile of conserved InterPro and/or Blocks protein motifs across distinct isoforms. These results suggest that motif databases such as Blocks and InterPro are potentially useful tools for investigating how alternative transcript structure affects gene function.


Asunto(s)
Empalme Alternativo/genética , Bases de Datos de Proteínas , Genoma Humano , Almacenamiento y Recuperación de la Información/métodos , Proteoma/genética , Alineación de Secuencia/métodos , Análisis de Secuencia de Proteína/métodos , Algoritmos , Sistemas de Administración de Bases de Datos , Evolución Molecular , Perfilación de la Expresión Génica/métodos , Humanos , Isoformas de Proteínas/química , Isoformas de Proteínas/genética , Proteoma/química , Homología de Secuencia de Aminoácido , Transcripción Genética/genética
5.
J Biopharm Stat ; 14(3): 687-700, 2004 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-15468759

RESUMEN

We have developed an algorithm for inferring the degree of similarity between genes by using the graph-based structure of Gene Ontology (GO). We applied this knowledge-based similarity metric to a clique-finding algorithm for detecting sets of related genes with biological classifications. We also combined it with an expression-based distance metric to produce a co-cluster analysis, which accentuates genes with both similar expression profiles and similar biological characteristics and identifies gene clusters that are more stable and biologically meaningful. These algorithms are demonstrated in the analysis of MPRO cell differentiation time series experiments.


Asunto(s)
Algoritmos , Inteligencia Artificial , Análisis por Conglomerados , Análisis de Secuencia por Matrices de Oligonucleótidos/estadística & datos numéricos , Diferenciación Celular/efectos de los fármacos , Diferenciación Celular/fisiología , Humanos , Neutrófilos/efectos de los fármacos , Tretinoina/farmacología
6.
Pac Symp Biocomput ; : 127-38, 2002.
Artículo en Inglés | MEDLINE | ID: mdl-11928469

RESUMEN

The field of comparative genomics allows us to elucidate the molecular mechanisms necessary for the machinery of an organism by contrasting its genome against those of other organisms. In this paper, we contrast the genome of homo sapiens against C. Elegans, Drosophila melanogaster, and S. cerevisiae to gain insights on what structural domains are present in each organism. Previous work has assessed this using sequence-based homology recognition systems such as Pfam [1] and Interpro [2]. Here, we pursue a structure-based assessment, analyzing genomes according to domains in the SCOP structural domain dictionary. Compared to other eukaryotic genomes, we observe additional domains in the human genome relating to signal transduction, immune response, transport, and certain enzymes. Compared to the metazoan genomes, the yeast genome shows an absence of domains relating to immune response, cell-cell interactions, and cell signaling.


Asunto(s)
Genoma , Genómica/métodos , Análisis de Secuencia de ADN/métodos , Animales , Caenorhabditis elegans/genética , Simulación por Computador , Drosophila melanogaster/genética , Enzimas/genética , Humanos , Modelos Genéticos , Saccharomyces cerevisiae/genética , Dedos de Zinc/genética
7.
Bioinformatics ; 20(9): 1462-3, 2004 Jun 12.
Artículo en Inglés | MEDLINE | ID: mdl-14962933

RESUMEN

SUMMARY: The NetAffx Gene Ontology (GO) Mining Tool is a web-based, interactive tool that permits traversal of the GO graph in the context of microarray data. It accepts a list of Affymetrix probe sets and renders a GO graph as a heat map colored according to significance measurements. The rendered graph is interactive, with nodes linked to public web sites and to lists of the relevant probe sets. The GO Mining Tool provides visualization combining biological annotation with expression data, encompassing thousands of genes in one interactive view. AVAILABILITY: GO Mining Tool is freely available at http://www.affymetrix.com/analysis/query/go_analysis.affx


Asunto(s)
Algoritmos , Documentación/métodos , Almacenamiento y Recuperación de la Información/métodos , Procesamiento de Lenguaje Natural , Análisis de Secuencia por Matrices de Oligonucleótidos , Programas Informáticos , Interfaz Usuario-Computador , Indización y Redacción de Resúmenes/métodos , Gráficos por Computador , Sistemas de Administración de Bases de Datos , Perfilación de la Expresión Génica/métodos
8.
Bioinformatics ; 19 Suppl 1: i315-22, 2003.
Artículo en Inglés | MEDLINE | ID: mdl-12855476

RESUMEN

MOTIVATION: Alternative splicing allows a single gene to generate multiple mRNAs, which can be translated into functionally and structurally diverse proteins. One gene can have multiple variants coexisting at different concentrations. Estimating the relative abundance of each variant is important for the study of underlying biological function. Microarrays are standard tools that measure gene expression. But most design and analysis has not accounted for splice variants. Thus splice variant-specific chip designs and analysis algorithms are needed for accurate gene expression profiling. RESULTS: Inspired by Li and Wong (2001), we developed a gene structure-based algorithm to determine the relative abundance of known splice variants. Probe intensities are modeled across multiple experiments using gene structures as constraints. Model parameters are obtained through a maximum likelihood estimation (MLE) process/framework. The algorithm produces the relative concentration of each variant, as well as an affinity term associated with each probe. Validation of the algorithm is performed by a set of controlled spike experiments as well as endogenous tissue samples using a human splice variant array.


Asunto(s)
Algoritmos , Empalme Alternativo/genética , Proteínas de Drosophila , Perfilación de la Expresión Génica/métodos , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Alineación de Secuencia/métodos , Análisis de Secuencia de ADN/métodos , Sondas de ADN/genética , Diseño de Equipo , Análisis de Falla de Equipo , Variación Genética , Humanos , Modelos Genéticos , Modelos Estadísticos , Análisis de Secuencia por Matrices de Oligonucleótidos/instrumentación , Tropomiosina/genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA