Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Nucleic Acids Res ; 34(Database issue): D590-8, 2006 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-16381938

RESUMO

The University of California Santa Cruz Genome Browser Database (GBD) contains sequence and annotation data for the genomes of about a dozen vertebrate species and several major model organisms. Genome annotations typically include assembly data, sequence composition, genes and gene predictions, mRNA and expressed sequence tag evidence, comparative genomics, regulation, expression and variation data. The database is optimized to support fast interactive performance with web tools that provide powerful visualization and querying capabilities for mining the data. The Genome Browser displays a wide variety of annotations at all scales from single nucleotide level up to a full chromosome. The Table Browser provides direct access to the database tables and sequence data, enabling complex queries on genome-wide datasets. The Proteome Browser graphically displays protein properties. The Gene Sorter allows filtering and comparison of genes by several metrics including expression data and several gene properties. BLAT and In Silico PCR search for sequences in entire genomes in seconds. These tools are highly integrated and provide many hyperlinks to other databases and websites. The GBD, browsing tools, downloadable data files and links to documentation and other information can be found at http://genome.ucsc.edu/.


Assuntos
Bases de Dados Genéticas , Genômica , Sequência de Aminoácidos , Animais , California , Gráficos por Computador , Cães , Expressão Gênica , Genes , Humanos , Internet , Camundongos , Polimorfismo de Nucleotídeo Único , Proteínas/química , Proteínas/genética , Proteínas/metabolismo , Proteômica , Ratos , Alinhamento de Sequência , Software , Interface Usuário-Computador
2.
Nucleic Acids Res ; 31(1): 51-4, 2003 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-12519945

RESUMO

The University of California Santa Cruz (UCSC) Genome Browser Database is an up to date source for genome sequence data integrated with a large collection of related annotations. The database is optimized to support fast interactive performance with the web-based UCSC Genome Browser, a tool built on top of the database for rapid visualization and querying of the data at many levels. The annotations for a given genome are displayed in the browser as a series of tracks aligned with the genomic sequence. Sequence data and annotations may also be viewed in a text-based tabular format or downloaded as tab-delimited flat files. The Genome Browser Database, browsing tools and downloadable data files can all be found on the UCSC Genome Bioinformatics website (http://genome.ucsc.edu), which also contains links to documentation and related technical information.


Assuntos
Bases de Dados Genéticas , Genoma Humano , Genômica , Animais , California , Sistemas de Gerenciamento de Base de Dados , Humanos , Armazenamento e Recuperação da Informação , Camundongos
3.
Pac Symp Biocomput ; : 151-63, 2001.
Artigo em Inglês | MEDLINE | ID: mdl-11262936

RESUMO

In this paper we consider the problem of extracting information from the upstream untranslated regions of genes to make predictions about their transcriptional regulation. We present a method for classifying genes based on motif-based hidden Markov models (HMMs) of their promoter regions. Sequence motifs discovered in yeast promoters are used to construct HMMs that include parameters describing the number and relative locations of motifs within each sequence. Each model provides a Fisher kernel for a support vector machine, which can be used to predict the classifications of unannotated promoters. We demonstrate this method on two classes of genes from the budding yeast, S. cerevisiae. Our results suggest that the additional sequence features captured by the HMM assist in correctly classifying promoters.


Assuntos
Modelos Genéticos , Regiões Promotoras Genéticas , Algoritmos , Sequência de Bases , Sítios de Ligação/genética , DNA Fúngico/genética , DNA Fúngico/metabolismo , Genes Fúngicos , Cadeias de Markov , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Fatores de Transcrição/metabolismo
4.
Nature ; 409(6822): 953-8, 2001 Feb 15.
Artigo em Inglês | MEDLINE | ID: mdl-11237021

RESUMO

We have placed 7,600 cytogenetically defined landmarks on the draft sequence of the human genome to help with the characterization of genes altered by gross chromosomal aberrations that cause human disease. The landmarks are large-insert clones mapped to chromosome bands by fluorescence in situ hybridization. Each clone contains a sequence tag that is positioned on the genomic sequence. This genome-wide set of sequence-anchored clones allows structural and functional analyses of the genome. This resource represents the first comprehensive integration of cytogenetic, radiation hybrid, linkage and sequence maps of the human genome; provides an independent validation of the sequence map and framework for contig order and orientation; surveys the genome for large-scale duplications, which are likely to require special attention during sequence assembly; and allows a stringent assessment of sequence differences between the dark and light bands of chromosomes. It also provides insight into large-scale chromatin structure and the evolution of chromosomes and gene families and will accelerate our understanding of the molecular bases of human disease and cancer.


Assuntos
Aberrações Cromossômicas , Marcadores Genéticos , Genoma Humano , Mapeamento Cromossômico , Cromossomos Artificiais Bacterianos , Análise Citogenética , Projeto Genoma Humano , Humanos , Hibridização in Situ Fluorescente , Mapeamento de Híbridos Radioativos , Sitios de Sequências Rotuladas
5.
Bioinformatics ; 16(10): 906-14, 2000 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-11120680

RESUMO

MOTIVATION: DNA microarray experiments generating thousands of gene expression measurements, are being used to gather information from tissue and cell samples regarding gene expression differences that will be useful in diagnosing disease. We have developed a new method to analyse this kind of data using support vector machines (SVMs). This analysis consists of both classification of the tissue samples, and an exploration of the data for mis-labeled or questionable tissue results. RESULTS: We demonstrate the method in detail on samples consisting of ovarian cancer tissues, normal ovarian tissues, and other normal tissues. The dataset consists of expression experiment results for 97,802 cDNAs for each tissue. As a result of computational analysis, a tissue sample is discovered and confirmed to be wrongly labeled. Upon correction of this mistake and the removal of an outlier, perfect classification of tissues is achieved, but not with high confidence. We identify and analyse a subset of genes from the ovarian dataset whose expression is highly differentiated between the types of tissues. To show robustness of the SVM method, two previously published datasets from other types of tissues or cells are analysed. The results are comparable to those previously obtained. We show that other machine learning methods also perform comparably to the SVM on many of those datasets. AVAILABILITY: The SVM software is available at http://www.cs. columbia.edu/ approximately bgrundy/svm.


Assuntos
Algoritmos , Neoplasias do Colo/classificação , DNA de Neoplasias/análise , Bases de Dados Factuais , Leucemia Mieloide/classificação , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Neoplasias Ovarianas/classificação , Leucemia-Linfoma Linfoblástico de Células Precursoras/classificação , Doença Aguda , Inteligência Artificial , Neoplasias do Colo/genética , Neoplasias do Colo/patologia , Feminino , Humanos , Leucemia Mieloide/genética , Leucemia Mieloide/patologia , Neoplasias Ovarianas/genética , Neoplasias Ovarianas/patologia , Ovário/patologia , Leucemia-Linfoma Linfoblástico de Células Precursoras/genética , Leucemia-Linfoma Linfoblástico de Células Precursoras/patologia , Software
6.
Proc Natl Acad Sci U S A ; 97(1): 262-7, 2000 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-10618406

RESUMO

We introduce a method of functionally classifying genes by using gene expression data from DNA microarray hybridization experiments. The method is based on the theory of support vector machines (SVMs). SVMs are considered a supervised computer learning method because they exploit prior knowledge of gene function to identify unknown genes of similar function from expression data. SVMs avoid several problems associated with unsupervised clustering methods, such as hierarchical clustering and self-organizing maps. SVMs have many mathematical features that make them attractive for gene expression analysis, including their flexibility in choosing a similarity function, sparseness of solution when dealing with large data sets, the ability to handle large feature spaces, and the ability to identify outliers. We test several SVMs that use different similarity metrics, as well as some other supervised learning methods, and find that the SVMs best identify sets of genes with a common function using expression data. Finally, we use SVMs to predict functional roles for uncharacterized yeast ORFs based on their expression data.


Assuntos
DNA/análise , Expressão Gênica/genética , Genes Fúngicos/genética , Saccharomyces cerevisiae/genética , Algoritmos , Computadores , Bases de Dados Factuais , Proteínas Fúngicas/classificação , Proteínas Fúngicas/genética , Hibridização de Ácido Nucleico , Fases de Leitura Aberta/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA