Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
1.
Nucleic Acids Res ; 35(Database issue): D247-52, 2007 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-17130144

RESUMO

PANTHER is a freely available, comprehensive software system for relating protein sequence evolution to the evolution of specific protein functions and biological roles. Since 2005, there have been three main improvements to PANTHER. First, the sequences used to create evolutionary trees are carefully selected to provide coverage of phylogenetic as well as functional information. Second, PANTHER is now a member of the InterPro Consortium, and the PANTHER hidden markov Models (HMMs) are distributed as part of InterProScan. Third, we have dramatically expanded the number of pathways associated with subfamilies in PANTHER. Pathways provide a detailed, structured representation of protein function in the context of biological reaction networks. PANTHER pathways were generated using the emerging Systems Biology Markup Language (SBML) standard using pathway network editing software called CellDesigner. The pathway collection currently contains approximately 1500 reactions in 130 pathways, curated by expert biologists with authorship attribution. The curation environment is designed to be easy to use, and the number of pathways is growing steadily. Because the reaction participants are linked to subfamilies and corresponding HMMs, reactions can be inferred across numerous different organisms. The HMMs can be downloaded by FTP, and tools for analyzing data in the context of pathways and function ontologies are available at http://www.pantherdb.org.


Assuntos
Bases de Dados de Proteínas , Evolução Molecular , Proteínas/fisiologia , Animais , Humanos , Internet , Camundongos , Modelos Biológicos , Filogenia , Proteínas/classificação , Proteínas/genética , Ratos , Análise de Sequência de Proteína , Software , Interface Usuário-Computador
2.
Nucleic Acids Res ; 35(Database issue): D224-8, 2007 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-17202162

RESUMO

InterPro is an integrated resource for protein families, domains and functional sites, which integrates the following protein signature databases: PROSITE, PRINTS, ProDom, Pfam, SMART, TIGRFAMs, PIRSF, SUPERFAMILY, Gene3D and PANTHER. The latter two new member databases have been integrated since the last publication in this journal. There have been several new developments in InterPro, including an additional reading field, new database links, extensions to the web interface and additional match XML files. InterPro has always provided matches to UniProtKB proteins on the website and in the match XML file on the FTP site. Additional matches to proteins in UniParc (UniProt archive) are now available for download in the new match XML files only. The latest InterPro release (13.0) contains more than 13 000 entries, covering over 78% of all proteins in UniProtKB. The database is available for text- and sequence-based searches via a webserver (http://www.ebi.ac.uk/interpro), and for download by anonymous FTP (ftp://ftp.ebi.ac.uk/pub/databases/interpro). The InterProScan search tool is now also available via a web service at http://www.ebi.ac.uk/Tools/webservices/WSInterProScan.html.


Assuntos
Bases de Dados de Proteínas , Internet , Estrutura Terciária de Proteína , Proteínas/química , Proteínas/classificação , Proteínas/fisiologia , Análise de Sequência de Proteína , Integração de Sistemas , Interface Usuário-Computador
3.
Nucleic Acids Res ; 34(Web Server issue): W645-50, 2006 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-16912992

RESUMO

The vast amount of protein sequence data now available, together with accumulating experimental knowledge of protein function, enables modeling of protein sequence and function evolution. The PANTHER database was designed to model evolutionary sequence-function relationships on a large scale. There are a number of applications for these data, and we have implemented web services that address three of them. The first is a protein classification service. Proteins can be classified, using only their amino acid sequences, to evolutionary groups at both the family and subfamily levels. Specific subfamilies, and often families, are further classified when possible according to their functions, including molecular function and the biological processes and pathways they participate in. The second application, then, is an expression data analysis service, where functional classification information can help find biological patterns in the data obtained from genome-wide experiments. The third application is a coding single-nucleotide polymorphism scoring service. In this case, information about evolutionarily related proteins is used to assess the likelihood of a deleterious effect on protein function arising from a single substitution at a specific amino acid position in the protein. All three web services are available at http://www.pantherdb.org/tools.


Assuntos
Bases de Dados de Proteínas , Evolução Molecular , Polimorfismo de Nucleotídeo Único , Proteínas/genética , Proteínas/fisiologia , Análise de Sequência de Proteína , Software , Substituição de Aminoácidos , Animais , Gráficos por Computador , Interpretação Estatística de Dados , Drosophila melanogaster/genética , Genômica , Humanos , Internet , Cadeias de Markov , Camundongos , Proteínas/classificação , RNA Mensageiro/metabolismo , Ratos , Interface Usuário-Computador
4.
PLoS Genet ; 1(6): e83, 2005 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-16429166

RESUMO

The human genome contains an estimated 100,000 to 300,000 DNA variants that alter an amino acid in an encoded protein. However, our ability to predict which of these variants are functionally significant is limited. We used a bioinformatics approach to define the functional significance of genetic variation in the ABCA1 gene, a cholesterol transporter crucial for the metabolism of high density lipoprotein cholesterol. To predict the functional consequence of each coding single nucleotide polymorphism and mutation in this gene, we calculated a substitution position-specific evolutionary conservation score for each variant, which considers site-specific variation among evolutionarily related proteins. To test the bioinformatics predictions experimentally, we evaluated the biochemical consequence of these sequence variants by examining the ability of cell lines stably transfected with the ABCA1 alleles to elicit cholesterol efflux. Our bioinformatics approach correctly predicted the functional impact of greater than 94% of the naturally occurring variants we assessed. The bioinformatics predictions were significantly correlated with the degree of functional impairment of ABCA1 mutations (r2 = 0.62, p = 0.0008). These results have allowed us to define the impact of genetic variation on ABCA1 function and to suggest that the in silico evolutionary approach we used may be a useful tool in general for predicting the effects of DNA variation on gene function. In addition, our data suggest that considering patterns of positive selection, along with patterns of negative selection such as evolutionary conservation, may improve our ability to predict the functional effects of amino acid variation.


Assuntos
Transportadores de Cassetes de Ligação de ATP/genética , Variação Genética , Mutação de Sentido Incorreto , Polimorfismo de Nucleotídeo Único , Transportador 1 de Cassete de Ligação de ATP , Substituição de Aminoácidos , Linhagem Celular , Colesterol/metabolismo , Sequência Conservada , DNA Complementar/genética , Evolução Molecular , Genoma Humano , Humanos , Fenótipo , Reação em Cadeia da Polimerase Via Transcriptase Reversa
5.
Nucleic Acids Res ; 33(Database issue): D284-8, 2005 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-15608197

RESUMO

PANTHER is a large collection of protein families that have been subdivided into functionally related subfamilies, using human expertise. These subfamilies model the divergence of specific functions within protein families, allowing more accurate association with function (ontology terms and pathways), as well as inference of amino acids important for functional specificity. Hidden Markov models (HMMs) are built for each family and subfamily for classifying additional protein sequences. The latest version, 5.0, contains 6683 protein families, divided into 31,705 subfamilies, covering approximately 90% of mammalian protein-coding genes. PANTHER 5.0 includes a number of significant improvements over previous versions, most notably (i) representation of pathways (primarily signaling pathways) and association with subfamilies and individual protein sequences; (ii) an improved methodology for defining the PANTHER families and subfamilies, and for building the HMMs; (iii) resources for scoring sequences against PANTHER HMMs both over the web and locally; and (iv) a number of new web resources to facilitate analysis of large gene lists, including data generated from high-throughput expression experiments. Efforts are underway to add PANTHER to the InterPro suite of databases, and to make PANTHER consistent with the PIRSF database. PANTHER is now publicly available without restriction at http://panther.appliedbiosystems.com.


Assuntos
Bases de Dados de Proteínas , Proteínas/classificação , Análise de Sequência de Proteína , Animais , Bases de Dados de Proteínas/estatística & dados numéricos , Perfilação da Expressão Gênica , Humanos , Internet , Cadeias de Markov , Camundongos , Proteínas/química , Proteínas/fisiologia , Ratos , Transdução de Sinais , Integração de Sistemas , Interface Usuário-Computador
6.
Nucleic Acids Res ; 31(1): 334-41, 2003 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-12520017

RESUMO

The PANTHER database was designed for high-throughput analysis of protein sequences. One of the key features is a simplified ontology of protein function, which allows browsing of the database by biological functions. Biologist curators have associated the ontology terms with groups of protein sequences rather than individual sequences. Statistical models (Hidden Markov Models, or HMMs) are built from each of these groups. The advantage of this approach is that new sequences can be automatically classified as they become available. To ensure accurate functional classification, HMMs are constructed not only for families, but also for functionally distinct subfamilies. Multiple sequence alignments and phylogenetic trees, including curator-assigned information, are available for each family. The current version of the PANTHER database includes training sequences from all organisms in the GenBank non-redundant protein database, and the HMMs have been used to classify gene products across the entire genomes of human, and Drosophila melanogaster. The ontology terms and protein families and subfamilies, as well as Drosophila gene c;assifications, can be browsed and searched for free. Due to outstanding contractual obligations, access to human gene classifications and to protein family trees and multiple sequence alignments will temporarily require a nominal registration fee. PANTHER is publicly available on the web at http://panther.celera.com.


Assuntos
Bases de Dados de Proteínas , Proteínas/classificação , Proteínas/fisiologia , Animais , Humanos , Filogenia , Estrutura Terciária de Proteína , Proteínas/química , Proteínas/genética , Alinhamento de Sequência
7.
Proc Natl Acad Sci U S A ; 101(43): 15398-403, 2004 Oct 26.
Artigo em Inglês | MEDLINE | ID: mdl-15492219

RESUMO

Most Mendelian diseases studied to date arise from mutations that lead to a single amino acid change in an encoded protein. An increasing number of complex diseases have also been associated with amino acid-changing single-nucleotide polymorphisms (coding SNPs, cSNPs), suggesting potential similarities between Mendelian and complex diseases at the molecular level. Here, we use two different evolutionary analyses to compare Mendelian and complex disease-associated cSNPs. In the first, we estimate the likelihood that a specific amino acid substitution in a protein will affect the protein's function, by using amino acid substitution scores derived from an alignment of related protein sequences and statistics from hidden Markov models. In the second, we use standard Ka/Ks ratios to make comparisons at the gene, rather than the individual amino acid, level. We find that Mendelian disease cSNPs have a very strong tendency to occur at highly conserved amino acid positions in proteins, suggesting that they generally have a severe impact on the function of the protein. Perhaps surprisingly, the distribution of amino acid substitution scores for complex disease cSNPs is dramatically different from the distribution for Mendelian disease cSNPs, and is indistinguishable from the distribution for "normal" human variation. Further, the distributions of Ka/Ks ratios for human and mouse orthologs indicate greater positive selection (or less negative selection) pressure on complex disease-associated genes, on average. These findings suggest that caution should be exercised when using Mendelian disease as a model for complex disease, at least with respect to molecular effects on protein function.


Assuntos
Evolução Biológica , Doenças Genéticas Inatas/genética , Polimorfismo de Nucleotídeo Único , Humanos
8.
Genome Res ; 13(9): 2129-41, 2003 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-12952881

RESUMO

In the genomic era, one of the fundamental goals is to characterize the function of proteins on a large scale. We describe a method, PANTHER, for relating protein sequence relationships to function relationships in a robust and accurate way. PANTHER is composed of two main components: the PANTHER library (PANTHER/LIB) and the PANTHER index (PANTHER/X). PANTHER/LIB is a collection of "books," each representing a protein family as a multiple sequence alignment, a Hidden Markov Model (HMM), and a family tree. Functional divergence within the family is represented by dividing the tree into subtrees based on shared function, and by subtree HMMs. PANTHER/X is an abbreviated ontology for summarizing and navigating molecular functions and biological processes associated with the families and subfamilies. We apply PANTHER to three areas of active research. First, we report the size and sequence diversity of the families and subfamilies, characterizing the relationship between sequence divergence and functional divergence across a wide range of protein families. Second, we use the PANTHER/X ontology to give a high-level representation of gene function across the human and mouse genomes. Third, we use the family HMMs to rank missense single nucleotide polymorphisms (SNPs), on a database-wide scale, according to their likelihood of affecting protein function.


Assuntos
Algoritmos , Bases de Dados de Proteínas , Proteínas/classificação , Proteínas/fisiologia , Sequência de Aminoácidos , Animais , Biologia Computacional/métodos , Biologia Computacional/estatística & dados numéricos , Sistemas de Gerenciamento de Base de Dados , Variação Genética , Humanos , Cadeias de Markov , Camundongos , Mutação de Sentido Incorreto/genética , Biblioteca de Peptídeos , Polimorfismo de Nucleotídeo Único/genética , Valor Preditivo dos Testes , Proteínas/genética , Alinhamento de Sequência , Software , Terminologia como Assunto
9.
Science ; 302(5652): 1960-3, 2003 Dec 12.
Artigo em Inglês | MEDLINE | ID: mdl-14671302

RESUMO

Even though human and chimpanzee gene sequences are nearly 99% identical, sequence comparisons can nevertheless be highly informative in identifying biologically important changes that have occurred since our ancestral lineages diverged. We analyzed alignments of 7645 chimpanzee gene sequences to their human and mouse orthologs. These three-species sequence alignments allowed us to identify genes undergoing natural selection along the human and chimp lineage by fitting models that include parameters specifying rates of synonymous and nonsynonymous nucleotide substitution. This evolutionary approach revealed an informative set of genes with significantly different patterns of substitution on the human lineage compared with the chimpanzee and mouse lineages. Partitions of genes into inferred biological classes identified accelerated evolution in several functional classes, including olfaction and nuclear transport. In addition to suggesting adaptive physiological differences between chimps and humans, human-accelerated genes are significantly more likely to underlie major known Mendelian disorders.


Assuntos
Evolução Molecular , Genoma Humano , Genoma , Pan troglodytes/genética , Seleção Genética , Transporte Ativo do Núcleo Celular/genética , Aminoácidos/metabolismo , Animais , Evolução Biológica , Biologia Computacional , Feminino , Genes , Doenças Genéticas Inatas/genética , Humanos , Funções Verossimilhança , Masculino , Camundongos/genética , Modelos Genéticos , Modelos Estatísticos , Mutação , Filogenia , Proteínas/química , Proteínas/genética , Pseudogenes , Receptores Odorantes/genética , Alinhamento de Sequência , Homologia de Sequência do Ácido Nucleico , Transdução de Sinais/genética , Olfato/genética , Especificidade da Espécie
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA