Results 1 - 8 of 8
1.
Nucleic Acids Res ; 35(Database issue): D224-8, 2007 Jan.
Article in English | MEDLINE | ID: mdl-17202162

ABSTRACT

InterPro is an integrated resource for protein families, domains and functional sites, which integrates the following protein signature databases: PROSITE, PRINTS, ProDom, Pfam, SMART, TIGRFAMs, PIRSF, SUPERFAMILY, Gene3D and PANTHER. The latter two new member databases have been integrated since the last publication in this journal. There have been several new developments in InterPro, including an additional reading field, new database links, extensions to the web interface and additional match XML files. InterPro has always provided matches to UniProtKB proteins on the website and in the match XML file on the FTP site. Additional matches to proteins in UniParc (UniProt archive) are now available for download in the new match XML files only. The latest InterPro release (13.0) contains more than 13 000 entries, covering over 78% of all proteins in UniProtKB. The database is available for text- and sequence-based searches via a webserver (http://www.ebi.ac.uk/interpro), and for download by anonymous FTP (ftp://ftp.ebi.ac.uk/pub/databases/interpro). The InterProScan search tool is now also available via a web service at http://www.ebi.ac.uk/Tools/webservices/WSInterProScan.html.


Subject(s)
Databases, Protein , Internet , Protein Structure, Tertiary , Proteins/chemistry , Proteins/classification , Proteins/physiology , Sequence Analysis, Protein , Systems Integration , User-Computer Interface
2.
Nucleic Acids Res ; 33(Database issue): D201-5, 2005 Jan 01.
Article in English | MEDLINE | ID: mdl-15608177

ABSTRACT

InterPro, an integrated documentation resource of protein families, domains and functional sites, was created to integrate the major protein signature databases. Currently, it includes PROSITE, Pfam, PRINTS, ProDom, SMART, TIGRFAMs, PIRSF and SUPERFAMILY. Signatures are manually integrated into InterPro entries that are curated to provide biological and functional information. Annotation is provided in an abstract, Gene Ontology mapping and links to specialized databases. New features of InterPro include extended protein match views, taxonomic range information and protein 3D structure data. One of the new match views is the InterPro Domain Architecture view, which shows the domain composition of protein matches. Two new entry types were introduced to better describe InterPro entries: these are active site and binding site. PIRSF and the structure-based SUPERFAMILY are the latest member databases to join InterPro, and CATH and PANTHER are soon to be integrated. InterPro release 8.0 contains 11 007 entries, representing 2573 domains, 8166 families, 201 repeats, 26 active sites, 21 binding sites and 20 post-translational modification sites. InterPro covers over 78% of all proteins in the Swiss-Prot and TrEMBL components of UniProt. The database is available for text- and sequence-based searches via a webserver (http://www.ebi.ac.uk/interpro), and for download by anonymous FTP (ftp://ftp.ebi.ac.uk/pub/databases/interpro).
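
The release 8.0 figures quoted in this abstract can be cross-checked with simple arithmetic: the six entry-type counts should add up to the stated total of 11 007 entries. A minimal sketch of that check follows; the dictionary name and keys are illustrative labels, not InterPro identifiers.

```python
# Entry-type counts for InterPro release 8.0, as quoted in the abstract.
# The variable name and key spellings are illustrative only.
release_8_counts = {
    "domains": 2573,
    "families": 8166,
    "repeats": 201,
    "active sites": 26,
    "binding sites": 21,
    "post-translational modification sites": 20,
}

stated_total = 11007  # "11 007 entries" in the abstract
total = sum(release_8_counts.values())

# True when the per-type counts account for every entry in the release.
counts_are_consistent = total == stated_total
print(total, counts_are_consistent)
```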


Subject(s)
Databases, Protein , Proteins/chemistry , Proteins/classification , Sequence Analysis, Protein , Databases, Protein/trends , Humans , Protein Structure, Tertiary , Sequence Alignment , Systems Integration
3.
Nucleic Acids Res ; 32(Database issue): D434-7, 2004 Jan 01.
Article in English | MEDLINE | ID: mdl-14681451

ABSTRACT

IntEnz is the name for the Integrated relational Enzyme database and is the official version of the Enzyme Nomenclature. The Enzyme Nomenclature comprises recommendations of the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB) on the nomenclature and classification of enzyme-catalysed reactions. IntEnz is supported by NC-IUBMB and contains enzyme data curated and approved by this committee. IntEnz is available at http://www.ebi.ac.uk/intenz.


Subject(s)
Databases, Protein , Enzymes/classification , Enzymes/metabolism , Terminology as Topic , Animals , Biochemical Phenomena , Biochemistry , Catalysis , Humans , Information Storage and Retrieval , Internet , Molecular Biology , Substrate Specificity
4.
Nucleic Acids Res ; 31(1): 414-7, 2003 Jan 01.
Article in English | MEDLINE | ID: mdl-12520037

ABSTRACT

The Proteome Analysis database (http://www.ebi.ac.uk/proteome/) has been developed by the Sequence Database Group at EBI, utilizing existing resources to provide comparative analysis of the predicted protein coding sequences of the complete genomes of bacteria, archaea and eukaryotes. Three main projects, InterPro, CluSTr and GO Slim, are used to give an overview of the families, domains, sites and functions of the proteins from each of the complete genomes. Complete proteome analysis is available for a total of 89 proteome sets. A specifically designed application enables InterPro proteome comparisons of any one proteome against one or more of the other proteomes in the database.


Subject(s)
Databases, Protein , Proteome/chemistry , Animals , Archaeal Proteins/chemistry , Bacterial Proteins/chemistry , Data Interpretation, Statistical , Humans , Mice , Proteins/chemistry , Proteins/classification , Proteins/physiology , Proteome/physiology , Sequence Analysis, Protein , Sequence Homology, Amino Acid
5.
Nucleic Acids Res ; 31(1): 315-8, 2003 Jan 01.
Article in English | MEDLINE | ID: mdl-12520011

ABSTRACT

InterPro, an integrated documentation resource of protein families, domains and functional sites, was created in 1999 as a means of amalgamating the major protein signature databases into one comprehensive resource. PROSITE, Pfam, PRINTS, ProDom, SMART and TIGRFAMs have been manually integrated and curated and are available in InterPro for text- and sequence-based searching. The results are provided in a single format that rationalises the results that would be obtained by searching the member databases individually. The latest release of InterPro contains 5629 entries describing 4280 families, 1239 domains, 95 repeats and 15 post-translational modifications. Currently, the combined signatures in InterPro cover more than 74% of all proteins in SWISS-PROT and TrEMBL, an increase of nearly 15% since the inception of InterPro. New features of the database include improved searching capabilities and enhanced graphical user interfaces for visualisation of the data. The database is available via a webserver (http://www.ebi.ac.uk/interpro) and anonymous FTP (ftp://ftp.ebi.ac.uk/pub/databases/interpro).


Subject(s)
Databases, Protein , Proteins/chemistry , Animals , Computer Graphics , Protein Processing, Post-Translational , Protein Structure, Tertiary , Proteins/genetics , Proteins/metabolism , Repetitive Sequences, Amino Acid , User-Computer Interface
6.
Bioinformatics ; 20(17): 3236-7, 2004 Nov 22.
Article in English | MEDLINE | ID: mdl-15044231

ABSTRACT

UniProt Archive (UniParc) is the most comprehensive, non-redundant protein sequence database available. Its protein sequences are retrieved from predominant, publicly accessible resources. All new and updated protein sequences are collected and loaded daily into UniParc for full coverage. To avoid redundancy, each unique sequence is stored only once with a stable protein identifier, which can be used later in UniParc to identify the same protein in all source databases. When proteins are loaded into the database, database cross-references are created to link them to the origins of the sequences. As a result, performing a sequence search against UniParc is equivalent to performing the same search against all databases cross-referenced by UniParc. UniParc contains only protein sequences and database cross-references; all other information must be retrieved from the source databases.
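
The storage scheme described in this abstract (each unique sequence stored once under a stable identifier, with cross-references back to every source record) can be illustrated with a small sketch. Everything here is an assumption made for illustration: the class name `SequenceArchive`, the `SEQ000001` identifier format, the SHA-256 checksum used to detect identical sequences, and the source database names are hypothetical and are not taken from UniParc itself.

```python
import hashlib


class SequenceArchive:
    """Toy non-redundant sequence archive (illustrative, not UniParc's code)."""

    def __init__(self):
        self._id_by_checksum = {}  # sequence checksum -> stable identifier
        self._xrefs = {}           # stable identifier -> {(source_db, accession)}

    def load(self, sequence, source_db, accession):
        """Store a sequence once and record where it came from."""
        checksum = hashlib.sha256(sequence.upper().encode("ascii")).hexdigest()
        if checksum not in self._id_by_checksum:
            # First time this sequence is seen: assign a new stable identifier.
            # The "SEQ" + counter format is a made-up placeholder.
            new_id = f"SEQ{len(self._id_by_checksum) + 1:06d}"
            self._id_by_checksum[checksum] = new_id
            self._xrefs[new_id] = set()
        stable_id = self._id_by_checksum[checksum]
        # Always add a cross-reference linking back to the source record.
        self._xrefs[stable_id].add((source_db, accession))
        return stable_id

    def cross_references(self, stable_id):
        """Return all source records that carry this sequence."""
        return sorted(self._xrefs[stable_id])


archive = SequenceArchive()
id_a = archive.load("MKTAYIAK", "DB_A", "A0001")  # hypothetical source records
id_b = archive.load("MKTAYIAK", "DB_B", "B0042")  # same sequence, other source
id_c = archive.load("MSTNPKPQ", "DB_A", "A0002")  # a different sequence
```

In this sketch the two loads of the same sequence return the same identifier, and `archive.cross_references(id_a)` lists both source records, which is why a search against the archive stands in for a search against each cross-referenced source.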


Subject(s)
Database Management Systems , Databases, Protein , Documentation/methods , Information Storage and Retrieval/methods , Internet , Proteins/chemistry , Sequence Analysis, Protein/methods , Amino Acid Sequence , Computer Communication Networks , Information Dissemination/methods , Molecular Sequence Data , Proteins/classification , Systems Integration
7.
Genome Res ; 13(4): 662-72, 2003 Apr.
Article in English | MEDLINE | ID: mdl-12654719

ABSTRACT

Gene Ontology Annotation (GOA) is a project run by the European Bioinformatics Institute (EBI) that aims to provide assignments of terms from the Gene Ontology (GO) resource to gene products in a number of its databases (http://www.ebi.ac.uk/GOA). In the first stage of this project, GO assignments have been applied to a data set representing the complete human proteome by a combination of electronic mappings and manual curation. This vocabulary has also been applied to the nonredundant proteome sets for all other completely sequenced organisms as well as to proteins from a wide range of organisms where the proteome is not yet complete.


Subject(s)
Computational Biology/methods , Databases, Protein/classification , Genomics , Proteomics , Vocabulary, Controlled , Computational Biology/trends , Database Management Systems/trends , Databases, Protein/trends , Genome, Human , Genomics/trends , Humans , Proteome/classification , Proteome/genetics , Proteomics/trends
8.
Brief Bioinform ; 3(3): 225-35, 2002 Sep.
Article in English | MEDLINE | ID: mdl-12230031

ABSTRACT

The exponential increase in the submission of nucleotide sequences to the nucleotide sequence database by genome sequencing centres has resulted in a need for rapid, automatic methods for classification of the resulting protein sequences. There are several signature and sequence cluster-based methods for protein classification, each resource having distinct areas of optimum application owing to the differences in the underlying analysis methods. In recognition of this, InterPro was developed as an integrated documentation resource for protein families, domains and functional sites, to rationalise the complementary efforts of the individual protein signature database projects. The member databases - PRINTS, PROSITE, Pfam, ProDom, SMART and TIGRFAMs - form the InterPro core. Related signatures from each member database are unified into single InterPro entries. Each InterPro entry includes a unique accession number, functional descriptions and literature references, and links are made back to the relevant member database(s). Release 4.0 of InterPro (November 2001) contains 4,691 entries, representing 3,532 families, 1,068 domains, 74 repeats and 15 sites of post-translational modification (PTMs) encoded by different regular expressions, profiles, fingerprints and hidden Markov models (HMMs). Each InterPro entry lists all the matches against SWISS-PROT and TrEMBL (2,141,621 InterPro hits from 586,124 SWISS-PROT and TrEMBL protein sequences). The database is freely accessible for text- and sequence-based searches.


Subject(s)
Computational Biology , Databases, Protein , Proteins , Algorithms , Humans , Information Services , Internet , Proteins/chemistry , Proteins/classification , Software