Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 17 de 17
Filtrar
1.
Bioinformatics ; 36(17): 4643-4648, 2020 11 01.
Artículo en Inglés | MEDLINE | ID: mdl-32399560

RESUMEN

MOTIVATION: The number of protein records in the UniProt Knowledgebase (UniProtKB: https://www.uniprot.org) continues to grow rapidly as a result of genome sequencing and the prediction of protein-coding genes. Providing functional annotation for these proteins presents a significant and continuing challenge. RESULTS: In response to this challenge, UniProt has developed a method of annotation, known as UniRule, based on expertly curated rules, which integrates related systems (RuleBase, HAMAP, PIRSR, PIRNR) developed by the members of the UniProt consortium. UniRule uses protein family signatures from InterPro, combined with taxonomic and other constraints, to select sets of reviewed proteins which have common functional properties supported by experimental evidence. This annotation is propagated to unreviewed records in UniProtKB that meet the same selection criteria, most of which do not have (and are never likely to have) experimentally verified functional annotation. Release 2020_01 of UniProtKB contains 6496 UniRule rules which provide annotation for 53 million proteins, accounting for 30% of the 178 million records in UniProtKB. UniRule provides scalable enrichment of annotation in UniProtKB. AVAILABILITY AND IMPLEMENTATION: UniRule rules are integrated into UniProtKB and can be viewed at https://www.uniprot.org/unirule/. UniRule rules and the code required to run the rules, are publicly available for researchers who wish to annotate their own sequences. The implementation used to run the rules is known as UniFIRE and is available at https://gitlab.ebi.ac.uk/uniprot-public/unifire.


Asunto(s)
Bases del Conocimiento , Proteínas , Mapeo Cromosómico , Bases de Datos de Proteínas , Anotación de Secuencia Molecular , Proteínas/genética
3.
Nat Prod Rep ; 30(1): 108-60, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-23165928

RESUMEN

This review presents recommended nomenclature for the biosynthesis of ribosomally synthesized and post-translationally modified peptides (RiPPs), a rapidly growing class of natural products. The current knowledge regarding the biosynthesis of the >20 distinct compound classes is also reviewed, and commonalities are discussed.


Asunto(s)
Productos Biológicos , Péptidos , Ribosomas/metabolismo , Secuencia de Aminoácidos , Productos Biológicos/síntesis química , Productos Biológicos/química , Productos Biológicos/clasificación , Productos Biológicos/farmacología , Humanos , Datos de Secuencia Molecular , Estructura Molecular , Péptidos/síntesis química , Péptidos/química , Péptidos/clasificación , Péptidos/farmacología , Procesamiento Proteico-Postraduccional , Ribosomas/genética
4.
Database (Oxford) ; 20192019 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-30805646

RESUMEN

Methods focused on predicting 'global' annotations for proteins (such as molecular function, biological process and presence of domains or membership in a family) have reached a relatively mature stage. Methods to provide fine-grained 'local' annotation of functional sites (at the level of individual amino acid) are now coming to the forefront, especially in light of the rapid accumulation of genetic variant data. We have developed a computational method and workflow that predicts functional sites within proteins using position-specific conditional template annotation rules (namely PIR Site Rules or PIRSRs for short). Such rules are curated through review of known protein structural and other experimental data by structural biologists and are used to generate high-quality annotations for the UniProt Knowledgebase (UniProtKB) unreviewed section. To share the PIRSR functional site prediction method with the broader scientific community, we have streamlined our workflow and developed a stand-alone Java software package named PIRSitePredict. We demonstrate the use of PIRSitePredict for functional annotation of de novo assembled genome/transcriptome by annotating uncharacterized proteins from Trinity RNA-seq assembly of embryonic transcriptomes of the following three cartilaginous fishes: Leucoraja erinacea (Little Skate), Scyliorhinus canicula (Small-spotted Catshark) and Callorhinchus milii (Elephant Shark). On average about 1200 lines of annotations were predicted for each species.


Asunto(s)
Bases de Datos de Proteínas , Anotación de Secuencia Molecular , Secuencia de Aminoácidos , Animales , Embrión no Mamífero/metabolismo , Peces/embriología , Peces/genética , Genoma , Programas Informáticos , Transcriptoma/genética
5.
OMICS ; 10(2): 145-51, 2006.
Artículo en Inglés | MEDLINE | ID: mdl-16901219

RESUMEN

This article describes the origins, working practices and various development projects of the HUman Proteome Organisation's Proteomics Standards Initiative (HUPO PSI), specifically, our work on reporting requirements, data exchange formats and controlled vocabulary terms. We also offer our view of the two functional genomics projects in which the PSI plays a role (FuGE and FuGO), discussing their impact on our process and laying out the benefits we see as accruing, both to the PSI and to biomedical science as a whole as a result of their widespread acceptance.


Asunto(s)
Proteoma/normas , Proteómica/estadística & datos numéricos , Proteómica/normas , Humanos , Proteómica/instrumentación , Vocabulario Controlado
7.
Nucleic Acids Res ; 31(1): 499-501, 2003 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-12520062

RESUMEN

The RESID Database is a comprehensive collection of annotations and structures for protein pre-, co- and post-translational modifications including amino-terminal, carboxyl-terminal and peptide chain cross-link modifications. The RESID Database includes: systematic and alternate names, atomic formulas and masses, enzyme activities generating the modifications, keywords, literature citations, Gene Ontology cross-references, Protein Information Resource (PIR) and SWISS-PROT protein sequence database feature table annotations, structure diagrams and molecular models. This database is freely accessible on the Internet through the European Bioinformatics Institute at http://srs.ebi.ac.uk/srs6bin/cgi-bin/wgetz?-page+LibInfo+-lib+RESID, through the National Cancer Institute - Frederick Advanced Biomedical Computing Center at http://www.ncifcrf.gov/RESID, or through the Protein Information Resource at http://pir.georgetown.edu/pirwww/dbinfo/resid.html.


Asunto(s)
Bases de Datos de Proteínas , Proteínas/metabolismo , Animales , Regulación de la Expresión Génica , Almacenamiento y Recuperación de la Información , Proteínas/química , Proteínas/genética
8.
Nucleic Acids Res ; 32(Web Server issue): W340-5, 2004 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-15215407

RESUMEN

ProSight PTM (https://prosightptm.scs.uiuc.edu/) is a web application for identification and characterization of proteins using mass spectra data from 'top-down' fragmentation of intact protein ions (i.e. without any tryptic digestion). ProSight PTM has many tools and graphical features to facilitate analysis of single proteins, proteins in mixtures and proteins fragmented in parallel. Sequence databases from across the phylogenetic tree are supported, with a new database strategy of 'shotgun annotation' used to assist characterization of wild-type proteins. During a database search, data from divergent sources regarding potential mass differences such as polymorphisms, alternate splicing and post-translational modifications are utilized. The user can optionally control how much of this biological variability should be searched.


Asunto(s)
Espectrometría de Masas , Proteínas/química , Programas Informáticos , Empalme Alternativo , Bases de Datos de Proteínas , Humanos , Internet , Polimorfismo Genético , Procesamiento Proteico-Postraduccional , Proteínas/análisis , Proteínas/genética , Proteínas Ribosómicas/química , Análisis de Secuencia de Proteína , Lugares Marcados de Secuencia , Interfaz Usuario-Computador
9.
In Silico Biol ; 7(2 Suppl): S45-56, 2007.
Artículo en Inglés | MEDLINE | ID: mdl-17822390

RESUMEN

A structural diagram, in the form of a two-dimensional (2-D) sketch, remains the most effective portrait of a "small molecule" or chemical reaction. However, such structural diagrams, as for any other core data, cannot be used in speech (and should not be used in free text). "Good annotation practice" for biological databases is to use either consistent and widely recognised terminology or unique identifiers from a dedicated database to refer to the molecule of interest. Ideally, scientists should use terminology that is both pronounceable and meaningful. Thus, a viable solution for a bioinformatician is to use a definitive controlled vocabulary of biochemical compounds and reactions, which contains both systematic and common names. In addition, chemical ontologies provide a means for placing entities of interest into wider chemical, biological or medical contexts. We present some challenges and achievements in the standardisation of chemical language in biological databases, with emphasis on three aspects of annotation: 1. good drawing practice: how to draw unambiguous 2-D diagrams; 2. good naming practice: how to give most appropriate names; and 3. good ontology practice: how to link the entity of interest by defined logical relationships to other entities.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Factuales/normas , Modelos Químicos , Terminología como Asunto , Modelos Moleculares , Conformación Molecular , Vocabulario Controlado
10.
Proteomics ; 6(3): 738-41, 2006 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-16411267

RESUMEN

The autumn workshop of the Proteomics Standards Initiative of the Human Proteomics Organisation met to further advance the development of the existing standards in the fields of molecular interactions and mass spectrometry. In addition, new areas were addressed, in particular developing standards for the description and exchange of data from gel electrophoresis experiments. The General Proteomics Standards group is now working closely with the FuGE (Functional Genomics Experiment) efforts to define a general standard in which to encode data that will enable a systems biology approach to data analysis. Common to all these efforts is the field of protein modifications, and work has been initiated to establish an ontology in this field that can be used by both workers in the field of proteomics and the wider scientific community.


Asunto(s)
Espectrometría de Masas , Proteómica/normas , Bases de Datos de Proteínas/normas , Humanos
11.
Proteomics ; 6(16): 4439-43, 2006 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-16897683

RESUMEN

The theme of the third annual Spring workshop of the HUPO-PSI was "proteomics and beyond" and its underlying goal was to reach beyond the boundaries of the proteomics community to interact with groups working on the similar issues of developing interchange standards and minimal reporting requirements. Significant developments in many of the HUPO-PSI XML interchange formats, minimal reporting requirements and accompanying controlled vocabularies were reported, with many of these now feeding into the broader efforts of the Functional Genomics Experiment (FuGE) data model and Functional Genomics Ontology (FuGO) ontologies.


Asunto(s)
Proteómica/normas , Animales , Bases de Datos de Proteínas/normas , Humanos , Mapeo de Interacción de Proteínas , Publicaciones/normas , Proyectos de Investigación/normas , Terminología como Asunto
12.
Proteomics ; 4(6): 1527-33, 2004 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-15174122

RESUMEN

The RESID Database of Protein Modifications is a comprehensive collection of annotations and structures for protein modifications and cross-links including pre-, co-, and post-translational modifications. The database provides: systematic and alternate names, atomic formulas and masses, enzymatic activities that generate the modifications, keywords, literature citations, Gene Ontology (GO) cross-references, protein sequence database feature table annotations, structure diagrams, and molecular models. This database is freely accessible on the Internet through resources provided by the European Bioinformatics Institute (http://www.ebi.ac.uk/RESID), and by the National Cancer Institute--Frederick Advanced Biomedical Computing Center (http://www.ncifcrf.gov/RESID). Each RESID Database entry presents a chemically unique modification and shows how that modification is currently annotated in the protein sequence databases, Swiss-Prot and the Protein Information Resource (PIR). The RESID Database provides a table of corresponding equivalent feature annotations that is used in the UniProt project, an international effort to combine the resources of the Swiss-Prot, TrEMBL and PIR. As an annotation tool, the RESID Database is used in standardizing and enhancing modification descriptions in the feature tables of Swiss-Prot entries. As an Internet resource, the RESID Database assists researchers in high-throughput proteomics to search monoisotopic masses and mass differences and identify known and predicted protein modifications.


Asunto(s)
Bases de Datos Factuales , Proteínas/metabolismo , Internet , Proteínas/química , Proteínas/genética , Terminología como Asunto
13.
J Bacteriol ; 184(21): 5898-902, 2002 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-12374823

RESUMEN

The hybrid cluster protein (HCP; formerly termed the prismane protein) has been extensively studied due to its unique spectroscopic properties. Although the structural and spectroscopic characteristics are well defined, its enzymatic function, up to this point, has remained unidentified. While it was proposed that HCP acts in some step of nitrogen metabolism, a specific role for this enzyme remained unknown. Recent studies of HCP purified from Escherichia coli have identified a novel hydroxylamine reductase activity. These data reveal the ability of HCP to reduce hydroxylamine in vitro to form NH(3) and H(2)O. Further biochemical analyses were completed in order to determine the effects of various electron donors, different pH levels, and the presence of CN(-) on in vitro hydroxylamine reduction.


Asunto(s)
Proteínas Bacterianas/metabolismo , Escherichia coli/enzimología , Proteínas Hierro-Azufre/metabolismo , Oxidorreductasas/metabolismo , Absorción , Monóxido de Carbono , Cianuros , Hidroxilamina/metabolismo , Oxígeno , Espectrometría de Fluorescencia/métodos
14.
Proteomics ; 4(6): 1537-50, 2004 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-15174124

RESUMEN

High-throughput proteomic studies produce a wealth of new information regarding post-translational modifications (PTMs). The Swiss-Prot knowledge base is faced with the challenge of including this information in a consistent and structured way, in order to facilitate easy retrieval and promote understanding by biologist expert users as well as computer programs. We are therefore standardizing the annotation of PTM features represented in Swiss-Prot. Indeed, a controlled vocabulary has been associated with every described PTM. In this paper, we present the major update of the feature annotation, and, by showing a few examples, explain how the annotation is implemented and what it means. Mod-Prot, a future companion database of Swiss-Prot, devoted to the biological aspects of PTMs (i.e., general description of the process, identity of the modification enzyme(s), taxonomic range, mass modification) is briefly described. Finally we encourage once again the scientific community (i.e., both individual researchers and database maintainers) to interact with us, so that we can continuously enhance the quality and swiftness of our services.


Asunto(s)
Bases de Datos de Proteínas , Procesamiento Proteico-Postraduccional , Biología Computacional , Bases de Datos de Proteínas/normas , Predicción , Sistemas de Información , Análisis de Secuencia de Proteína , Integración de Sistemas
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA