Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 94
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Nucleic Acids Res ; 48(D1): D261-D264, 2020 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-31410491

RESUMEN

The ABCD (for AntiBodies Chemically Defined) database is a repository of sequenced antibodies, integrating curated information about the antibody and its antigen with cross-links to standardized databases of chemical and protein entities. It is freely available to the academic community, accessible through the ExPASy server (https://web.expasy.org/abcd/). The ABCD database aims at helping to improve reproducibility in academic research by providing a unique, unambiguous identifier associated to each antibody sequence. It also allows to determine rapidly if a sequenced antibody is available for a given antigen.


Asunto(s)
Anticuerpos/química , Bases de Datos de Proteínas , Secuencia de Aminoácidos , Anticuerpos/inmunología , Antígenos/química , Antígenos/inmunología
2.
Nucleic Acids Res ; 48(D1): D328-D334, 2020 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-31724716

RESUMEN

The neXtProt knowledgebase (https://www.nextprot.org) is an integrative resource providing both data on human protein and the tools to explore these. In order to provide comprehensive and up-to-date data, we evaluate and add new data sets. We describe the incorporation of three new data sets that provide expression, function, protein-protein binary interaction, post-translational modifications (PTM) and variant information. New SPARQL query examples illustrating uses of the new data were added. neXtProt has continued to develop tools for proteomics. We have improved the peptide uniqueness checker and have implemented a new protein digestion tool. Together, these tools make it possible to determine which proteases can be used to identify trypsin-resistant proteins by mass spectrometry. In terms of usability, we have finished revamping our web interface and completely rewritten our API. Our SPARQL endpoint now supports federated queries. All the neXtProt data are available via our user interface, API, SPARQL endpoint and FTP site, including the new PEFF 1.0 format files. Finally, the data on our FTP site is now CC BY 4.0 to promote its reuse.


Asunto(s)
Bases de Datos de Proteínas , Bases del Conocimiento , Humanos , Internet , Espectrometría de Masas , Péptidos/química , Proteínas Quinasas/química , Proteínas Quinasas/metabolismo , Procesamiento Proteico-Postraduccional , Proteínas/química , Proteínas/genética , Proteínas/metabolismo , Análisis de Secuencia de ARN , Programas Informáticos , Tripsina , Interfaz Usuario-Computador
3.
Bioinformatics ; 36(10): 3244-3245, 2020 05 01.
Artículo en Inglés | MEDLINE | ID: mdl-31985787

RESUMEN

SUMMARY: The Feature-Viewer is a lightweight library for the visualization of biological data mapped to a protein or nucleotide sequence. It is designed for ease of use while allowing for a full customization. The library is already used by several biological data resources and allows intuitive visual mapping of a full spectra of sequence features for different usages. AVAILABILITY AND IMPLEMENTATION: The Feature-Viewer is open source, compatible with state-of-the-art development technologies and responsive, also for mobile viewing. Documentation and usage examples are available online.


Asunto(s)
Computadores , Programas Informáticos
4.
Int J Cancer ; 146(5): 1299-1306, 2020 03 01.
Artículo en Inglés | MEDLINE | ID: mdl-31444973

RESUMEN

Despite an increased awareness of the problematic of cell line cross-contamination and misidentification, it remains nowadays a major source of erroneous experimental results in biomedical research. To prevent it, researchers are expected to frequently test the authenticity of the cell lines they are working on. STR profiling was selected as the international reference method to perform cell line authentication. While the experimental protocols and manipulations for generating a STR profile are well described, the available tools and workflows to analyze such data are lacking. The Cellosaurus knowledge resource aimed to improve the situation by compiling all the publicly available STR profiles from the literature and other databases. As a result, it grew to become the largest database in terms of human STR profiles, with 6,474 distinct cell lines having an associated STR profile (release July 31, 2019). Here we present CLASTR, the Cellosaurus STR similarity search tool enabling users to compare one or more STR profiles with those available in the Cellosaurus cell line knowledge resource. It aims to help researchers in the process of cell line authentication by providing numerous functionalities. The tool is publicly accessible on the SIB ExPASy server (https://web.expasy.org/cellosaurus-str-search) and its source code is available on GitHub under the GPL-3.0 license.


Asunto(s)
Autenticación de Línea Celular/métodos , Minería de Datos/métodos , Repeticiones de Microsatélite/genética , Animales , Biomarcadores/análisis , Línea Celular , Dermatoglifia del ADN , Bases de Datos Factuales , Perros , Humanos , Ratones , Programas Informáticos
5.
Hum Genomics ; 12(1): 36, 2018 07 11.
Artículo en Inglés | MEDLINE | ID: mdl-29996917

RESUMEN

BACKGROUND: Germline pathogenic variants in the breast cancer type 1 susceptibility gene BRCA1 are associated with a 60% lifetime risk for breast and ovarian cancer. This overall risk estimate is for all BRCA1 variants; obviously, not all variants confer the same risk of developing a disease. In cancer patients, loss of BRCA1 function in tumor tissue has been associated with an increased sensitivity to platinum agents and to poly-(ADP-ribose) polymerase (PARP) inhibitors. For clinical management of both at-risk individuals and cancer patients, it would be important that each identified genetic variant be associated with clinical significance. Unfortunately for the vast majority of variants, the clinical impact is unknown. The availability of results from studies assessing the impact of variants on protein function may provide insight of crucial importance. RESULTS AND CONCLUSION: We have collected, curated, and structured the molecular and cellular phenotypic impact of 3654 distinct BRCA1 variants. The data was modeled in triple format, using the variant as a subject, the studied function as the object, and a predicate describing the relation between the two. Each annotation is supported by a fully traceable evidence. The data was captured using standard ontologies to ensure consistency, and enhance searchability and interoperability. We have assessed the extent to which functional defects at the molecular and cellular levels correlate with the clinical interpretation of variants by ClinVar submitters. Approximately 30% of the ClinVar BRCA1 missense variants have some molecular or cellular assay available in the literature. Pathogenic variants (as assigned by ClinVar) have at least some significant functional defect in 94% of testable cases. For benign variants, 77% of ClinVar benign variants, for which neXtProt Cancer variant portal has data, shows either no or mild experimental functional defects. While this does not provide evidence for clinical interpretation of variants, it may provide some guidance for variants of unknown significance, in the absence of more reliable data. The neXtProt Cancer variant portal ( https://www.nextprot.org/portals/breast-cancer ) contains over 6300 observations at the molecular and/or cellular level for BRCA1 variants.


Asunto(s)
Proteína BRCA1/genética , Neoplasias de la Mama/genética , Predisposición Genética a la Enfermedad , Neoplasias Ováricas/genética , Adulto , Anciano , Proteína BRCA1/química , Neoplasias de la Mama/patología , Biología Computacional , Femenino , Variación Genética , Mutación de Línea Germinal/genética , Humanos , Persona de Mediana Edad , Neoplasias Ováricas/patología , Conformación Proteica
6.
Chem Res Toxicol ; 32(9): 1733-1736, 2019 09 16.
Artículo en Inglés | MEDLINE | ID: mdl-31203605

RESUMEN

Research in toxicology relies on in vitro models such as cell lines. These living models are prone to change and may be described in publications with insufficient information or quality control testing. This article sets out recommendations to improve the reliability of cell-based research.


Asunto(s)
Técnicas de Cultivo de Célula/normas , Línea Celular , Modelos Biológicos , Animales , Autenticación de Línea Celular , Humanos , Control de Calidad , Reproducibilidad de los Resultados , Toxicología/métodos , Toxicología/normas
7.
Nucleic Acids Res ; 45(D1): D177-D182, 2017 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-27899619

RESUMEN

The neXtProt human protein knowledgebase (https://www.nextprot.org) continues to add new content and tools, with a focus on proteomics and genetic variation data. neXtProt now has proteomics data for over 85% of the human proteins, as well as new tools tailored to the proteomics community.Moreover, the neXtProt release 2016-08-25 includes over 8000 phenotypic observations for over 4000 variations in a number of genes involved in hereditary cancers and channelopathies. These changes are presented in the current neXtProt update. All of the neXtProt data are available via our user interface and FTP site. We also provide an API access and a SPARQL endpoint for more technical applications.


Asunto(s)
Bases de Datos de Proteínas , Proteómica , Estudios de Asociación Genética , Variación Genética , Humanos , Internet , Fenotipo , Proteómica/métodos , Programas Informáticos , Navegador Web
8.
J Proteome Res ; 17(12): 4160-4170, 2018 12 07.
Artículo en Inglés | MEDLINE | ID: mdl-30175587

RESUMEN

The practice of data sharing in the proteomics field took off and quickly spread in recent years as a result of collective effort. Nowadays, most journal editors mandate the submission of the original raw mass spectra to one of the databases of the ProteomeXchange consortium. With the exception of large institutional initiatives such as PeptideAtlas or the GPMDB, few new studies are however based on the reanalysis of mass spectrometry data. A wealth of information is thus left unexploited in public databases and repositories. Here, we present the large-scale reanalysis of 41 publicly available data sets corresponding to experiments carried out on the HeLa cancer cell line using a custom workflow. In addition to the search of new post-translational modification sites and "missing proteins", our main goal is to identify single amino acid variants and evaluate their impact on protein expression and stability through the spectral counting quantification approach. The X!Tandem software was selected to perform the search of a total of 56 363 701 tandem mass spectra against a customized variant protein database, compiled by the application of the in-house MzVar tool on HeLa-specific somatic and genomic variants retrieved from the COSMIC cell line project. After filtering the resulting identifications with a 1% FDR threshold computed at the protein level, 49 466 unique peptides were identified in 7266 protein entries, allowing the validation of 5576 protein entries in accordance with the HPP guidelines version 2.1. A new "missing protein" was observed (FRAT2, NX_O75474, chromosome 10), and 189 new phosphorylation and 392 new protein N-terminal acetylation sites could be identified. Twenty-four variant peptides were also identified, corresponding to 21 variants in 21 proteins. For three of the nine heterozygous cases where both the variant peptide and its wild-type counterpart were detected, the application of a two-tailed sign test showed a significant difference in the abundance of the two peptide versions.


Asunto(s)
Bases de Datos de Proteínas , Variación Genética , Procesamiento Proteico-Postraduccional , Proteoma/análisis , Acetilación , Secuencia de Aminoácidos , Línea Celular Tumoral , Células HeLa , Humanos , Fosforilación , Proteómica/métodos , Programas Informáticos
9.
J Proteome Res ; 17(12): 4211-4226, 2018 12 07.
Artículo en Inglés | MEDLINE | ID: mdl-30191714

RESUMEN

20,230 protein-coding genes have been predicted from the analysis of the human genome (neXtProt release 2018-01-17), and about 10% of them are still lacking functional annotation, either predicted by bioinformatics tools or captured from experimental reports. A systematic exploration of the available literature on uncharacterized human genes/proteins led to proposal of functional annotations for 113 proteins and to consolidation of a list of 1,862 uncharacterized human proteins. The advanced search functionality of neXtProt was used extensively in order to examine the landscape of the uncharacterized human proteome in terms of subcellular locations, protein-protein interactions, tissue expression, association with diseases, and 3D structure. Finally, a deep data mining in various publicly available resources allowed building functional hypotheses for 26 uncharacterized human proteins validated at protein level (uPE1). These hypotheses cover the fields of cilia biology, male reproduction, metabolism, nervous system, immunity, inflammation, RNA metabolism, and chromatin biology. They will require experimental validation before they can be considered for annotation. Despite technological progresses, the pace of human protein characterization studies is still slow. It could be accelerated by a better integration of existing knowledge resources and by initiating large collaborative projects involving specialists of different biology fields. We hope that our analysis will contribute to set up the ground for such collaborative approaches and will be exploited by the HUPO Human Proteome Project teams committed to characterize uPE1 proteins.


Asunto(s)
Anotación de Secuencia Molecular , Proteoma/genética , Biología Computacional , Minería de Datos , Genoma Humano/genética , Humanos , Métodos , Proteoma/análisis
10.
Hum Mutat ; 38(5): 485-493, 2017 05.
Artículo en Inglés | MEDLINE | ID: mdl-28168870

RESUMEN

Voltage-gated sodium channels are pore-forming transmembrane proteins that selectively allow sodium ions to flow across the plasma membrane according to the electro-chemical gradient thus mediating the rising phase of action potentials in excitable cells and playing key roles in physiological processes such as neurotransmission, skeletal muscle contraction, heart rhythm, and pain sensation. Genetic variations in the nine human genes encoding these channels are known to cause a large range of diseases affecting the nervous and cardiac systems. Understanding the molecular effect of genetic variations is critical for elucidating the pathologic mechanisms of known variations and in predicting the effect of newly discovered ones. To this end, we have created a Web-based tool, the Ion Channels Variants Portal, which compiles all variants characterized functionally in the human sodium channel genes. This portal describes 672 variants each associated with at least one molecular or clinical phenotypic impact, for a total of 4,658 observations extracted from 264 different research articles. These data were captured as structured annotations using standardized vocabularies and ontologies, such as the Gene Ontology and the Ion Channel ElectroPhysiology Ontology. All these data are available to the scientific community via neXtProt at https://www.nextprot.org/portals/navmut.


Asunto(s)
Biología Computacional , Bases de Datos Genéticas , Mutación , Canales de Sodio Activados por Voltaje/genética , Canales de Sodio Activados por Voltaje/metabolismo , Animales , Biología Computacional/métodos , Fenómenos Electrofisiológicos/genética , Estudios de Asociación Genética , Predisposición Genética a la Enfermedad , Genotipo , Humanos , Anotación de Secuencia Molecular , Fenotipo , Dominios Proteicos , Índice de Severidad de la Enfermedad , Programas Informáticos , Canales de Sodio Activados por Voltaje/química , Navegador Web
11.
Nucleic Acids Res ; 43(Database issue): D764-70, 2015 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-25593349

RESUMEN

neXtProt (http://www.nextprot.org) is a human protein-centric knowledgebase developed at the SIB Swiss Institute of Bioinformatics. Focused solely on human proteins, neXtProt aims to provide a state of the art resource for the representation of human biology by capturing a wide range of data, precise annotations, fully traceable data provenance and a web interface which enables researchers to find and view information in a comprehensive manner. Since the introductory neXtProt publication, significant advances have been made on three main aspects: the representation of proteomics data, an extended representation of human variants and the development of an advanced search capability built around semantic technologies. These changes are presented in the current neXtProt update.


Asunto(s)
Bases de Datos de Proteínas , Variación Genética , Proteínas/genética , Proteómica , Línea Celular , Enfermedad/genética , Humanos , Internet , Proteoma
12.
J Proteome Res ; 15(11): 3971-3978, 2016 11 04.
Artículo en Inglés | MEDLINE | ID: mdl-27487287

RESUMEN

Within the C-HPP, the Swiss and French teams are responsible for the annotation of proteins from chromosomes 2 and 14, respectively. neXtProt currently reports 1231 entries on chromosome 2 and 624 entries on chromosome 14; of these, 134 and 93 entries are still not experimentally validated and are thus considered as "missing proteins" (PE2-4), respectively. Among these entries, some may never be validated by conventional MS/MS approaches because of incompatible biochemical features. Others have already been validated but are still awaiting annotation. On the basis of information retrieved from the literature and from three of the main C-HPP resources (Human Protein Atlas, PeptideAtlas, and neXtProt), a subset of 40 theoretically detectable missing proteins (25 on chromosome 2 and 15 on chromosome 14) was defined for upcoming targeted studies in sperm samples. This list is proposed as a roadmap for the French and Swiss teams in the near future.


Asunto(s)
Cromosomas Humanos Par 14 , Cromosomas Humanos Par 2 , Proteoma/análisis , Biología Computacional/tendencias , Minería de Datos/tendencias , Bases de Datos de Proteínas , Francia , Humanos , Masculino , Espermatozoides/química , Suiza , Espectrometría de Masas en Tándem/normas
13.
Nucleic Acids Res ; 42(Web Server issue): W436-41, 2014 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-24792157

RESUMEN

The SIB Swiss Institute of Bioinformatics (www.isb-sib.ch) was created in 1998 as an institution to foster excellence in bioinformatics. It is renowned worldwide for its databases and software tools, such as UniProtKB/Swiss-Prot, PROSITE, SWISS-MODEL, STRING, etc, that are all accessible on ExPASy.org, SIB's Bioinformatics Resource Portal. This article provides an overview of the scientific and training resources SIB has consistently been offering to the life science community for more than 15 years.


Asunto(s)
Biología Computacional , Bases de Datos de Compuestos Químicos , Programas Informáticos , Evolución Biológica , Bioestadística , Diseño de Fármacos , Genómica , Humanos , Internet , Conformación Proteica , Proteómica , Biología de Sistemas
14.
Proteomics ; 15(14): 2519-24, 2015 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-25944712

RESUMEN

The high throughput characterization of protein N-termini is becoming an emerging challenge in the proteomics and proteogenomics fields. The present study describes the free N-terminome analysis of human mitochondria-enriched samples using trimethoxyphenyl phosphonium (TMPP) labelling approaches. Owing to the extent of protein import and cleavage for mitochondrial proteins, determining the new N-termini generated after translocation/processing events for mitochondrial proteins is crucial to understand the transformation of precursors to mature proteins. The doublet N-terminal oriented proteomics (dN-TOP) strategy based on a double light/heavy TMPP labelling has been optimized in order to improve and automate the workflow for efficient, fast and reliable high throughput N-terminome analysis. A total of 2714 proteins were identified and 897 N-terminal peptides were characterized (424 N-α-acetylated and 473 TMPP-labelled peptides). These results allowed the precise identification of the N-terminus of 693 unique proteins corresponding to 26% of all identified proteins. Overall, 120 already annotated processing cleavage sites were confirmed while 302 new cleavage sites were characterized. The accumulation of experimental evidence of mature N-termini should allow increasing the knowledge of processing mechanisms and consequently also enhance cleavage sites prediction algorithms. Complete datasets have been deposited to the ProteomeXchange Consortium with identifiers PXD001521, PXD001522 and PXD001523 (http://proteomecentral.proteomexchange.org/dataset/PXD001521, http://proteomecentral.proteomexchange.org/dataset/PXD0001522 and http://proteomecentral.proteomexchange.org/dataset/PXD001523, respectively).


Asunto(s)
Proteínas Mitocondriales/química , Proteómica/métodos , Humanos , Marcaje Isotópico/métodos , Espectrometría de Masas/métodos , Compuestos Organofosforados/química , Conformación Proteica
15.
Biochim Biophys Acta ; 1843(12): 2913-25, 2014 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-25229427

RESUMEN

Deoxyribose-phosphate aldolase (EC 4.1.2.4), which converts 2-deoxy-d-ribose-5-phosphate into glyceraldehyde-3-phosphate and acetaldehyde, belongs to the core metabolism of living organisms. It was previously shown that human cells harbor deoxyribose phosphate aldolase activity but the protein responsible of this activity has never been formally identified. This study provides the first experimental evidence that DERA, which is mainly expressed in lung, liver and colon, is the human deoxyribose phosphate aldolase. Among human cell lines, the highest DERA mRNA level and deoxyribose phosphate aldolase activity were observed in liver-derived Huh-7 cells. DERA was shown to interact with the known stress granule component YBX1 and to be recruited to stress granules after oxidative or mitochondrial stress. In addition, cells in which DERA expression was down-regulated using shRNA formed fewer stress granules and were more prone to apoptosis after clotrimazole stress, suggesting the importance of DERA for stress granule formation. Furthermore, the expression of DERA was shown to permit cells in which mitochondrial ATP production was abolished to make use of extracellular deoxyinosine to maintain ATP levels. This study unraveled a previously undescribed pathway which may allow cells with high deoxyribose-phosphate aldolase activity, such as liver cells, to minimize or delay stress-induced damage by producing energy through deoxynucleoside degradation.

16.
Proteins ; 83(6): 1005-13, 2015 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-25820941

RESUMEN

As the volume of data relating to proteins increases, researchers rely more and more on the analysis of published data, thus increasing the importance of good access to these data that vary from the supplemental material of individual articles, all the way to major reference databases with professional staff and long-term funding. Specialist protein resources fill an important middle ground, providing interactive web interfaces to their databases for a focused topic or family of proteins, using specialized approaches that are not feasible in the major reference databases. Many are labors of love, run by a single lab with little or no dedicated funding and there are many challenges to building and maintaining them. This perspective arose from a meeting of several specialist protein resources and major reference databases held at the Wellcome Trust Genome Campus (Cambridge, UK) on August 11 and 12, 2014. During this meeting some common key challenges involved in creating and maintaining such resources were discussed, along with various approaches to address them. In laying out these challenges, we aim to inform users about how these issues impact our resources and illustrate ways in which our working together could enhance their accuracy, currency, and overall value.


Asunto(s)
Bases de Datos de Proteínas/normas , Anotación de Secuencia Molecular , Proteínas , Curaduría de Datos
18.
J Proteome Res ; 13(1): 15-20, 2014 Jan 03.
Artículo en Inglés | MEDLINE | ID: mdl-24364385

RESUMEN

One year ago the Human Proteome Project (HPP) leadership designated the baseline metrics for the Human Proteome Project to be based on neXtProt with a total of 13,664 proteins validated at protein evidence level 1 (PE1) by mass spectrometry, antibody-capture, Edman sequencing, or 3D structures. Corresponding chromosome-specific data were provided from PeptideAtlas, GPMdb, and Human Protein Atlas. This year, the neXtProt total is 15,646 and the other resources, which are inputs to neXtProt, have high-quality identifications and additional annotations for 14,012 in PeptideAtlas, 14,869 in GPMdb, and 10,976 in HPA. We propose to remove 638 genes from the denominator that are "uncertain" or "dubious" in Ensembl, UniProt/SwissProt, and neXtProt. That leaves 3844 "missing proteins", currently having no or inadequate documentation, to be found from a new denominator of 19,490 protein-coding genes. We present those tabulations and web links and discuss current strategies to find the missing proteins.


Asunto(s)
Proteoma , Cromosomas Humanos , Humanos , Espectrometría de Masas
19.
Nucleic Acids Res ; 40(Database issue): D761-9, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22102589

RESUMEN

UniPathway (http://www.unipathway.org) is a fully manually curated resource for the representation and annotation of metabolic pathways. UniPathway provides explicit representations of enzyme-catalyzed and spontaneous chemical reactions, as well as a hierarchical representation of metabolic pathways. This hierarchy uses linear subpathways as the basic building block for the assembly of larger and more complex pathways, including species-specific pathway variants. All of the pathway data in UniPathway has been extensively cross-linked to existing pathway resources such as KEGG and MetaCyc, as well as sequence resources such as the UniProt KnowledgeBase (UniProtKB), for which UniPathway provides a controlled vocabulary for pathway annotation. We introduce here the basic concepts underlying the UniPathway resource, with the aim of allowing users to fully exploit the information provided by UniPathway.


Asunto(s)
Bases de Datos Factuales , Redes y Vías Metabólicas , Bases de Datos de Proteínas , Enzimas/metabolismo , Lisina/biosíntesis , Anotación de Secuencia Molecular
20.
Nucleic Acids Res ; 40(Database issue): D76-83, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22139911

RESUMEN

neXtProt (http://www.nextprot.org/) is a new human protein-centric knowledge platform. Developed at the Swiss Institute of Bioinformatics (SIB), it aims to help researchers answer questions relevant to human proteins. To achieve this goal, neXtProt is built on a corpus containing both curated knowledge originating from the UniProtKB/Swiss-Prot knowledgebase and carefully selected and filtered high-throughput data pertinent to human proteins. This article presents an overview of the database and the data integration process. We also lay out the key future directions of neXtProt that we consider the necessary steps to make neXtProt the one-stop-shop for all research projects focusing on human proteins.


Asunto(s)
Bases de Datos de Proteínas , Humanos , Bases del Conocimiento , Proteínas/genética , Proteínas/metabolismo , Interfaz Usuario-Computador
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA