Search | VHL Regional Portal

Navigating the Global Protein-Protein Interaction Landscape Using iRefWeb.

Turinsky, Andrei L; Dupont, Sam; Botzki, Alexander; Razick, Sabry; Turner, Brian; Donaldson, Ian M; Wodak, Shoshana J.

Methods Mol Biol ; 2199: 191-207, 2021.

Article in English | MEDLINE | ID: mdl-33125652

ABSTRACT

iRefWeb is a resource that provides web interface to a large collection of protein-protein interactions aggregated from major primary databases. The underlying data-consolidation process, called iRefIndex, implements a rigorous methodology of identifying redundant protein sequences and integrating disparate data records that reference the same peptide sequences, despite many potential differences in data identifiers across various source databases. iRefWeb offers a unified user interface to all interaction records and associated information collected by iRefIndex, in addition to a number of data filters and visual features that present the supporting evidence. Users of iRefWeb can explore the consolidated landscape of protein-protein interactions, establish the provenance and reliability of each data record, and compare annotations performed by different data curator teams. The iRefWeb portal is freely available at http://wodaklab.org/iRefWeb .

Subject(s)

Database Management Systems , Databases, Protein , Internet , Protein Interaction Mapping , User-Computer Interface , Humans

The eGenVar data management system--cataloguing and sharing sensitive data and metadata for the life sciences.

Razick, Sabry; Mocnik, Rok; Thomas, Laurent F; Ryeng, Einar; Drabløs, Finn; Sætrom, Pål.

Database (Oxford) ; 2014: bau027, 2014.

Article in English | MEDLINE | ID: mdl-24682735

ABSTRACT

Systematic data management and controlled data sharing aim at increasing reproducibility, reducing redundancy in work, and providing a way to efficiently locate complementing or contradicting information. One method of achieving this is collecting data in a central repository or in a location that is part of a federated system and providing interfaces to the data. However, certain data, such as data from biobanks or clinical studies, may, for legal and privacy reasons, often not be stored in public repositories. Instead, we describe a metadata cataloguing system and a software suite for reporting the presence of data from the life sciences domain. The system stores three types of metadata: file information, file provenance and data lineage, and content descriptions. Our software suite includes both graphical and command line interfaces that allow users to report and tag files with these different metadata types. Importantly, the files remain in their original locations with their existing access-control mechanisms in place, while our system provides descriptions of their contents and relationships. Our system and software suite thereby provide a common framework for cataloguing and sharing both public and private data. Database URL: http://bigr.medisin.ntnu.no/data/eGenVar/.

Subject(s)

Biological Science Disciplines , Database Management Systems , Databases, Genetic , Information Dissemination , Software , Biological Ontologies , Search Engine , Terminology as Topic

Navigating the global protein-protein interaction landscape using iRefWeb.

Turinsky, Andrei L; Razick, Sabry; Turner, Brian; Donaldson, Ian M; Wodak, Shoshana J.

Methods Mol Biol ; 1091: 315-31, 2014.

Article in English | MEDLINE | ID: mdl-24203342

ABSTRACT

iRefWeb is a bioinformatics resource that offers access to a large collection of data on protein-protein interactions in over a thousand organisms. This collection is consolidated from 14 major public databases that curate the scientific literature. The collection is enhanced with a range of versatile data filters and search options that categorize various types of protein-protein interactions and protein complexes. Users of iRefWeb are able to retrieve all curated interactions for a given organism or those involving a given protein (or a list of proteins), narrow down their search results based on different supporting evidence, and assess the reliability of these interactions using various criteria. They may also examine all data and annotations related to any publication that described the interaction-detection experiments. iRefWeb is freely available to the research community worldwide at http://wodaklab.org/iRefWeb .

Subject(s)

Protein Interaction Mapping/methods , Protein Interaction Maps , Proteins/metabolism , Web Browser , Carrier Proteins/metabolism , Computational Biology/methods , Databases, Protein , Molecular Sequence Annotation , Protein Binding , Proteins/genetics , Proteomics/methods

iRefScape. A Cytoscape plug-in for visualization and data mining of protein interaction data from iRefIndex.

Razick, Sabry; Mora, Antonio; Michalickova, Katerina; Boddie, Paul; Donaldson, Ian M.

BMC Bioinformatics ; 12: 388, 2011 Oct 05.

Article in English | MEDLINE | ID: mdl-21975162

ABSTRACT

BACKGROUND: The iRefIndex consolidates protein interaction data from ten databases in a rigorous manner using sequence-based hash keys. Working with consolidated interaction data comes with distinct challenges: data are redundant, overlapping, highly interconnected and may be collected and represented using different curation practices. These phenomena were quantified in our previous studies. RESULTS: The iRefScape plug-in for the Cytoscape graphical viewer addresses these challenges. We show how these factors impact on data-mining tasks and how our solutions resolve them in a simple and efficient manner. A uniform accession space is used to limit redundancy and support search expansion and searching on multiple accession types. Multiple node and edge features support data filtering and mining. Node colours and features supply information about search result provenance. Overlapping evidence is presented using a multi-graph and a bi-partite representation is used to distinguish binary and n-ary source data. Searching for interactions between sets of proteins is supported and specifically includes searches on disease-related genes found in OMIM. Finally, a synchronized adjacency-matrix view facilitates visualization of relationships between sets of user defined groups. CONCLUSIONS: The iRefScape plug-in will be of interest to advanced users of interaction data. The plug-in provides access to a consolidated data set in a uniform accession space while remaining faithful to the underlying source data. Tools are provided to facilitate a range of tasks from a simple search to knowledge discovery. The plug-in uses a number of strategies that will be of interest to other plug-in developers.

Subject(s)

Data Mining , Databases, Protein , Proteins/metabolism , Database Management Systems , Databases, Genetic , Protein Interaction Mapping , Software

PSICQUIC and PSISCORE: accessing and scoring molecular interactions.

Aranda, Bruno; Blankenburg, Hagen; Kerrien, Samuel; Brinkman, Fiona S L; Ceol, Arnaud; Chautard, Emilie; Dana, Jose M; De Las Rivas, Javier; Dumousseau, Marine; Galeota, Eugenia; Gaulton, Anna; Goll, Johannes; Hancock, Robert E W; Isserlin, Ruth; Jimenez, Rafael C; Kerssemakers, Jules; Khadake, Jyoti; Lynn, David J; Michaut, Magali; O'Kelly, Gavin; Ono, Keiichiro; Orchard, Sandra; Prieto, Carlos; Razick, Sabry; Rigina, Olga; Salwinski, Lukasz; Simonovic, Milan; Velankar, Sameer; Winter, Andrew; Wu, Guanming; Bader, Gary D; Cesareni, Gianni; Donaldson, Ian M; Eisenberg, David; Kleywegt, Gerard J; Overington, John; Ricard-Blum, Sylvie; Tyers, Mike; Albrecht, Mario; Hermjakob, Henning.

Nat Methods ; 8(7): 528-9, 2011 Jun 29.

Article in English | MEDLINE | ID: mdl-21716279

Subject(s)

Proteins/chemistry , Proteins/metabolism , Software , Animals , Computational Biology , Databases, Factual , Humans , Protein Binding

Interaction databases on the same page.

Turinsky, Andrei L; Razick, Sabry; Turner, Brian; Donaldson, Ian M; Wodak, Shoshana J.

Nat Biotechnol ; 29(5): 391-3, 2011 May.

Article in English | MEDLINE | ID: mdl-21552234

Subject(s)

Databases, Protein/standards , Internet , Protein Interaction Domains and Motifs , Animals , Database Management Systems , Humans , Mice , Protein Interaction Mapping , Proteomics , Saccharomyces cerevisiae , Saccharomyces cerevisiae Proteins/metabolism

Literature curation of protein interactions: measuring agreement across major public databases.

Turinsky, Andrei L; Razick, Sabry; Turner, Brian; Donaldson, Ian M; Wodak, Shoshana J.

Database (Oxford) ; 2010: baq026, 2010.

Article in English | MEDLINE | ID: mdl-21183497

ABSTRACT

Literature curation of protein interaction data faces a number of challenges. Although curators increasingly adhere to standard data representations, the data that various databases actually record from the same published information may differ significantly. Some of the reasons underlying these differences are well known, but their global impact on the interactions collectively curated by major public databases has not been evaluated. Here we quantify the agreement between curated interactions from 15 471 publications shared across nine major public databases. Results show that on average, two databases fully agree on 42% of the interactions and 62% of the proteins curated from the same publication. Furthermore, a sizable fraction of the measured differences can be attributed to divergent assignments of organism or splice isoforms, different organism focus and alternative representations of multi-protein complexes. Our findings highlight the impact of divergent curation policies across databases, and should be relevant to both curators and data consumers interested in analyzing protein-interaction data generated by the scientific community. Database URL: http://wodaklab.org/iRefWeb.

Subject(s)

Database Management Systems , Databases, Protein , Protein Interaction Domains and Motifs , Proteins/chemistry , Proteins/metabolism , Proteomics , Analysis of Variance , Animals , Humans , Internet , Protein Interaction Mapping , Protein Isoforms

iRefWeb: interactive analysis of consolidated protein interaction data and their supporting evidence.

Turner, Brian; Razick, Sabry; Turinsky, Andrei L; Vlasblom, James; Crowdy, Edgard K; Cho, Emerson; Morrison, Kyle; Donaldson, Ian M; Wodak, Shoshana J.

Database (Oxford) ; 2010: baq023, 2010 Oct 12.

Article in English | MEDLINE | ID: mdl-20940177

ABSTRACT

We present iRefWeb, a web interface to protein interaction data consolidated from 10 public databases: BIND, BioGRID, CORUM, DIP, IntAct, HPRD, MINT, MPact, MPPI and OPHID. iRefWeb enables users to examine aggregated interactions for a protein of interest, and presents various statistical summaries of the data across databases, such as the number of organism-specific interactions, proteins and cited publications. Through links to source databases and supporting evidence, researchers may gauge the reliability of an interaction using simple criteria, such as the detection methods, the scale of the study (high- or low-throughput) or the number of cited publications. Furthermore, iRefWeb compares the information extracted from the same publication by different databases, and offers means to follow-up possible inconsistencies. We provide an overview of the consolidated protein-protein interaction landscape and show how it can be automatically cropped to aid the generation of meaningful organism-specific interactomes. iRefWeb can be accessed at: http://wodaklab.org/iRefWeb. Database URL: http://wodaklab.org/iRefWeb/

Subject(s)

Data Display , Database Management Systems , Databases, Protein , Information Storage and Retrieval , Internet , Protein Interaction Mapping/methods , User-Computer Interface , Abstracting and Indexing , Animals , Computational Biology/methods , Humans , Proteins/classification

iRefIndex: a consolidated protein interaction database with provenance.

Razick, Sabry; Magklaras, George; Donaldson, Ian M.

BMC Bioinformatics ; 9: 405, 2008 Sep 30.

Article in English | MEDLINE | ID: mdl-18823568

ABSTRACT

BACKGROUND: Interaction data for a given protein may be spread across multiple databases. We set out to create a unifying index that would facilitate searching for these data and that would group together redundant interaction data while recording the methods used to perform this grouping. RESULTS: We present a method to generate a key for a protein interaction record and a key for each participant protein. These keys may be generated by anyone using only the primary sequence of the proteins, their taxonomy identifiers and the Secure Hash Algorithm. Two interaction records will have identical keys if they refer to the same set of identical protein sequences and taxonomy identifiers. We define records with identical keys as a redundant group. Our method required that we map protein database references found in interaction records to current protein sequence records. Operations performed during this mapping are described by a mapping score that may provide valuable feedback to source interaction databases on problematic references that are malformed, deprecated, ambiguous or unfound. Keys for protein participants allow for retrieval of interaction information independent of the protein references used in the original records. CONCLUSION: We have applied our method to protein interaction records from BIND, BioGrid, DIP, HPRD, IntAct, MINT, MPact, MPPI and OPHID. The resulting interaction reference index is provided in PSI-MITAB 2.5 format at http://irefindex.uio.no. This index may form the basis of alternative redundant groupings based on gene identifiers or near sequence identity groupings.

Subject(s)

Abstracting and Indexing/methods , Database Management Systems , Protein Interaction Mapping/methods , Amino Acid Sequence/physiology , Animals , Data Compression , Databases, Protein , Humans , Neural Networks, Computer , Proteins/classification , Proteins/metabolism , Proteome/metabolism , Proteomics/methods

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

Subject(s)

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL