Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 28
Filtrar
1.
Bioinformatics ; 38(Suppl_2): ii162-ii167, 2022 09 16.
Artigo em Inglês | MEDLINE | ID: mdl-36124803

RESUMO

MOTIVATION: We have previously designed and implemented a tree-based ontology to represent glycan structures with the aim of searching these structures with a glyco-driven syntax. This resulted in creating the GlySTreeM knowledge-base as a linchpin of the structural matching procedure and we now introduce a query language, called GlycoQL, for the actual implementation of a glycan structure search. RESULTS: The methodology is described and illustrated with a use-case focused on Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) spike protein glycosylation. We show how to enhance site annotation with federated queries involving UniProt and GlyConnect, our glycoprotein database. AVAILABILITY AND IMPLEMENTATION: https://glyconnect.expasy.org/glycoql/.


Assuntos
COVID-19 , SARS-CoV-2 , Glicoproteínas , Glicosilação , Humanos , Polissacarídeos/química
2.
Nucleic Acids Res ; 49(D1): D1548-D1554, 2021 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-33174598

RESUMO

Lectins are non-covalent glycan-binding proteins mediating cellular interactions but their annotation in newly sequenced organisms is lacking. The limited size of functional domains and the low level of sequence similarity challenge usual bioinformatics tools. The identification of lectin domains in proteomes requires the manual curation of sequence alignments based on structural folds. A new lectin classification is proposed. It is built on three levels: (i) 35 lectin domain folds, (ii) 109 classes of lectins sharing at least 20% sequence similarity and (iii) 350 families of lectins sharing at least 70% sequence similarity. This information is compiled in the UniLectin platform that includes the previously described UniLectin3D database of curated lectin 3D structures. Since its first release, UniLectin3D has been updated with 485 additional 3D structures. The database is now complemented by two additional modules: PropLec containing predicted ß-propeller lectins and LectomeXplore including predicted lectins from sequences of the NBCI-nr and UniProt for every curated lectin class. UniLectin is accessible at https://www.unilectin.eu/.


Assuntos
Bases de Dados de Proteínas , Genoma , Lectinas/química , Proteoma/química , Receptores de Superfície Celular/química , Sequência de Aminoácidos , Animais , Antozoários/genética , Antozoários/metabolismo , Biologia Computacional/métodos , Humanos , Internet , Lectinas/classificação , Lectinas/genética , Lectinas/metabolismo , Conformação Proteica em alfa-Hélice , Conformação Proteica em Folha beta , Domínios e Motivos de Interação entre Proteínas , Proteoma/classificação , Proteoma/genética , Proteoma/metabolismo , Receptores de Superfície Celular/classificação , Receptores de Superfície Celular/genética , Receptores de Superfície Celular/metabolismo , Alinhamento de Sequência , Homologia de Sequência de Aminoácidos , Software , Terminologia como Assunto
3.
Mol Cell Proteomics ; 19(10): 1602-1618, 2020 10.
Artigo em Inglês | MEDLINE | ID: mdl-32636234

RESUMO

A key point in achieving accurate intact glycopeptide identification is the definition of the glycan composition file that is used to match experimental with theoretical masses by a glycoproteomics search engine. At present, these files are mainly built from searching the literature and/or querying data sources focused on posttranslational modifications. Most glycoproteomics search engines include a default composition file that is readily used when processing MS data. We introduce here a glycan composition visualizing and comparative tool associated with the GlyConnect database and called GlyConnect Compozitor. It offers a web interface through which the database can be queried to bring out contextual information relative to a set of glycan compositions. The tool takes advantage of compositions being related to one another through shared monosaccharide counts and outputs interactive graphs summarizing information searched in the database. These results provide a guide for selecting or deselecting compositions in a file in order to reflect the context of a study as closely as possible. They also confirm the consistency of a set of compositions based on the content of the GlyConnect database. As part of the tool collection of the Glycomics@ExPASy initiative, Compozitor is hosted at https://glyconnect.expasy.org/compozitor/ where it can be run as a web application. It is also directly accessible from the GlyConnect database.


Assuntos
Glicômica , Polissacarídeos/metabolismo , Animais , Células CHO , Cricetulus , Bases de Dados Factuais , Humanos , Imunoglobulina G/metabolismo , Integrinas/metabolismo , Mucinas/metabolismo , Polissacarídeos/química
4.
Glycobiology ; 31(7): 741-750, 2021 08 07.
Artigo em Inglês | MEDLINE | ID: mdl-33677548

RESUMO

Recent years have seen great advances in the development of glycoproteomics protocols and methods resulting in a sustainable increase in the reporting proteins, their attached glycans and glycosylation sites. However, only very few of these reports find their way into databases or data repositories. One of the major reasons is the absence of digital standard to represent glycoproteins and the challenging annotations with glycans. Depending on the experimental method, such a standard must be able to represent glycans as complete structures or as compositions, store not just single glycans but also represent glycoforms on a specific glycosylation side, deal with partially missing site information if no site mapping was performed, and store abundances or ratios of glycans within a glycoform of a specific site. To support the above, we have developed the GlycoConjugate Ontology (GlycoCoO) as a standard semantic framework to describe and represent glycoproteomics data. GlycoCoO can be used to represent glycoproteomics data in triplestores and can serve as a basis for data exchange formats. The ontology, database providers and supporting documentation are available online (https://github.com/glycoinfo/GlycoCoO).


Assuntos
Glicoproteínas , Polissacarídeos , Glicoproteínas/metabolismo , Glicosilação , Polissacarídeos/metabolismo
5.
Nucleic Acids Res ; 47(D1): D1236-D1244, 2019 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-30239928

RESUMO

Lectins, and related receptors such as adhesins and toxins, are glycan-binding proteins from all origins that decipher the glycocode, i.e. the structural information encoded in the conformation of complex carbohydrates present on the surface of all cells. Lectins are still poorly classified and annotated, but since their functions are based on ligand recognition, their 3D-structures provide a solid foundation for characterization. UniLectin3D is a curated database that classifies lectins on origin and fold, with cross-links to literature, other databases in glycosciences and functional data such as known specificity. The database provides detailed information on lectins, their bound glycan ligands, and features their interactions using the Protein-Ligand Interaction Profiler (PLIP) server. Special care was devoted to the description of the bound glycan ligands with the use of simple graphical representation and numerical format for cross-linking to other databases in glycoscience. We conceived the design of the database architecture and the navigation tools to account for all organisms, as well as to search for oligosaccharide epitopes complexed within specified binding sites. UniLectin3D is accessible at https://www.unilectin.eu/unilectin3D.


Assuntos
Biologia Computacional/métodos , Bases de Dados de Proteínas , Conformação Proteica , Receptores de Superfície Celular/química , Sítios de Ligação , Humanos , Internet , Lectinas/química , Lectinas/metabolismo , Ligantes , Modelos Moleculares , Polissacarídeos/química , Polissacarídeos/metabolismo , Ligação Proteica , Receptores de Superfície Celular/metabolismo
6.
Molecules ; 27(1)2021 Dec 23.
Artigo em Inglês | MEDLINE | ID: mdl-35011294

RESUMO

The level of ambiguity in describing glycan structure has significantly increased with the upsurge of large-scale glycomics and glycoproteomics experiments. Consequently, an ontology-based model appears as an appropriate solution for navigating these data. However, navigation is not sufficient and the model should also enable advanced search and comparison. A new ontology with a tree logical structure is introduced to represent glycan structures irrespective of the precision of molecular details. The model heavily relies on the GlycoCT encoding of glycan structures. Its implementation in the GlySTreeM knowledge base was validated with GlyConnect data and benchmarked with the Glycowork library. GlySTreeM is shown to be fast, consistent, reliable and more flexible than existing solutions for matching parts of or whole glycan structures. The model is also well suited for painless future expansion.


Assuntos
Glicômica/métodos , Polissacarídeos/química , Bases de Dados Factuais , Estrutura Molecular , Relação Estrutura-Atividade , Navegador
7.
Mol Cell Proteomics ; 17(11): 2164-2176, 2018 11.
Artigo em Inglês | MEDLINE | ID: mdl-30097532

RESUMO

Glycomics@ExPASy (https://www.expasy.org/glycomics) is the glycomics tab of ExPASy, the server of SIB Swiss Institute of Bioinformatics. It was created in 2016 to centralize web-based glycoinformatics resources developed within an international network of glycoscientists. The hosted collection currently includes mainly databases and tools created and maintained at SIB but also links to a range of reference resources popular in the glycomics community. The philosophy of our toolbox is that it should be {glycoscientist AND protein scientist}-friendly with the aim of (1) popularizing the use of bioinformatics in glycobiology and (2) emphasizing the relationship between glycobiology and protein-oriented bioinformatics resources. The scarcity of data bridging these two disciplines led us to design tools as interactive as possible based on database connectivity to facilitate data exploration and support hypothesis building. Glycomics@ExPASy was designed, and is developed, with a long-term vision in close collaboration with glycoscientists to meet as closely as possible the growing needs of the community for glycoinformatics.


Assuntos
Glicômica/métodos , Software , Coleta de Dados , Glicoproteínas/metabolismo , Humanos , Espectrometria de Massas , Polissacarídeos/metabolismo , Mapas de Interação de Proteínas
8.
J Proteome Res ; 18(2): 664-677, 2019 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-30574787

RESUMO

Knowledge of glycoproteins, their site-specific glycosylation patterns, and the glycan structures that they present to their recognition partners in health and disease is gradually being built on using a range of experimental approaches. The data from these analyses are increasingly being standardized and presented in various sources, from supplemental tables in publications to localized servers in investigator laboratories. Bioinformatics tools are now needed to collect these data and enable the user to search, display, and connect glycomics and glycoproteomics to other sources of related proteomics, genomics, and interactomics information. We here introduce GlyConnect ( https://glyconnect.expasy.org/ ), the central platform of the Glycomics@ExPASy portal for glycoinformatics. GlyConnect has been developed to gather, monitor, integrate, and visualize data in a user-friendly way to facilitate the interpretation of collected glycoscience data. GlyConnect is designed to accommodate and integrate multiple data types as they are increasingly produced.


Assuntos
Glicômica/métodos , Proteômica/métodos , Software , Biologia Computacional/métodos , Glicômica/instrumentação , Glicoproteínas/análise , Glicosilação , Interface Usuário-Computador
9.
Glycobiology ; 29(1): 36-44, 2019 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-30239692

RESUMO

Mammalian glycosaminoglycans are linear complex polysaccharides comprising heparan sulfate, heparin, dermatan sulfate, chondroitin sulfate, keratan sulfate and hyaluronic acid. They bind to numerous proteins and these interactions mediate their biological activities. GAG-protein interaction data reported in the literature are curated mostly in MatrixDB database (http://matrixdb.univ-lyon1.fr/). However, a standard nomenclature and a machine-readable format of GAGs together with bioinformatics tools for mining these interaction data are lacking. We report here the building of an automated pipeline to (i) standardize the format of GAG sequences interacting with proteins manually curated from the literature, (ii) translate them into the machine-readable GlycoCT format and into SNFG (Symbol Nomenclature For Glycan) images and (iii) convert their sequences into a format processed by a builder generating three-dimensional structures of polysaccharides based on a repertoire of conformations experimentally validated by data extracted from crystallized GAG-protein complexes. We have developed for this purpose a converter (the CT23D converter) to automatically translate the GlycoCT code of a GAG sequence into the input file required to construct a three-dimensional model.


Assuntos
Glicosaminoglicanos/química , Modelos Moleculares , Software , Animais , Configuração de Carboidratos , Glicosaminoglicanos/genética , Humanos
10.
Glycobiology ; 28(6): 349-362, 2018 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-29518231

RESUMO

Nowadays, due to the advance of experimental techniques in glycomics, large collections of glycan profiles are regularly published. The rapid growth of available glycan data accentuates the lack of innovative tools for visualizing and exploring large amount of information. Scientists resort to using general-purpose spreadsheet applications to create ad hoc data visualization. Thus, results end up being encoded in publication images and text, while valuable curated data is stored in files as supplementary information. To tackle this problem, we have built an interactive pipeline composed with three tools: Glynsight, EpitopeXtractor and Glydin'. Glycan profile data can be imported in Glynsight, which generates a custom interactive glycan profile. Several profiles can be compared and glycan composition is integrated with structural data stored in databases. Glycan structures of interest can then be sent to EpitopeXtractor to perform a glycoepitope extraction. EpitopeXtractor results can be superimposed on the Glydin' glycoepitope network. The network visualization allows fast detection of clusters of glycoepitopes and discovery of potential new targets. Each of these tools is standalone or can be used in conjunction with the others, depending on the data and the specific interest of the user. All the tools composing this pipeline are part of the Glycomics@ExPASy initiative and are available at https://www.expasy.org/glycomics.


Assuntos
Epitopos/química , Glicômica/métodos , Informática/métodos , Processamento de Proteína Pós-Traducional , Software , Bases de Dados de Compostos Químicos , Epitopos/imunologia , Glicosilação , Humanos
11.
Nucleic Acids Res ; 44(D1): D1243-50, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26578555

RESUMO

The SugarBind Database (SugarBindDB) covers knowledge of glycan binding of human pathogen lectins and adhesins. It is a curated database; each glycan-protein binding pair is associated with at least one published reference. The core data element of SugarBindDB is a set of three inseparable components: the pathogenic agent, a lectin/adhesin and a glycan ligand. Each entity (agent, lectin or ligand) is described by a range of properties that are summarized in an entity-dedicated page. Several search, navigation and visualisation tools are implemented to investigate the functional role of glycans in pathogen binding. The database is cross-linked to protein and glycan-relaled resources such as UniProtKB and UniCarbKB. It is tightly bound to the latter via a substructure search tool that maps each ligand to full structures where it occurs. Thus, a glycan-lectin binding pair of SugarBindDB can lead to the identification of a glycan-mediated protein-protein interaction, that is, a lectin-glycoprotein interaction, via substructure search and the knowledge of site-specific glycosylation stored in UniCarbKB. SugarBindDB is accessible at: http://sugarbind.expasy.org.


Assuntos
Bases de Dados de Compostos Químicos , Interações Hospedeiro-Patógeno , Lectinas/metabolismo , Polissacarídeos/metabolismo , Proteínas de Bactérias/metabolismo , Lectinas/química , Ligantes , Polissacarídeos/química , Ligação Proteica , Proteínas Virais/metabolismo
12.
Molecules ; 23(12)2018 Dec 05.
Artigo em Inglês | MEDLINE | ID: mdl-30563078

RESUMO

SugarSketcher is an intuitive and fast JavaScript interface module for online drawing of glycan structures in the popular Symbol Nomenclature for Glycans (SNFG) notation and exporting them to various commonly used formats encoding carbohydrate sequences (e.g., GlycoCT) or quality images (e.g., svg). It does not require a backend server or any specific browser plugins and can be integrated in any web glycoinformatics project. SugarSketcher allows drawing glycans both for glycobiologists and non-expert users. The "quick mode" allows a newcomer to build up a glycan structure having only a limited knowledge in carbohydrate chemistry. The "normal mode" integrates advanced options which enable glycobiologists to tailor complex carbohydrate structures. The source code is freely available on GitHub and glycoinformaticians are encouraged to participate in the development process while users are invited to test a prototype available on the ExPASY web-site and send feedback.


Assuntos
Polissacarídeos/química , Software , Navegador , Biologia Computacional/métodos , Relação Estrutura-Atividade
14.
Anal Chem ; 89(20): 10932-10940, 2017 10 17.
Artigo em Inglês | MEDLINE | ID: mdl-28901741

RESUMO

Tandem mass spectrometry, when combined with liquid chromatography and applied to complex mixtures, produces large amounts of raw data, which needs to be analyzed to identify molecular structures. This technique is widely used, particularly in glycomics. Due to a lack of high throughput glycan sequencing software, glycan spectra are predominantly sequenced manually. A challenge for writing glycan-sequencing software is that there is no direct template that can be used to infer structures detectable in an organism. To help alleviate this bottleneck, we present Glycoforest 1.0, a partial de novo algorithm for sequencing glycan structures based on MS/MS spectra. Glycoforest was tested on two data sets (human gastric and salmon mucosa O-linked glycomes) for which MS/MS spectra were annotated manually. Glycoforest generated the human validated structure for 92% of test cases. The correct structure was found as the best scoring match for 70% and among the top 3 matches for 83% of test cases. In addition, the Glycoforest algorithm detected glycan structures from MS/MS spectra missing a manual annotation. In total 1532 MS/MS previously unannotated spectra were annotated by Glycoforest. A portion containing 521 spectra was manually checked confirming that Glycoforest annotated an additional 50 MS/MS spectra overlooked during manual annotation.


Assuntos
Glicômica/métodos , Polissacarídeos/química , Software , Algoritmos , Sequência de Carboidratos , Cromatografia Líquida de Alta Pressão , Espectrometria de Massas em Tandem
15.
J Proteome Res ; 15(10): 3916-3928, 2016 10 07.
Artigo em Inglês | MEDLINE | ID: mdl-27523326

RESUMO

GlycoSiteAlign is a tool designed to align amino acid sequences of variable length surrounding glycosylation sites depending on the knowledge of glycan structure. It is an exploratory resource intended for the identification of characteristic amino acid patterns of unique glycan-protein interactions. GlycoSiteAlign uses data from the UniCarbKB and UniProtKB databases, and it is hosted on ExPASy, the Swiss Institute of Bioinformatics resource portal. The user can select either specific or general glycan features, set the length of the protein fragments, and trigger an alignment with the option of including 90% homologous proteins. The tool previews and downloads alignments that may reveal amino acid patterns corresponding to selected features (e.g., "fucosylated" versus "non-fucosylated"). GlycoSiteAlign will integrate new data as they become available to confirm and expand results. It is presented as a promising tool for assessing and refining the knowledge about the constraints that link a particular glycan structure to a particular glycosite, and in the long term, this application could help improve prediction tools.


Assuntos
Sequência de Aminoácidos , Glicosilação , Alinhamento de Sequência , Software , Sítios de Ligação , Biologia Computacional , Bases de Dados de Proteínas , Polissacarídeos/química
16.
Nucleic Acids Res ; 42(Database issue): D215-21, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24234447

RESUMO

The UniCarb KnowledgeBase (UniCarbKB; http://unicarbkb.org) offers public access to a growing, curated database of information on the glycan structures of glycoproteins. UniCarbKB is an international effort that aims to further our understanding of structures, pathways and networks involved in glycosylation and glyco-mediated processes by integrating structural, experimental and functional glycoscience information. This initiative builds upon the success of the glycan structure database GlycoSuiteDB, together with the informatic standards introduced by EUROCarbDB, to provide a high-quality and updated resource to support glycomics and glycoproteomics research. UniCarbKB provides comprehensive information concerning glycan structures, and published glycoprotein information including global and site-specific attachment information. For the first release over 890 references, 3740 glycan structure entries and 400 glycoproteins have been curated. Further, 598 protein glycosylation sites have been annotated with experimentally confirmed glycan structures from the literature. Among these are 35 glycoproteins, 502 structures and 60 publications previously not included in GlycoSuiteDB. This article provides an update on the transformation of GlycoSuiteDB (featured in previous NAR Database issues and hosted by ExPASy since 2009) to UniCarbKB and its integration with UniProtKB and GlycoMod. Here, we introduce a refactored database, supported by substantial new curated data collections and intuitive user-interfaces that improve database searching.


Assuntos
Bases de Dados de Proteínas , Glicoproteínas/química , Polissacarídeos/química , Glicoproteínas/metabolismo , Glicosilação , Internet , Polissacarídeos/metabolismo , Proteômica
17.
Bioinformatics ; 30(21): 3131-3, 2014 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-25015990

RESUMO

UNLABELLED: Sequencing oligosaccharides by exoglycosidases, either sequentially or in an array format, is a powerful tool to unambiguously determine the structure of complex N- and O-link glycans. Here, we introduce GlycoDigest, a tool that simulates exoglycosidase digestion, based on controlled rules acquired from expert knowledge and experimental evidence available in GlycoBase. The tool allows the targeted design of glycosidase enzyme mixtures by allowing researchers to model the action of exoglycosidases, thereby validating and improving the efficiency and accuracy of glycan analysis. AVAILABILITY AND IMPLEMENTATION: http://www.glycodigest.org.


Assuntos
Glicosídeo Hidrolases , Oligossacarídeos/química , Software , Bases de Conhecimento
18.
BMC Bioinformatics ; 15 Suppl 1: S9, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-24564482

RESUMO

BACKGROUND: Recent progress in method development for characterising the branched structures of complex carbohydrates has now enabled higher throughput technology. Automation of structure analysis then calls for software development since adding meaning to large data collections in reasonable time requires corresponding bioinformatics methods and tools. Current glycobioinformatics resources do cover information on the structure and function of glycans, their interaction with proteins or their enzymatic synthesis. However, this information is partial, scattered and often difficult to find to for non-glycobiologists. METHODS: Following our diagnosis of the causes of the slow development of glycobioinformatics, we review the "objective" difficulties encountered in defining adequate formats for representing complex entities and developing efficient analysis software. RESULTS: Various solutions already implemented and strategies defined to bridge glycobiology with different fields and integrate the heterogeneous glyco-related information are presented. CONCLUSIONS: Despite the initial stage of our integrative efforts, this paper highlights the rapid expansion of glycomics, the validity of existing resources and the bright future of glycobioinformatics.


Assuntos
Glicômica/métodos , Polissacarídeos/análise , Sequência de Carboidratos , Glicômica/normas , Internet , Polissacarídeos/química , Proteínas/química , Proteínas/metabolismo , Software
19.
bioRxiv ; 2024 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-38798633

RESUMO

Glycosylation is described as a non-templated biosynthesis. Yet, the template-free premise is antithetical to the observation that different N-glycans are consistently placed at specific sites. It has been proposed that glycosite-proximal protein structures could constrain glycosylation and explain the observed microheterogeneity. Using site-specific glycosylation data, we trained a hybrid neural network to parse glycosites (recurrent neural network) and match them to feasible N-glycosylation events (graph neural network). From glycosite-flanking sequences, the algorithm predicts most human N-glycosylation events documented in the GlyConnect database and proposed structures corresponding to observed monosaccharide composition of the glycans at these sites. The algorithm also recapitulated glycosylation in Enhanced Aromatic Sequons, SARS-CoV-2 spike, and IgG3 variants, thus demonstrating the ability of the algorithm to predict both glycan structure and abundance. Thus, protein structure constrains glycosylation, and the neural network enables predictive in silico glycosylation of uncharacterized or novel protein sequences and genetic variants.

20.
Sci Rep ; 12(1): 10846, 2022 06 27.
Artigo em Inglês | MEDLINE | ID: mdl-35760821

RESUMO

Human milk oligosaccharides (HMOs) form the third most abundant component of human milk and are known to convey several benefits to the neonate, including protection from viral and bacterial pathogens, training of the immune system, and influencing the gut microbiome. As HMO production during lactation is driven by enzymes that are common to other glycosylation processes, we adapted a model of mucin-type GalNAc-linked glycosylation enzymes to act on free lactose. We identified a subset of 11 enzyme activities that can account for 206 of 226 distinct HMOs isolated from human milk and constructed a biosynthetic reaction network that identifies 5 new core HMO structures. A comparison of monosaccharide compositions demonstrated that the model was able to discriminate between two possible groups of intermediates between major subnetworks, and to assign possible structures to several previously uncharacterised HMOs. The effect of enzyme knockouts is presented, identifying ß-1,4-galactosyltransferase and ß-1,3-N-acetylglucosaminyltransferase as key enzyme activities involved in the generation of the observed HMO glycosylation patterns. The model also provides a synthesis chassis for the most common HMOs found in lactating mothers.


Assuntos
Microbioma Gastrointestinal , Leite Humano , Bactérias , Feminino , Humanos , Recém-Nascido , Lactação , Leite Humano/química , Oligossacarídeos/química
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA