Búsqueda | BVS CLAP/SMR-OPS/OMS

1.

UniTmp: unified resources for transmembrane proteins.

Dobson, László; Gerdán, Csongor; Tusnády, Simon; Szekeres, Levente; Kuffa, Katalin; Langó, Tamás; Zeke, András; Tusnády, Gábor E.

Nucleic Acids Res ; 52(D1): D572-D578, 2024 Jan 05.

Artículo en Inglés | MEDLINE | ID: mdl-37870462

RESUMEN

The UNIfied database of TransMembrane Proteins (UniTmp) is a comprehensive and freely accessible resource of transmembrane protein structural information at different levels, from localization of protein segments, through the topology of the protein to the membrane-embedded 3D structure. We not only annotated tens of thousands of new structures and experiments, but we also developed a new system that can serve these resources in parallel. UniTmp is a unified platform that merges TOPDB (Topology Data Bank of Transmembrane Proteins), TOPDOM (database of conservatively located domains and motifs in proteins), PDBTM (Protein Data Bank of Transmembrane Proteins) and HTP (Human Transmembrane Proteome) databases and provides interoperability between the incorporated resources and an easy way to keep them regularly updated. The current update contains 9235 membrane-embedded structures, 9088 sequences with 536 035 topology-annotated segments and 8692 conservatively localized protein domains or motifs as well as 5466 annotated human transmembrane proteins. The UniTmp database can be accessed at https://www.unitmp.org.

Asunto(s)

Bases de Datos de Proteínas , Proteínas de la Membrana , Proteoma , Humanos , Proteínas de la Membrana/química

2.

ELM-the Eukaryotic Linear Motif resource-2024 update.

Kumar, Manjeet; Michael, Sushama; Alvarado-Valverde, Jesús; Zeke, András; Lazar, Tamas; Glavina, Juliana; Nagy-Kanta, Eszter; Donagh, Juan Mac; Kalman, Zsofia E; Pascarelli, Stefano; Palopoli, Nicolas; Dobson, László; Suarez, Carmen Florencia; Van Roey, Kim; Krystkowiak, Izabella; Griffin, Juan Esteban; Nagpal, Anurag; Bhardwaj, Rajesh; Diella, Francesca; Mészáros, Bálint; Dean, Kellie; Davey, Norman E; Pancsa, Rita; Chemes, Lucía B; Gibson, Toby J.

Nucleic Acids Res ; 52(D1): D442-D455, 2024 Jan 05.

Artículo en Inglés | MEDLINE | ID: mdl-37962385

RESUMEN

Short Linear Motifs (SLiMs) are the smallest structural and functional components of modular eukaryotic proteins. They are also the most abundant, especially when considering post-translational modifications. As well as being found throughout the cell as part of regulatory processes, SLiMs are extensively mimicked by intracellular pathogens. At the heart of the Eukaryotic Linear Motif (ELM) Resource is a representative (not comprehensive) database. The ELM entries are created by a growing community of skilled annotators and provide an introduction to linear motif functionality for biomedical researchers. The 2024 ELM update includes 346 novel motif instances in areas ranging from innate immunity to both protein and RNA degradation systems. In total, 39 classes of newly annotated motifs have been added, and another 17 existing entries have been updated in the database. The 2024 ELM release now includes 356 motif classes incorporating 4283 individual motif instances manually curated from 4274 scientific publications and including >700 links to experimentally determined 3D structures. In a recent development, the InterPro protein module resource now also includes ELM data. ELM is available at: http://elm.eu.org.

Asunto(s)

Secuencias de Aminoácidos , Bases de Datos de Proteínas , Eucariontes , Secuencias de Aminoácidos/genética , Procesamiento Proteico-Postraduccional , Proteínas/genética , Proteínas/metabolismo , Eucariontes/genética , Internet

3.

Linear motifs regulating protein secretion, sorting and autophagy in Leishmania parasites are diverged with respect to their host equivalents.

Zeke, Andras; Gibson, Toby J; Dobson, Laszlo.

PLoS Comput Biol ; 20(2): e1011902, 2024 Feb.

Artículo en Inglés | MEDLINE | ID: mdl-38363808

RESUMEN

The pathogenic, tropical Leishmania flagellates belong to an early-branching eukaryotic lineage (Kinetoplastida) with several unique features. Unfortunately, they are poorly understood from a molecular biology perspective, making development of mechanistically novel and selective drugs difficult. Here, we explore three functionally critical targeting short linear motif systems as well as their receptors in depth, using a combination of structural modeling, evolutionary sequence divergence and deep learning. Secretory signal peptides, endoplasmic reticulum (ER) retention motifs (KDEL motifs), and autophagy signals (motifs interacting with ATG8 family members) are ancient and essential components of cellular life. Although expected to be conserved amongst the kinetoplastids, we observe that all three systems show a varying degree of divergence from their better studied equivalents in animals, plants, or fungi. We not only describe their behaviour, but also build models that allow the prediction of localization and potential functions for several uncharacterized Leishmania proteins. The unusually Ala/Val-rich secretory signal peptides, endoplasmic reticulum resident proteins ending in Asp-Leu-COOH and atypical ATG8-like proteins are all unique molecular features of kinetoplastid parasites. Several of their critical protein-protein interactions could serve as targets of selective antimicrobial agents against Leishmaniasis due to their systematic divergence from the host.

Asunto(s)

Leishmania , Parásitos , Animales , Transporte de Proteínas , Autofagia , Señales de Clasificación de Proteína

4.

TmAlphaFold database: membrane localization and evaluation of AlphaFold2 predicted alpha-helical transmembrane protein structures.

Dobson, Laszlo; Szekeres, Levente I; Gerdán, Csongor; Langó, Tamás; Zeke, András; Tusnády, Gábor E.

Nucleic Acids Res ; 51(D1): D517-D522, 2023 01 06.

Artículo en Inglés | MEDLINE | ID: mdl-36318239

RESUMEN

AI-driven protein structure prediction, most notably AlphaFold2 (AF2) opens new frontiers for almost all fields of structural biology. As traditional structure prediction methods for transmembrane proteins were both complicated and error prone, AF2 is a great help to the community. Complementing the relatively meager number of experimental structures, AF2 provides 3D predictions for thousands of new alpha-helical membrane proteins. However, the lack of reliable structural templates and the fact that AF2 was not trained to handle phase boundaries also necessitates a delicate assessment of structural correctness. In our new database, Transmembrane AlphaFold database (TmAlphaFold database), we apply TMDET, a simple geometry-based method to visualize the likeliest position of the membrane plane. In addition, we calculate several parameters to evaluate the location of the protein into the membrane. This also allows TmAlphaFold database to show whether the predicted 3D structure is realistic or not. The TmAlphaFold database is available at https://tmalphafold.ttk.hu/.

Asunto(s)

Bases de Datos de Proteínas , Proteínas de la Membrana , Proteínas de la Membrana/química , Conformación Proteica , Conformación Proteica en Hélice alfa

5.

The Eukaryotic Linear Motif resource: 2022 release.

Kumar, Manjeet; Michael, Sushama; Alvarado-Valverde, Jesús; Mészáros, Bálint; Sámano-Sánchez, Hugo; Zeke, András; Dobson, Laszlo; Lazar, Tamas; Örd, Mihkel; Nagpal, Anurag; Farahi, Nazanin; Käser, Melanie; Kraleti, Ramya; Davey, Norman E; Pancsa, Rita; Chemes, Lucía B; Gibson, Toby J.

Nucleic Acids Res ; 50(D1): D497-D508, 2022 01 07.

Artículo en Inglés | MEDLINE | ID: mdl-34718738

RESUMEN

Almost twenty years after its initial release, the Eukaryotic Linear Motif (ELM) resource remains an invaluable source of information for the study of motif-mediated protein-protein interactions. ELM provides a comprehensive, regularly updated and well-organised repository of manually curated, experimentally validated short linear motifs (SLiMs). An increasing number of SLiM-mediated interactions are discovered each year and keeping the resource up-to-date continues to be a great challenge. In the current update, 30 novel motif classes have been added and five existing classes have undergone major revisions. The update includes 411 new motif instances mostly focused on cell-cycle regulation, control of the actin cytoskeleton, membrane remodelling and vesicle trafficking pathways, liquid-liquid phase separation and integrin signalling. Many of the newly annotated motif-mediated interactions are targets of pathogenic motif mimicry by viral, bacterial or eukaryotic pathogens, providing invaluable insights into the molecular mechanisms underlying infectious diseases. The current ELM release includes 317 motif classes incorporating 3934 individual motif instances manually curated from 3867 scientific publications. ELM is available at: http://elm.eu.org.

Asunto(s)

Enfermedades Transmisibles/genética , Bases de Datos de Proteínas , Interacciones Huésped-Patógeno/genética , Dominios y Motivos de Interacción de Proteínas , Programas Informáticos , Citoesqueleto de Actina/química , Citoesqueleto de Actina/metabolismo , Animales , Sitios de Unión , Ciclo Celular/genética , Membrana Celular/química , Membrana Celular/metabolismo , Enfermedades Transmisibles/metabolismo , Enfermedades Transmisibles/virología , Ciclinas/química , Ciclinas/genética , Ciclinas/metabolismo , Células Eucariotas/citología , Células Eucariotas/metabolismo , Células Eucariotas/virología , Regulación de la Expresión Génica , Humanos , Integrinas/química , Integrinas/genética , Integrinas/metabolismo , Ratones , Anotación de Secuencia Molecular , Unión Proteica , Ratas , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Transducción de Señal , Vesículas Transportadoras/química , Vesículas Transportadoras/metabolismo , Virus/genética , Virus/metabolismo

6.

DisProt in 2022: improved quality and accessibility of protein intrinsic disorder annotation.

Quaglia, Federica; Mészáros, Bálint; Salladini, Edoardo; Hatos, András; Pancsa, Rita; Chemes, Lucía B; Pajkos, Mátyás; Lazar, Tamas; Peña-Díaz, Samuel; Santos, Jaime; Ács, Veronika; Farahi, Nazanin; Fichó, Erzsébet; Aspromonte, Maria Cristina; Bassot, Claudio; Chasapi, Anastasia; Davey, Norman E; Davidovic, Radoslav; Dobson, Laszlo; Elofsson, Arne; Erdos, Gábor; Gaudet, Pascale; Giglio, Michelle; Glavina, Juliana; Iserte, Javier; Iglesias, Valentín; Kálmán, Zsófia; Lambrughi, Matteo; Leonardi, Emanuela; Longhi, Sonia; Macedo-Ribeiro, Sandra; Maiani, Emiliano; Marchetti, Julia; Marino-Buslje, Cristina; Mészáros, Attila; Monzon, Alexander Miguel; Minervini, Giovanni; Nadendla, Suvarna; Nilsson, Juliet F; Novotný, Marian; Ouzounis, Christos A; Palopoli, Nicolás; Papaleo, Elena; Pereira, Pedro José Barbosa; Pozzati, Gabriele; Promponas, Vasilis J; Pujols, Jordi; Rocha, Alma Carolina Sanchez; Salas, Martin; Sawicki, Luciana Rodriguez.

Nucleic Acids Res ; 50(D1): D480-D487, 2022 01 07.

Artículo en Inglés | MEDLINE | ID: mdl-34850135

RESUMEN

The Database of Intrinsically Disordered Proteins (DisProt, URL: https://disprot.org) is the major repository of manually curated annotations of intrinsically disordered proteins and regions from the literature. We report here recent updates of DisProt version 9, including a restyled web interface, refactored Intrinsically Disordered Proteins Ontology (IDPO), improvements in the curation process and significant content growth of around 30%. Higher quality and consistency of annotations is provided by a newly implemented reviewing process and training of curators. The increased curation capacity is fostered by the integration of DisProt with APICURON, a dedicated resource for the proper attribution and recognition of biocuration efforts. Better interoperability is provided through the adoption of the Minimum Information About Disorder (MIADE) standard, an active collaboration with the Gene Ontology (GO) and Evidence and Conclusion Ontology (ECO) consortia and the support of the ELIXIR infrastructure.

Asunto(s)

Bases de Datos de Proteínas , Proteínas Intrínsecamente Desordenadas/metabolismo , Anotación de Secuencia Molecular , Programas Informáticos , Secuencia de Aminoácidos , ADN/genética , ADN/metabolismo , Conjuntos de Datos como Asunto , Ontología de Genes , Humanos , Internet , Proteínas Intrínsecamente Desordenadas/química , Proteínas Intrínsecamente Desordenadas/genética , Unión Proteica , ARN/genética , ARN/metabolismo

7.

PolarProtPred: predicting apical and basolateral localization of transmembrane proteins using putative short linear motifs and deep learning.

Dobson, Laszlo; Zeke, András; Tusnády, Gábor E.

Bioinformatics ; 37(23): 4328-4335, 2021 12 07.

Artículo en Inglés | MEDLINE | ID: mdl-34185052

RESUMEN

MOTIVATION: Cell polarity refers to the asymmetric organization of cellular components in various cells. Epithelial cells are the best-known examples of polarized cells, featuring apical and basolateral membrane domains. Mounting evidence suggests that short linear motifs play a major role in protein trafficking to these domains, although the exact rules governing them are still elusive. RESULTS: In this study we prepared neural networks that capture recurrent patterns to classify transmembrane proteins localizing into apical and basolateral membranes. Asymmetric expression of drug transporters results in vectorial drug transport, governing the pharmacokinetics of numerous substances, yet the data on how proteins are sorted in epithelial cells is very scattered. The provided method may offer help to experimentalists to identify or better characterize molecular networks regulating the distribution of transporters or surface receptors (including viral entry receptors like that of COVID-19). AVAILABILITY AND IMPLEMENTATION: The prediction server PolarProtPred is available at http://polarprotpred.ttk.hu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Asunto(s)

COVID-19 , Aprendizaje Profundo , Humanos , Proteínas de la Membrana/metabolismo , Membrana Celular/metabolismo , Células Epiteliales/metabolismo

8.

PlaToLoCo: the first web meta-server for visualization and annotation of low complexity regions in proteins.

Jarnot, Patryk; Ziemska-Legiecka, Joanna; Dobson, Laszlo; Merski, Matthew; Mier, Pablo; Andrade-Navarro, Miguel A; Hancock, John M; Dosztányi, Zsuzsanna; Paladin, Lisanna; Necci, Marco; Piovesan, Damiano; Tosatto, Silvio C E; Promponas, Vasilis J; Grynberg, Marcin; Gruca, Aleksandra.

Nucleic Acids Res ; 48(W1): W77-W84, 2020 07 02.

Artículo en Inglés | MEDLINE | ID: mdl-32421769

RESUMEN

Low complexity regions (LCRs) in protein sequences are characterized by a less diverse amino acid composition compared to typically observed sequence diversity. Recent studies have shown that LCRs may co-occur with intrinsically disordered regions, are highly conserved in many organisms, and often play important roles in protein functions and in diseases. In previous decades, several methods have been developed to identify regions with LCRs or amino acid bias, but most of them as stand-alone applications and currently there is no web-based tool which allows users to explore LCRs in protein sequences with additional functional annotations. We aim to fill this gap by providing PlaToLoCo - PLAtform of TOols for LOw COmplexity-a meta-server that integrates and collects the output of five different state-of-the-art tools for discovering LCRs and provides functional annotations such as domain detection, transmembrane segment prediction, and calculation of amino acid frequencies. In addition, the union or intersection of the results of the search on a query sequence can be obtained. By developing the PlaToLoCo meta-server, we provide the community with a fast and easily accessible tool for the analysis of LCRs with additional information included to aid the interpretation of the results. The PlaToLoCo platform is available at: http://platoloco.aei.polsl.pl/.

Asunto(s)

Proteínas/química , Programas Informáticos , Aminoácidos/análisis , Gráficos por Computador , Humanos , Proteínas de la Membrana/química , Anotación de Secuencia Molecular , Dominios Proteicos , Análisis de Secuencia de Proteína

9.

MemDis: Predicting Disordered Regions in Transmembrane Proteins.

Dobson, Laszlo; Tusnády, Gábor E.

Int J Mol Sci ; 22(22)2021 Nov 12.

Artículo en Inglés | MEDLINE | ID: mdl-34830151

RESUMEN

Transmembrane proteins (TMPs) play important roles in cells, ranging from transport processes and cell adhesion to communication. Many of these functions are mediated by intrinsically disordered regions (IDRs), flexible protein segments without a well-defined structure. Although a variety of prediction methods are available for predicting IDRs, their accuracy is very limited on TMPs due to their special physico-chemical properties. We prepared a dataset containing membrane proteins exclusively, using X-ray crystallography data. MemDis is a novel prediction method, utilizing convolutional neural network and long short-term memory networks for predicting disordered regions in TMPs. In addition to attributes commonly used in IDR predictors, we defined several TMP specific features to enhance the accuracy of our method further. MemDis achieved the highest prediction accuracy on TMP-specific dataset among other popular IDR prediction methods.

Asunto(s)

Biología Computacional/métodos , Proteínas Intrínsecamente Desordenadas/química , Proteínas de la Membrana/química , Redes Neurales de la Computación , Secuencia de Aminoácidos , Minería de Datos/métodos , Bases de Datos de Proteínas/estadística & datos numéricos , Internet , Modelos Moleculares , Conformación Proteica , Reproducibilidad de los Resultados

10.

TSTMP: target selection for structural genomics of human transmembrane proteins.

Varga, Julia; Dobson, László; Reményi, István; Tusnády, Gábor E.

Nucleic Acids Res ; 45(D1): D325-D330, 2017 01 04.

Artículo en Inglés | MEDLINE | ID: mdl-27924015

RESUMEN

The TSTMP database is designed to help the target selection of human transmembrane proteins for structural genomics projects and structure modeling studies. Currently, there are only 60 known 3D structures among the polytopic human transmembrane proteins and about a further 600 could be modeled using existing structures. Although there are a great number of human transmembrane protein structures left to be determined, surprisingly only a small fraction of these proteins have 'selected' (or above) status according to the current version the TargetDB/TargetTrack database. This figure is even worse regarding those transmembrane proteins that would contribute the most to the structural coverage of the human transmembrane proteome. The database was built by sorting out proteins from the human transmembrane proteome with known structure and searching for suitable model structures for the remaining proteins by combining the results of a state-of-the-art transmembrane specific fold recognition algorithm and a sequence similarity search algorithm. Proteins were searched for homologues among the human transmembrane proteins in order to select targets whose successful structure determination would lead to the best structural coverage of the human transmembrane proteome. The pipeline constructed for creating the TSTMP database guarantees to keep the database up-to-date. The database is available at http://tstmp.enzim.ttk.mta.hu.

Asunto(s)

Biología Computacional/métodos , Bases de Datos de Proteínas , Genómica/métodos , Proteínas de la Membrana , Humanos , Proteínas de la Membrana/química , Proteínas de la Membrana/genética , Modelos Moleculares , Conformación Proteica , Proteoma , Proteómica/métodos , Relación Estructura-Actividad , Navegador Web

11.

Sequence and Structure Properties Uncover the Natural Classification of Protein Complexes Formed by Intrinsically Disordered Proteins via Mutual Synergistic Folding.

Mészáros, Bálint; Dobson, László; Fichó, Erzsébet; Simon, István.

Int J Mol Sci ; 20(21)2019 Nov 01.

Artículo en Inglés | MEDLINE | ID: mdl-31683980

RESUMEN

Intrinsically disordered proteins mediate crucial biological functions through their interactions with other proteins. Mutual synergistic folding (MSF) occurs when all interacting proteins are disordered, folding into a stable structure in the course of the complex formation. In these cases, the folding and binding processes occur in parallel, lending the resulting structures uniquely heterogeneous features. Currently there are no dedicated classification approaches that take into account the particular biological and biophysical properties of MSF complexes. Here, we present a scalable clustering-based classification scheme, built on redundancy-filtered features that describe the sequence and structure properties of the complexes and the role of the interaction, which is directly responsible for structure formation. Using this approach, we define six major types of MSF complexes, corresponding to biologically meaningful groups. Hence, the presented method also shows that differences in binding strength, subcellular localization, and regulation are encoded in the sequence and structural properties of proteins. While current protein structure classification methods can also handle complex structures, we show that the developed scheme is fundamentally different, and since it takes into account defining features of MSF complexes, it serves as a better representation of structures arising through this specific interaction mode.

Asunto(s)

Proteínas Intrínsecamente Desordenadas/química , Pliegue de Proteína , Estructura Secundaria de Proteína , Estructura Terciaria de Proteína , Secuencia de Aminoácidos , Animales , Análisis por Conglomerados , Humanos , Proteínas Intrínsecamente Desordenadas/clasificación , Proteínas Intrínsecamente Desordenadas/metabolismo , Cinética , Modelos Moleculares , Unión Proteica , Termodinámica

12.

Occurrence of Ordered and Disordered Structural Elements in Postsynaptic Proteins Supports Optimization for Interaction Diversity.

Kiss-Tóth, Annamária; Dobson, Laszlo; Péterfia, Bálint; Ángyán, Annamária F; Ligeti, Balázs; Lukács, Gergely; Gáspári, Zoltán.

Entropy (Basel) ; 21(8)2019 Aug 06.

Artículo en Inglés | MEDLINE | ID: mdl-33267475

RESUMEN

The human postsynaptic density is an elaborate network comprising thousands of proteins, playing a vital role in the molecular events of learning and the formation of memory. Despite our growing knowledge of specific proteins and their interactions, atomic-level details of their full three-dimensional structure and their rearrangements are mostly elusive. Advancements in structural bioinformatics enabled us to depict the characteristic features of proteins involved in different processes aiding neurotransmission. We show that postsynaptic protein-protein interactions are mediated through the delicate balance of intrinsically disordered regions and folded domains, and this duality is also imprinted in the amino acid sequence. We introduce Diversity of Potential Interactions (DPI), a structure and regulation based descriptor to assess the diversity of interactions. Our approach reveals that the postsynaptic proteome has its own characteristic features and these properties reliably discriminate them from other proteins of the human proteome. Our results suggest that postsynaptic proteins are especially susceptible to forming diverse interactions with each other, which might be key in the reorganization of the postsynaptic density (PSD) in molecular processes related to learning and memory.

13.

A conserved charged single α-helix with a putative steric role in paraspeckle formation.

Dobson, László; Nyitray, László; Gáspári, Zoltán.

RNA ; 21(12): 2023-9, 2015 Dec.

Artículo en Inglés | MEDLINE | ID: mdl-26428695

RESUMEN

Paraspeckles are subnuclear particles involved in the regulation of mRNA expression. They are formed by the association of DBHS family proteins and the NEAT1 long noncoding RNA. Here, we show that a recently identified structural motif, the charged single α-helix, is largely conserved in the DBHS family. Based on the available structural data and a previously suggested multimerization scheme of DBHS proteins, we built a structural model of a (PSPC1/NONO)(n) multimer that might have relevance in paraspeckle formation. Our model contains an extended coiled-coil region that is followed by and partially overlaps with the predicted charged single α-helix. We suggest that the charged single α-helix can act as an elastic ruler governing the exact positioning of the dimeric core structures relative to each other during paraspeckle assembly along the NEAT1 noncoding RNA.

Asunto(s)

Proteínas Asociadas a Matriz Nuclear/química , Proteínas Nucleares/química , Factores de Transcripción de Octámeros/química , Proteínas de Unión al ARN/química , Secuencia de Aminoácidos , Secuencia Conservada , Proteínas de Unión al ADN , Humanos , Interacciones Hidrofóbicas e Hidrofílicas , Modelos Moleculares , Datos de Secuencia Molecular , Factor de Empalme Asociado a PTB , Dominios y Motivos de Interacción de Proteínas , Multimerización de Proteína , Estructura Secundaria de Proteína

14.

TOPDOM: database of conservatively located domains and motifs in proteins.

Varga, Julia; Dobson, László; Tusnády, Gábor E.

Bioinformatics ; 32(17): 2725-6, 2016 09 01.

Artículo en Inglés | MEDLINE | ID: mdl-27153630

RESUMEN

UNLABELLED: The TOPDOM database-originally created as a collection of domains and motifs located consistently on the same side of the membranes in α-helical transmembrane proteins-has been updated and extended by taking into consideration consistently localized domains and motifs in globular proteins, too. By taking advantage of the recently developed CCTOP algorithm to determine the type of a protein and predict topology in case of transmembrane proteins, and by applying a thorough search for domains and motifs as well as utilizing the most up-to-date version of all source databases, we managed to reach a 6-fold increase in the size of the whole database and a 2-fold increase in the number of transmembrane proteins. AVAILABILITY AND IMPLEMENTATION: TOPDOM database is available at http://topdom.enzim.hu The webpage utilizes the common Apache, PHP5 and MySQL software to provide the user interface for accessing and searching the database. The database itself is generated on a high performance computer. CONTACT: tusnady.gabor@ttk.mta.hu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Asunto(s)

Bases de Datos de Proteínas , Dominios Proteicos , Programas Informáticos , Algoritmos , Secuencias de Aminoácidos , Secuencia Conservada , Sistemas de Administración de Bases de Datos , Almacenamiento y Recuperación de la Información , Proteínas de la Membrana , Estructura Terciaria de Proteína , Proteínas , Alineación de Secuencia , Interfaz Usuario-Computador

15.

CCTOP: a Consensus Constrained TOPology prediction web server.

Dobson, László; Reményi, István; Tusnády, Gábor E.

Nucleic Acids Res ; 43(W1): W408-12, 2015 Jul 01.

Artículo en Inglés | MEDLINE | ID: mdl-25943549

RESUMEN

The Consensus Constrained TOPology prediction (CCTOP; http://cctop.enzim.ttk.mta.hu) server is a web-based application providing transmembrane topology prediction. In addition to utilizing 10 different state-of-the-art topology prediction methods, the CCTOP server incorporates topology information from existing experimental and computational sources available in the PDBTM, TOPDB and TOPDOM databases using the probabilistic framework of hidden Markov model. The server provides the option to precede the topology prediction with signal peptide prediction and transmembrane-globular protein discrimination. The initial result can be recalculated by (de)selecting any of the prediction methods or mapped experiments or by adding user specified constraints. CCTOP showed superior performance to existing approaches. The reliability of each prediction is also calculated, which correlates with the accuracy of the per protein topology prediction. The prediction results and the collected experimental information are visualized on the CCTOP home page and can be downloaded in XML format. Programmable access of the CCTOP server is also available, and an example of client-side script is provided.

Asunto(s)

Proteínas de la Membrana/química , Programas Informáticos , Algoritmos , Humanos , Internet , Conformación Proteica

16.

Expediting topology data gathering for the TOPDB database.

Dobson, László; Langó, Tamás; Reményi, István; Tusnády, Gábor E.

Nucleic Acids Res ; 43(Database issue): D283-9, 2015 Jan.

Artículo en Inglés | MEDLINE | ID: mdl-25392424

RESUMEN

The Topology Data Bank of Transmembrane Proteins (TOPDB, http://topdb.enzim.ttk.mta.hu) contains experimentally determined topology data of transmembrane proteins. Recently, we have updated TOPDB from several sources and utilized a newly developed topology prediction algorithm to determine the most reliable topology using the results of experiments as constraints. In addition to collecting the experimentally determined topology data published in the last couple of years, we gathered topographies defined by the TMDET algorithm using 3D structures from the PDBTM. Results of global topology analysis of various organisms as well as topology data generated by high throughput techniques, like the sequential positions of N- or O-glycosylations were incorporated into the TOPDB database. Moreover, a new algorithm was developed to integrate scattered topology data from various publicly available databases and a new method was introduced to measure the reliability of predicted topologies. We show that reliability values highly correlate with the per protein topology accuracy of the utilized prediction method. Altogether, more than 52,000 new topology data and more than 2600 new transmembrane proteins have been collected since the last public release of the TOPDB database.

Asunto(s)

Bases de Datos de Proteínas , Proteínas de la Membrana/química , Conformación Proteica

17.

Disordered regions in transmembrane proteins.

Tusnády, Gábor E; Dobson, László; Tompa, Peter.

Biochim Biophys Acta ; 1848(11 Pt A): 2839-48, 2015 Nov.

Artículo en Inglés | MEDLINE | ID: mdl-26275590

RESUMEN

The functions of transmembrane proteins in living cells are widespread; they range from various transport processes to energy production, from cell-cell adhesion to communication. Structurally, they are highly ordered in their membrane-spanning regions, but may contain disordered regions in the cytosolic and extra-cytosolic parts. In this study, we have investigated the disordered regions in transmembrane proteins by a stringent definition of disordered residues on the currently available largest experimental dataset, and show a significant correlation between the spatial distributions of positively charged residues and disordered regions. This finding suggests a new role of disordered regions in transmembrane proteins by providing structural flexibility for stabilizing interactions with negatively charged head groups of the lipid molecules. We also find a preference of structural disorder in the terminal--as opposed to loop--regions in transmembrane proteins, and survey the respective functions involved in recruiting other proteins or mediating allosteric signaling effects. Finally, we critically compare disorder prediction methods on our transmembrane protein set. While there are no major differences between these methods using the usual statistics, such as per residue accuracies, Matthew's correlation coefficients, etc.; substantial differences can be found regarding the spatial distribution of the predicted disordered regions. We conclude that a predictor optimized for transmembrane proteins would be of high value to the field of structural disorder.

Asunto(s)

Bases de Datos de Proteínas , Proteínas de la Membrana/química , Modelos Moleculares , Conformación Proteica , Secuencia de Aminoácidos , Biología Computacional/métodos , Internet , Reproducibilidad de los Resultados

18.

How AlphaFold2 shaped the structural coverage of the human transmembrane proteome.

Jambrich, Márton A; Tusnady, Gabor E; Dobson, Laszlo.

Sci Rep ; 13(1): 20283, 2023 11 20.

Artículo en Inglés | MEDLINE | ID: mdl-37985809

RESUMEN

AlphaFold2 (AF2) provides a 3D structure for every known or predicted protein, opening up new prospects for virtually every field in structural biology. However, working with transmembrane protein molecules pose a notorious challenge for scientists, resulting in a limited number of experimentally determined structures. Consequently, algorithms trained on this finite training set also face difficulties. To address this issue, we recently launched the TmAlphaFold database, where predicted AlphaFold2 structures are embedded into the membrane plane and a quality assessment (plausibility of the membrane-embedded structure) is provided for each prediction using geometrical evaluation. In this paper, we analyze how AF2 has improved the structural coverage of membrane proteins compared to earlier years when only experimental structures were available, and high-throughput structure prediction was greatly limited. We also evaluate how AF2 can be used to search for (distant) homologs in highly diverse protein families. By combining quality assessment and homology search, we can pinpoint protein families where AF2 accuracy is still limited, and experimental structure determination would be desirable.

Asunto(s)

Furilfuramida , Proteoma , Humanos , Proteínas de la Membrana , Algoritmos , Bases de Datos Factuales

19.

LeishMANIAdb: a comparative resource for Leishmania proteins.

Tusnády, Gábor E; Zeke, András; Kálmán, Zsófia E; Fatoux, Marie; Ricard-Blum, Sylvie; Gibson, Toby J; Dobson, Laszlo.

Database (Oxford) ; 20232023 10 31.

Artículo en Inglés | MEDLINE | ID: mdl-37935582

RESUMEN

Leishmaniasis is a detrimental disease causing serious changes in quality of life and some forms can lead to death. The disease is spread by the parasite Leishmania transmitted by sandfly vectors and their primary hosts are vertebrates including humans. The pathogen penetrates host cells and secretes proteins (the secretome) to repurpose cells for pathogen growth and to alter cell signaling via host-pathogen protein-protein interactions). Here, we present LeishMANIAdb, a database specifically designed to investigate how Leishmania virulence factors may interfere with host proteins. Since the secretomes of different Leishmania species are only partially characterized, we collated various experimental evidence and used computational predictions to identify Leishmania secreted proteins to generate a user-friendly unified web resource allowing users to access all information available on experimental and predicted secretomes. In addition, we manually annotated host-pathogen interactions of 211 proteins and the localization/function of 3764 transmembrane (TM) proteins of different Leishmania species. We also enriched all proteins with automatic structural and functional predictions that can provide new insights in the molecular mechanisms of infection. Our database may provide novel insights into Leishmania host-pathogen interactions and help to identify new therapeutic targets for this neglected disease. Database URL https://leishmaniadb.ttk.hu/.

Asunto(s)

Leishmania , Leishmaniasis , Humanos , Animales , Leishmania/genética , Calidad de Vida , Leishmaniasis/genética , Leishmaniasis/metabolismo , Leishmaniasis/parasitología , Proteínas de la Membrana

20.

PSINDB: the postsynaptic protein-protein interaction database.

Kalman, Zsofia E; Dudola, Dániel; Mészáros, Bálint; Gáspári, Zoltán; Dobson, Laszlo.

Database (Oxford) ; 2022(2022)2022 03 02.

Artículo en Inglés | MEDLINE | ID: mdl-35234850

RESUMEN

The postsynaptic region is the receiving part of the synapse comprising thousands of proteins forming an elaborate and dynamically changing network indispensable for the molecular mechanisms behind fundamental phenomena such as learning and memory. Despite the growing amount of information about individual protein-protein interactions (PPIs) in this network, these data are mostly scattered in the literature or stored in generic databases that are not designed to display aspects that are fundamental to the understanding of postsynaptic functions. To overcome these limitations, we collected postsynaptic PPIs complemented by a high amount of detailed structural and biological information and launched a freely available resource, the Postsynaptic Interaction Database (PSINDB), to make these data and annotations accessible. PSINDB includes tens of thousands of binding regions together with structural features, mediating and regulating the formation of PPIs, annotated with detailed experimental information about each interaction. PSINDB is expected to be useful for various aspects of molecular neurobiology research, from experimental design to network and systems biology-based modeling and analysis of changes in the protein network upon various stimuli. Database URL https://psindb.itk.ppke.hu/.

Asunto(s)

Mapeo de Interacción de Proteínas , Proteínas , Bases de Datos de Proteínas , Mapas de Interacción de Proteínas , Proteínas/química

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA