Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 13 de 13
Filtrar
1.
Bioinformatics ; 35(24): 5309-5312, 2019 12 15.
Artículo en Inglés | MEDLINE | ID: mdl-31250907

RESUMEN

SUMMARY: JUCHMME is an open-source software package designed to fit arbitrary custom Hidden Markov Models (HMMs) with a discrete alphabet of symbols. We incorporate a large collection of standard algorithms for HMMs as well as a number of extensions and evaluate the software on various biological problems. Importantly, the JUCHMME toolkit includes several additional features that allow for easy building and evaluation of custom HMMs, which could be a useful resource for the research community. AVAILABILITY AND IMPLEMENTATION: http://www.compgen.org/tools/juchmme, https://github.com/pbagos/juchmme. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Algoritmos , Programas Informáticos , Análisis de Secuencia
2.
J Proteome Res ; 18(5): 2310-2320, 2019 05 03.
Artículo en Inglés | MEDLINE | ID: mdl-30908064

RESUMEN

Voltage-gated ion channels (VGICs) are one of the largest groups of transmembrane proteins. Due to their major role in the generation and propagation of electrical signals, VGICs are considered important from a medical viewpoint, and their dysfunction is often associated with Channelopathies. We identified disease-associated mutations and polymorphisms in these proteins through mapping missense single-nucleotide polymorphisms from the UniProt and ClinVar databases on their amino acid sequence, considering their special topological and functional characteristics. Statistical analysis revealed that disease-associated SNPs are mostly found in the voltage sensor domain and the pore loop. Both of these regions are extremely important for the activation and ion conductivity of VGICs. Moreover, among the most frequently observed mutations are those of arginine to glutamine, to histidine or to cysteine, which can probably be attributed to the extremely important role of arginine residues in the regulation of membrane potential in these proteins. We suggest that topological information in combination with genetic variation data can contribute toward a better evaluation of the effect of currently unclassified mutations in VGICs. It is hoped that potential associations with certain disease phenotypes will be revealed in the future with the use of similar approaches.


Asunto(s)
Canales de Calcio/genética , Canalopatías/genética , Polimorfismo de Nucleótido Simple , Canales de Potasio con Entrada de Voltaje/genética , Canales de Sodio Activados por Voltaje/genética , Secuencia de Aminoácidos , Arginina/metabolismo , Canales de Calcio/clasificación , Canales de Calcio/metabolismo , Canalopatías/metabolismo , Canalopatías/patología , Cisteína/metabolismo , Bases de Datos de Proteínas , Expresión Génica , Glutamina/metabolismo , Histidina/metabolismo , Humanos , Activación del Canal Iónico/genética , Modelos Moleculares , Canales de Potasio con Entrada de Voltaje/clasificación , Canales de Potasio con Entrada de Voltaje/metabolismo , Conformación Proteica , Dominios Proteicos , Proteómica/métodos , Canales de Sodio Activados por Voltaje/clasificación , Canales de Sodio Activados por Voltaje/metabolismo
3.
J Bioinform Comput Biol ; 22(4): 2450021, 2024 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-39215524

RESUMEN

Sorting signals are crucial for the anchoring of proteins to the cell surface in archaea and bacteria. These proteins often feature distinct motifs at their C-terminus, cleaved by sortase or sortase-like enzymes. Gram-positive bacteria exhibit the LPXTGX consensus motif, cleaved by sortases, while Gram-negative bacteria employ exosortases recognizing motifs like PEP. Archaea utilize exosortase homologs known as archaeosortases for signal anchoring. Traditionally identification of such C-terminal sorting signals was performed with profile Hidden Markov Models (pHMMs). The Cell-Wall PREDiction (CW-PRED) method introduced for the first time a custom-made class HMM for proteins in Gram-positive bacteria that contain a cell wall sorting signal which begins with an LPXTG motif, followed by a hydrophobic domain and a tail of positively charged residues. Here we present a new and updated version of CW-PRED for predicting C-terminal sorting signals in Archaea, Gram-positive, and Gram-negative bacteria. We used a large training set and several model enhancements that improve motif identification in order to achieve better discrimination between C-terminal signals and other proteins. Cross-validation demonstrates CW-PRED's superiority in sensitivity and specificity compared to other methods. Application of the method in reference proteomes reveals a large number of potential surface proteins not previously identified. The method is available for academic use at http://195.251.108.230/apps.compgen.org/CW-PRED/ and as standalone software.


Asunto(s)
Proteínas Arqueales , Proteínas Bacterianas , Señales de Clasificación de Proteína , Proteínas Bacterianas/metabolismo , Proteínas Bacterianas/química , Proteínas Bacterianas/genética , Proteínas Arqueales/metabolismo , Proteínas Arqueales/química , Proteínas Arqueales/genética , Archaea/metabolismo , Archaea/genética , Biología Computacional/métodos , Pared Celular/metabolismo , Pared Celular/química , Cadenas de Markov , Secuencias de Aminoácidos , Programas Informáticos , Bacterias/metabolismo , Bacterias/genética , Algoritmos
4.
Biomolecules ; 13(2)2023 02 01.
Artículo en Inglés | MEDLINE | ID: mdl-36830638

RESUMEN

Receptor tyrosine kinases (RTKs) form a highly important group of protein receptors of the eukaryotic cell membrane. They control many vital cellular functions and are involved in the regulation of complex signaling networks. Mutations in RTKs have been associated with different types of cancers and other diseases. Although they are very important for proper cell function, they have been experimentally studied in a limited range of eukaryotic species. Currently, there is no available database for RTKs providing information about their function, expression, and interactions. Therefore, the identification of RTKs in multiple organisms, the documentation of their characteristics, and the collection of related information would be very useful. In this paper, we present a novel RTK detection pipeline (RTK-PRED) and the Receptor Tyrosine Kinases Database (TyReK-DB). RTK-PRED combines profile HMMs with transmembrane topology prediction to identify and classify potential RTKs. Proteins of all eukaryotic reference proteomes of the UniProt database were used as input in RTK-PRED leading to a filtered dataset of 20,478 RTKs. Based on the information collected for these RTKs from multiple databases, the relational TyReK database was created.


Asunto(s)
Neoplasias , Proteoma , Humanos , Proteínas Tirosina Quinasas Receptoras/metabolismo , Transducción de Señal/fisiología , Neoplasias/metabolismo , Tirosina
5.
Membranes (Basel) ; 13(1)2023 Jan 03.
Artículo en Inglés | MEDLINE | ID: mdl-36676869

RESUMEN

The nuclear envelope (NE) is a double-membrane system surrounding the nucleus of eukaryotic cells. A large number of proteins are localized in the NE, performing a wide variety of functions, from the bidirectional exchange of molecules between the cytoplasm and the nucleus to chromatin tethering, genome organization, regulation of signaling cascades, and many others. Despite its importance, several aspects of the NE, including its protein-protein interactions, remain understudied. In this work, we present NucEnvDB, a publicly available database of NE proteins and their interactions. Each database entry contains useful annotation including a description of its position in the NE, its interactions with other proteins, and cross-references to major biological repositories. In addition, the database provides users with a number of visualization and analysis tools, including the ability to construct and visualize protein-protein interaction networks and perform functional enrichment analysis for clusters of NE proteins and their interaction partners. The capabilities of NucEnvDB and its analysis tools are showcased by two informative case studies, exploring protein-protein interactions in Hutchinson-Gilford progeria and during SARS-CoV-2 infection at the level of the nuclear envelope.

6.
Biochim Biophys Acta Biomembr ; 1864(9): 183956, 2022 09 01.
Artículo en Inglés | MEDLINE | ID: mdl-35577076

RESUMEN

Ligand-Gated Ion Channels (LGICs) is one of the largest groups of transmembrane proteins. Due to their major role in synaptic transmission, both in the nervous system and the somatic neuromuscular junction, LGICs present attractive therapeutic targets. During the last few years, several computational methods for the detection of LGICs have been developed. These methods are based on machine learning approaches utilizing features extracted solely from the amino acid composition. Here we report the development of LiGIoNs, a profile Hidden Markov Model (pHMM) method for the prediction and ligand-based classification of LGICs. The method consists of a library of 10 pHMMs, one per LGIC subfamily, built from the alignment of representative LGIC sequences. In addition, 14 Pfam pHMMs are used to further annotate and classify unknown protein sequences into one of the 10 LGIC subfamilies. Evaluation of the method showed that it outperforms existing methods in the detection of LGICs. On top of that, LiGIoNs is the only currently available method that classifies LGICs into subfamilies. The method is available online at http://bioinformatics.biol.uoa.gr/ligions/.


Asunto(s)
Canales Iónicos Activados por Ligandos , Secuencia de Aminoácidos , Ligandos
7.
Amyloid ; 26(3): 112-117, 2019 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-31094220

RESUMEN

Amyloid fibrils are formed when soluble proteins misfold into highly ordered insoluble fibrillar aggregates and affect various organs and tissues. The deposition of amyloid fibrils is the main hallmark of a group of disorders, called amyloidoses. Curiously, fibril deposition has been also recorded as a complication in a number of other pathological conditions, including well-known neurodegenerative or endocrine diseases. To date, amyloidoses are roughly classified, owing to their tremendous heterogeneity. In this work, we introduce AmyCo, a freely available collection of amyloidoses and clinical disorders related to amyloid deposition. AmyCo classifies 75 diseases associated with amyloid deposition into two distinct categories, namely 1) amyloidosis and 2) clinical conditions associated with amyloidosis. Each database entry is annotated with the major protein component (causative protein), other components of amyloid deposits and affected tissues or organs. Database entries are also supplemented with appropriate detailed annotation and are referenced to ICD-10, MeSH, OMIM, PubMed, AmyPro and UniProtKB databases. To our knowledge, AmyCo is the first attempt towards the creation of a complete and an up-to-date repository, containing information about amyloidoses and diseases related to amyloid deposition. The AmyCo web interface is available at http://bioinformatics.biol.uoa.gr/amyco .


Asunto(s)
Enfermedad de Alzheimer/clasificación , Amiloide/genética , Amiloidosis/clasificación , Enfermedad de Parkinson/clasificación , Enfermedad de Alzheimer/diagnóstico , Enfermedad de Alzheimer/genética , Enfermedad de Alzheimer/metabolismo , Amiloide/metabolismo , Amiloidosis/diagnóstico , Amiloidosis/genética , Amiloidosis/metabolismo , Bases de Datos Factuales , Estudio de Asociación del Genoma Completo , Humanos , Mutación , Enfermedad de Parkinson/diagnóstico , Enfermedad de Parkinson/genética , Enfermedad de Parkinson/metabolismo , Terminología como Asunto
8.
J Bioinform Comput Biol ; 6(2): 387-401, 2008 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-18464329

RESUMEN

Surface proteins in Gram-positive bacteria are frequently implicated in virulence. We have focused on a group of extracellular cell wall-attached proteins (CWPs), containing an LPXTG motif for cleavage and covalent coupling to peptidoglycan by sortase enzymes. A hidden Markov model (HMM) approach for predicting the LPXTG-anchored cell wall proteins of Gram-positive bacteria was developed and compared against existing methods. The HMM model is parsimonious in terms of the number of freely estimated parameters, and it has proved to be very sensitive and specific in a training set of 55 experimentally verified LPXTG-anchored cell wall proteins as well as in reliable data sets of globular and transmembrane proteins. In order to identify such proteins in Gram-positive bacteria, a comprehensive analysis of 94 completely sequenced genomes has been performed. We identified, in total, 860 LPXTG-anchored cell wall proteins, a number that is significantly higher compared to those obtained by other available methods. Of these proteins, 237 are hypothetical proteins according to the annotation of SwissProt, and 88 had no homologs in the SwissProt database--this might be evidence that they are members of newly identified families of CWPs. The prediction tool, the database with the proteins identified in the genomes, and supplementary material are available online at http://bioinformatics.biol.uoa.gr/CW-PRED/.


Asunto(s)
Algoritmos , Pared Celular/metabolismo , Genoma Bacteriano , Bacterias Grampositivas/genética , Cadenas de Markov , Animales , Pared Celular/genética , Humanos , Modelos Genéticos , Valor Predictivo de las Pruebas
9.
PLoS One ; 12(2): e0171512, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-28158294

RESUMEN

Cutaneous melanoma is a malignant tumor of skin melanocytes that are pigment-producing cells located in the basal layer (stratum basale) of epidermis. Accumulation of genetic mutations within their oncogenes or tumor-suppressor genes compels melanocytes to aberrant proliferation and spread to distant organs of the body, thereby resulting in severe and/or lethal malignancy. Metastatic melanoma's heavy mutational load, molecular heterogeneity and resistance to therapy necessitate the development of novel biomarkers and drug-based protocols that target key proteins involved in perpetuation of the disease. To this direction, we have herein employed a nano liquid chromatography-tandem mass spectrometry (nLC-MS/MS) proteomics technology to profile the deep-proteome landscape of WM-266-4 human metastatic melanoma cells. Our advanced melanoma-specific catalogue proved to contain 6,681 unique proteins, which likely constitute the hitherto largest single cell-line-derived proteomic collection of the disease. Through engagement of UNIPROT, DAVID, KEGG, PANTHER, INTACT, CYTOSCAPE, dbEMT and GAD bioinformatics resources, WM-266-4 melanoma proteins were categorized according to their sub-cellular compartmentalization, function and tumorigenicity, and successfully reassembled in molecular networks and interactomes. The obtained data dictate the presence of plastically inter-converted sub-populations of non-cancer and cancer stem cells, and also indicate the oncoproteomic resemblance of melanoma to glioma and lung cancer. Intriguingly, WM-266-4 cells seem to be subjected to both epithelial-to-mesenchymal (EMT) and mesenchymal-to-epithelial (MET) programs, with 1433G and ADT3 proteins being identified in the EMT/MET molecular interface. Oncogenic addiction of WM-266-4 cells to autocrine/paracrine signaling of IL17-, DLL3-, FGF(2/13)- and OSTP-dependent sub-routines suggests their critical contribution to the metastatic melanoma chemotherapeutic refractoriness. Interestingly, the 1433G family member that is shared between the BRAF- and EMT/MET-specific interactomes likely emerges as a novel and promising druggable target for the malignancy. Derailed proliferation and metastatic capacity of WM-266-4 cells could also derive from their metabolic addiction to pathways associated with glutamate/ammonia, propanoate and sulfur homeostasis, whose successful targeting may prove beneficial for advanced melanoma-affected patients.


Asunto(s)
Melanoma/metabolismo , Proteínas de Neoplasias/metabolismo , Proteoma , Neoplasias Cutáneas/metabolismo , Proteínas 14-3-3/metabolismo , Amoníaco/metabolismo , Animales , Línea Celular Tumoral , Cromatografía Liquida/métodos , Resistencia a Antineoplásicos/genética , Femenino , Heterogeneidad Genética , Humanos , Melaninas/biosíntesis , Melanoma/genética , Ratones SCID , Factor de Transcripción Asociado a Microftalmía/metabolismo , Persona de Mediana Edad , Proteínas de Neoplasias/genética , Trasplante de Neoplasias , Propionatos/metabolismo , Mapas de Interacción de Proteínas , Proteoma/genética , Proteínas Proto-Oncogénicas B-raf/metabolismo , Transducción de Señal , Neoplasias Cutáneas/genética , Espectrometría de Masas en Tándem/métodos , Melanoma Cutáneo Maligno
10.
Genomics Proteomics Bioinformatics ; 4(1): 48-55, 2006 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-16689702

RESUMEN

The ability to predict the subcellular localization of a protein from its sequence is of great importance, as it provides information about the protein's function. We present a computational tool, PredSL, which utilizes neural networks, Markov chains, profile hidden Markov models, and scoring matrices for the prediction of the subcellular localization of proteins in eukaryotic cells from the N-terminal amino acid sequence. It aims to classify proteins into five groups: chloroplast, thylakoid, mitochondrion, secretory pathway, and "other". When tested in a five-fold cross-validation procedure, PredSL demonstrates 86.7% and 87.1% overall accuracy for the plant and non-plant datasets, respectively. Compared with TargetP, which is the most widely used method to date, and LumenP, the results of PredSL are comparable in most cases. When tested on the experimentally verified proteins of the Saccharomyces cerevisiae genome, PredSL performs comparably if not better than any available algorithm for the same task. Furthermore, PredSL is the only method capable for the prediction of these subcellular localizations that is available as a stand-alone application through the URL:http://bioinformatics.biol.uoa.gr/PredSL/.


Asunto(s)
Biología Computacional/métodos , Orgánulos/metabolismo , Fragmentos de Péptidos/metabolismo , Señales de Clasificación de Proteína , Proteínas/metabolismo , Análisis de Secuencia de Proteína/métodos , Programas Informáticos , Algoritmos , Bases de Datos de Proteínas , Señales de Clasificación de Proteína/fisiología
11.
Biomed Res Int ; 2014: 397145, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25057483

RESUMEN

A major part of membrane function is conducted by proteins, both integral and peripheral. Peripheral membrane proteins temporarily adhere to biological membranes, either to the lipid bilayer or to integral membrane proteins with noncovalent interactions. The aim of this study was to construct and analyze the interactions of the human plasma membrane peripheral proteins (peripherome hereinafter). For this purpose, we collected a dataset of peripheral proteins of the human plasma membrane. We also collected a dataset of experimentally verified interactions for these proteins. The interaction network created from this dataset has been visualized using Cytoscape. We grouped the proteins based on their subcellular location and clustered them using the MCL algorithm in order to detect functional modules. Moreover, functional and graph theory based analyses have been performed to assess biological features of the network. Interaction data with drug molecules show that ~10% of peripheral membrane proteins are targets for approved drugs, suggesting their potential implications in disease. In conclusion, we reveal novel features and properties regarding the protein-protein interaction network created by peripheral proteins of the human plasma membrane.


Asunto(s)
Membrana Celular/química , Biología Computacional/métodos , Mapeo de Interacción de Proteínas/métodos , Proteoma , Algoritmos , Análisis por Conglomerados , Bases de Datos de Proteínas , Humanos , Membrana Dobles de Lípidos/química , Preparaciones Farmacéuticas/química , Programas Informáticos
12.
In Silico Biol ; 6(5): 379-86, 2006.
Artículo en Inglés | MEDLINE | ID: mdl-17274767

RESUMEN

Genomics and proteomics have added valuable information to our knowledgebase of the human biological system including the discovery of therapeutic targets and disease biomarkers. However, molecular profiling studies commonly result in the identification of novel proteins of unknown localization. A class of proteins of special interest is membrane proteins, in particular plasma membrane proteins. Despite their biological and medical significance, the 3-dimensional structures of less than 1% of plasma membrane proteins have been determined. In order to aid in identification of membrane proteins, a number of computational methods have been developed. These tools operate by predicting the presence of transmembrane segments. Here, we utilized five topology prediction methods (TMHMM, SOSUI, waveTM, HMMTOP, and TopPred II) in order to estimate the ratio of integral membrane proteins in the human proteome. These methods employ different algorithms and include a newly-developed method (waveTM) that has yet to be tested on a large proteome database. Since these tools are prone for error mainly as a result of falsely predicting signal peptides as transmembrane segments, we have utilized an additional method, SignalP. Based on our analyses, the ratio of human proteins with transmembrane segments is estimated to fall between 15% and 39% with a consensus of 13%. Agreement among the programs is reduced further when both a positive identification of a membrane protein and the number of transmembrane segments per protein are considered. Such a broad range of prediction depends on the selectivity of the individual method in predicting integral membrane proteins. These methods can play a critical role in determining protein structure and, hence, identifying suitable drug targets in humans.


Asunto(s)
Proteínas de la Membrana/genética , Algoritmos , Simulación por Computador , Bases de Datos de Proteínas , Humanos , Señales de Clasificación de Proteína/genética , Proteoma , Proteómica/estadística & datos numéricos , Programas Informáticos
13.
In Silico Biol ; 4(2): 127-31, 2004.
Artículo en Inglés | MEDLINE | ID: mdl-15107018

RESUMEN

waveTM is a web tool for the prediction of transmembrane segments in alpha-helical membrane proteins. Prediction is performed by a dynamic programming algorithm on wavelet-denoised 'hydropathy' signals. Users submit a protein sequence and receive interactively the results. Topology prediction can also be obtained in conjunction with the algorithm OrienTM. A web server that implements the waveTM algorithm is freely available at http://bioinformatics.biol.uoa.gr/waveTM.


Asunto(s)
Proteínas de la Membrana/química , Membranas/metabolismo , Análisis de Secuencia de Proteína/métodos , Programas Informáticos , Algoritmos , Biología Computacional/métodos , Simulación por Computador , Bases de Datos de Proteínas , Conformación Proteica , Estructura Secundaria de Proteína
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA