Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 76
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Mol Psychiatry ; 29(1): 186-196, 2024 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-38102483

RESUMEN

Autism spectrum disorder (ASD) comprises a large group of neurodevelopmental conditions featuring, over a wide range of severity and combinations, a core set of manifestations (restricted sociality, stereotyped behavior and language impairment) alongside various comorbidities. Common and rare variants in several hundreds of genes and regulatory regions have been implicated in the molecular pathogenesis of ASD along a range of causation evidence strength. Despite significant progress in elucidating the impact of few paradigmatic individual loci, such sheer complexity in the genetic architecture underlying ASD as a whole has hampered the identification of convergent actionable hubs hypothesized to relay between the vastness of risk alleles and the core phenotypes. In turn this has limited the development of strategies that can revert or ameliorate this condition, calling for a systems-level approach to probe the cross-talk of cooperating genes in terms of causal interaction networks in order to make convergences experimentally tractable and reveal their clinical actionability. As a first step in this direction, we have captured from the scientific literature information on the causal links between the genes whose variants have been associated with ASD and the whole human proteome. This information has been annotated in a computer readable format in the SIGNOR database and is made freely available in the resource website. To link this information to cell functions and phenotypes, we have developed graph algorithms that estimate the functional distance of any protein in the SIGNOR causal interactome to phenotypes and pathways. The main novelty of our approach resides in the possibility to explore the mechanistic links connecting the suggested gene-phenotype relations.


Asunto(s)
Trastorno del Espectro Autista , Predisposición Genética a la Enfermedad , Trastornos del Neurodesarrollo , Fenotipo , Humanos , Trastorno del Espectro Autista/genética , Predisposición Genética a la Enfermedad/genética , Trastornos del Neurodesarrollo/genética , Redes Reguladoras de Genes/genética , Trastorno Autístico/genética , Estudios de Asociación Genética/métodos , Proteoma/genética
2.
Nucleic Acids Res ; 51(D1): D631-D637, 2023 01 06.
Artículo en Inglés | MEDLINE | ID: mdl-36243968

RESUMEN

The SIGnaling Network Open Resource (SIGNOR 3.0, https://signor.uniroma2.it) is a public repository that captures causal information and represents it according to an 'activity-flow' model. SIGNOR provides freely-accessible static maps of causal interactions that can be tailored, pruned and refined to build dynamic and predictive models. Each signaling relationship is annotated with an effect (up/down-regulation) and with the mechanism (e.g. binding, phosphorylation, transcriptional activation, etc.) causing the regulation of the target entity. Since its latest release, SIGNOR has undergone a significant upgrade including: (i) a new website that offers an improved user experience and novel advanced search and graph tools; (ii) a significant content growth adding up to a total of approx. 33,000 manually-annotated causal relationships between more than 8900 biological entities; (iii) an increase in the number of manually annotated pathways, currently including pathways deregulated by SARS-CoV-2 infection or involved in neurodevelopment synaptic transmission and metabolism, among others; (iv) additional features such as new model to represent metabolic reactions and a new confidence score assigned to each interaction.


Asunto(s)
Bases de Datos de Proteínas , Humanos , COVID-19 , Fosforilación , SARS-CoV-2/genética , Transducción de Señal , Regulación de la Expresión Génica
3.
Nucleic Acids Res ; 48(D1): D416-D421, 2020 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-31598703

RESUMEN

CancerGeneNet (https://signor.uniroma2.it/CancerGeneNet/) is a resource that links genes that are frequently mutated in cancers to cancer phenotypes. The resource takes advantage of a curation effort aimed at embedding a large fraction of the gene products that are found altered in cancer cells into a network of causal protein relationships. Graph algorithms, in turn, allow to infer likely paths of causal interactions linking cancer associated genes to cancer phenotypes thus offering a rational framework for the design of strategies to revert disease phenotypes. CancerGeneNet bridges two interaction layers by connecting proteins whose activities are affected by cancer drivers to proteins that impact on the 'hallmarks of cancer'. In addition, CancerGeneNet annotates curated pathways that are relevant to rationalize the pathological consequences of cancer driver mutations in selected common cancers and 'MiniPathways' illustrating regulatory circuits that are frequently altered in different cancers.


Asunto(s)
Bases de Datos Genéticas , Neoplasias/genética , Proteínas/genética , Algoritmos , Antineoplásicos/farmacología , Gráficos por Computador , Humanos , Terapia Molecular Dirigida , Neoplasias/tratamiento farmacológico , Fenotipo , Interfaz Usuario-Computador
4.
Nucleic Acids Res ; 48(D1): D504-D510, 2020 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-31665520

RESUMEN

The SIGnaling Network Open Resource 2.0 (SIGNOR 2.0) is a public repository that stores signaling information as binary causal relationships between biological entities. The captured information is represented graphically as a signed directed graph. Each signaling relationship is associated to an effect (up/down-regulation) and to the mechanism (e.g. binding, phosphorylation, transcriptional activation, etc.) causing the up/down-regulation of the target entity. Since its first release, SIGNOR has undergone a significant content increase and the number of annotated causal interactions have almost doubled. SIGNOR 2.0 now stores almost 23 000 manually-annotated causal relationships between proteins and other biologically relevant entities: chemicals, phenotypes, complexes, etc. We describe here significant changes in curation policy and a new confidence score, which is assigned to each interaction. We have also improved the compliance to the FAIR data principles by providing (i) SIGNOR stable identifiers, (ii) programmatic access through REST APIs, (iii) bioschemas and (iv) downloadable data in standard-compliant formats, such as PSI-MI CausalTAB and GMT. The data are freely accessible and downloadable at https://signor.uniroma2.it/.


Asunto(s)
Bases de Datos Factuales , Transducción de Señal , Programas Informáticos , Animales , Humanos , Mapas de Interacción de Proteínas
6.
Nucleic Acids Res ; 46(D1): D527-D534, 2018 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-29036667

RESUMEN

DISNOR is a new resource that aims at exploiting the explosion of data on the identification of disease-associated genes to assemble inferred disease pathways. This may help dissecting the signaling events whose disruption causes the pathological phenotypes and may contribute to build a platform for precision medicine. To this end we combine the gene-disease association (GDA) data annotated in the DisGeNET resource with a new curation effort aimed at populating the SIGNOR database with causal interactions related to disease genes with the highest possible coverage. DISNOR can be freely accessed at http://DISNOR.uniroma2.it/ where >3700 disease-networks, linking ∼2600 disease genes, can be explored. For each disease curated in DisGeNET, DISNOR links disease genes by manually annotated causal relationships and offers an intuitive visualization of the inferred 'patho-pathways' at different complexity levels. User-defined gene lists are also accepted in the query pipeline. In addition, for each list of query genes-either annotated in DisGeNET or user-defined-DISNOR performs a gene set enrichment analysis on KEGG-defined pathways or on the lists of proteins associated with the inferred disease pathways. This function offers additional information on disease-associated cellular pathways and disease similarity.


Asunto(s)
Bases de Datos Genéticas , Enfermedad/genética , Curaduría de Datos , Redes Reguladoras de Genes , Estudios de Asociación Genética , Humanos , Internet , Mutación , Polimorfismo de Nucleótido Simple , Motor de Búsqueda , Transducción de Señal/genética , Programas Informáticos , Interfaz Usuario-Computador
7.
Int J Mol Sci ; 20(5)2019 Mar 08.
Artículo en Inglés | MEDLINE | ID: mdl-30857167

RESUMEN

The newly identified CUBAN (Cullin binding domain associating with NEDD8) domain recognizes both ubiquitin and the ubiquitin-like NEDD8. Despite the high similarity between the two molecules, CUBAN shows a clear preference for NEDD8, free and conjugated to cullins. We previously characterized the domain structure, both alone and in complex with NEDD8. The results here reported are addressed to investigate the determinants that drive the selective binding of CUBAN towards NEDD8 and ubiquitin. The 15N HSQC NMR perturbation pattern of the labeled CUBAN domain, when combined with either NEDD8 or ubiquitin, shows a clear involvement of hydrophobic residues that characterize the early stages of these interactions. After a slow conformational selection step, hydrophobic and then neutral and polar interactions take place, which drive the correct orientation of the CUBAN domain, leading to differences in the recognition scheme of NEDD8 and ubiquitin. As a result, a cascade of induced fit steps seems to determine the structural preference shown for NEDD8 and therefore the basis of the selectivity of the CUBAN domain. Finally, molecular dynamics analysis was performed to determine by fluctuations the internal flexibility of the CUBAN/NEDD8 complex. We consider that our results, based on a structural investigation mainly focused on the early stages of the recognition, provide a fruitful opportunity to report the different behavior of the same protein with two highly similar binding partners.


Asunto(s)
Proteína NEDD8/metabolismo , Ubiquitina/metabolismo , Humanos , Simulación del Acoplamiento Molecular , Simulación de Dinámica Molecular , Proteína NEDD8/química , Resonancia Magnética Nuclear Biomolecular , Unión Proteica , Conformación Proteica , Dominios y Motivos de Interacción de Proteínas , Mapas de Interacción de Proteínas , Ubiquitina/química , Ubiquitinación
8.
J Biol Chem ; 292(12): 4942-4952, 2017 03 24.
Artículo en Inglés | MEDLINE | ID: mdl-28159843

RESUMEN

Reversible tyrosine phosphorylation is a widespread post-translational modification mechanism underlying cell physiology. Thus, understanding the mechanisms responsible for substrate selection by kinases and phosphatases is central to our ability to model signal transduction at a system level. Classical protein-tyrosine phosphatases can exhibit substrate specificity in vivo by combining intrinsic enzymatic specificity with the network of protein-protein interactions, which positions the enzymes in close proximity to their substrates. Here we use a high throughput approach, based on high density phosphopeptide chips, to determine the in vitro substrate preference of 16 members of the protein-tyrosine phosphatase family. This approach helped identify one residue in the substrate binding pocket of the phosphatase domain that confers specificity for phosphopeptides in a specific sequence context. We also present a Bayesian model that combines intrinsic enzymatic specificity and interaction information in the context of the human protein interaction network to infer new phosphatase substrates at the proteome level.


Asunto(s)
Fosfopéptidos/metabolismo , Proteínas Tirosina Fosfatasas/metabolismo , Secuencia de Aminoácidos , Teorema de Bayes , Sitios de Unión , Humanos , Modelos Biológicos , Simulación del Acoplamiento Molecular , Fosfopéptidos/química , Fosforilación , Conformación Proteica , Dominios Proteicos , Mapas de Interacción de Proteínas , Proteínas Tirosina Fosfatasas/química , Especificidad por Sustrato
9.
Nucleic Acids Res ; 44(D1): D548-54, 2016 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-26467481

RESUMEN

Assembly of large biochemical networks can be achieved by confronting new cell-specific experimental data with an interaction subspace constrained by prior literature evidence. The SIGnaling Network Open Resource, SIGNOR (available on line at http://signor.uniroma2.it), was developed to support such a strategy by providing a scaffold of prior experimental evidence of causal relationships between biological entities. The core of SIGNOR is a collection of approximately 12,000 manually-annotated causal relationships between over 2800 human proteins participating in signal transduction. Other entities annotated in SIGNOR are complexes, chemicals, phenotypes and stimuli. The information captured in SIGNOR can be represented as a signed directed graph illustrating the activation/inactivation relationships between signalling entities. Each entry is associated to the post-translational modifications that cause the activation/inactivation of the target proteins. More than 4900 modified residues causing a change in protein concentration or activity have been curated and linked to the modifying enzymes (about 351 human kinases and 94 phosphatases). Additional modifications such as ubiquitinations, sumoylations, acetylations and their effect on the modified target proteins are also annotated. This wealth of structured information can support experimental approaches based on multi-parametric analysis of cell systems after physiological or pathological perturbations and to assemble large logic models.


Asunto(s)
Bases de Datos de Proteínas , Transducción de Señal , Humanos , Internet , Péptidos y Proteínas de Señalización Intracelular/química , Fosfoproteínas Fosfatasas/química , Fosfoproteínas Fosfatasas/metabolismo , Proteínas Quinasas/química , Proteínas Quinasas/metabolismo
10.
Br J Cancer ; 115(12): 1451-1456, 2016 Dec 06.
Artículo en Inglés | MEDLINE | ID: mdl-27875520

RESUMEN

The biguanide drug metformin profoundly affects cell metabolism, causing an impairment of the cell energy balance and triggering a plethora of pleiotropic effects that vary depending on the cellular or environmental context. Interestingly, a decade ago, it was observed that metformin-treated diabetic patients have a significantly lower cancer risk. Although a variety of in vivo and in vitro observations emphasising the role of metformin as anticancer drug have been reported, the underlying mechanisms are still poorly understood. Here, we discuss our current understanding of the molecular mechanisms that are perturbed by metformin treatment and that might be relevant to understand its antitumour activities. We focus on the cell-autonomous mechanisms modulating growth and death of cancer cells.


Asunto(s)
Antineoplásicos/uso terapéutico , Metformina/uso terapéutico , Neoplasias/tratamiento farmacológico , Antineoplásicos/farmacología , Muerte Celular/efectos de los fármacos , Proliferación Celular/efectos de los fármacos , Progresión de la Enfermedad , Humanos , Metformina/farmacología , Neoplasias/patología
11.
Nucleic Acids Res ; 40(Database issue): D857-61, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22096227

RESUMEN

The Molecular INTeraction Database (MINT, http://mint.bio.uniroma2.it/mint/) is a public repository for protein-protein interactions (PPI) reported in peer-reviewed journals. The database grows steadily over the years and at September 2011 contains approximately 235,000 binary interactions captured from over 4750 publications. The web interface allows the users to search, visualize and download interactions data. MINT is one of the members of the International Molecular Exchange consortium (IMEx) and adopts the Molecular Interaction Ontology of the Proteomics Standard Initiative (PSI-MI) standards for curation and data exchange. MINT data are freely accessible and downloadable at http://mint.bio.uniroma2.it/mint/download.do. We report here the growth of the database, the major changes in curation policy and a new algorithm to assign a confidence to each interaction.


Asunto(s)
Bases de Datos de Proteínas , Mapeo de Interacción de Proteínas , Algoritmos , Animales , Humanos , Ratones , Proteínas/química , Proteínas/genética , Ratas
12.
J Biol Chem ; 287(32): 27066-77, 2012 Aug 03.
Artículo en Inglés | MEDLINE | ID: mdl-22711529

RESUMEN

Activating mutations in PTPN11 cause Noonan syndrome, the most common nonchromosomal disorder affecting development and growth. PTPN11 encodes SHP2, an Src homology 2 (SH2) domain-containing protein-tyrosine phosphatase that positively modulates RAS function. Here, we characterized functionally all possible amino acid substitutions arising from single-base changes affecting codons 62 and 63 to explore the molecular mechanisms lying behind the largely invariant occurrence of the Y62D and Y63C substitutions recurring in Noonan syndrome. We provide structural and biochemical data indicating that the autoinhibitory interaction between the N-SH2 and protein-tyrosine phosphatase (PTP) domains is perturbed in both mutants as a result of an extensive structural rearrangement of the N-SH2 domain. Most mutations affecting Tyr(63) exerted an unpredicted disrupting effect on the structure of the N-SH2 phosphopeptide-binding cleft mediating the interaction of SHP2 with signaling partners. Among all the amino acid changes affecting that codon, the disease-causing mutation was the only substitution that perturbed the stability of the inactive conformation of SHP2 without severely impairing proper phosphopeptide binding of N-SH2. On the other hand, the disruptive effect of the Y62D change on the autoinhibited conformation of the protein was balanced, in part, by less efficient binding properties of the mutant. Overall, our data demonstrate that the selection-by-function mechanism acting as driving force for PTPN11 mutations affecting codons 62 and 63 implies balancing of counteracting effects operating on the allosteric control of the function of SHP2.


Asunto(s)
Síndrome de Noonan/enzimología , Proteína Tirosina Fosfatasa no Receptora Tipo 11/metabolismo , Dominios Homologos src , Secuencia de Aminoácidos , Humanos , Modelos Moleculares , Simulación de Dinámica Molecular , Datos de Secuencia Molecular , Mutación , Fosforilación , Proteína Tirosina Fosfatasa no Receptora Tipo 11/química , Proteína Tirosina Fosfatasa no Receptora Tipo 11/genética
13.
Mol Syst Biol ; 8: 603, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-22893001

RESUMEN

Large-scale siRNA screenings allow linking the function of poorly characterized genes to phenotypic readouts. According to this strategy, genes are associated with a function of interest if the alteration of their expression perturbs the phenotypic readouts. However, given the intricacy of the cell regulatory network, the mapping procedure is low resolution and the resulting models provide little mechanistic insights. We have developed a new strategy that combines multiparametric analysis of cell perturbation with logic modeling to achieve a more detailed functional mapping of human genes onto complex pathways. A literature-derived optimized model is used to infer the cell activation state following upregulation or downregulation of the model entities. By matching this signature with the experimental profile obtained in the high-throughput siRNA screening it is possible to infer the target of each protein, thus defining its 'entry point' in the network. By this novel approach, 41 phosphatases that affect key growth pathways were identified and mapped onto a human epithelial cell-specific growth model, thus providing insights into the mechanisms underlying their function.


Asunto(s)
Ensayos Analíticos de Alto Rendimiento/métodos , Redes y Vías Metabólicas , Monoéster Fosfórico Hidrolasas/genética , Monoéster Fosfórico Hidrolasas/metabolismo , Proteoma/genética , Proteoma/metabolismo , Perfilación de la Expresión Génica/métodos , Genómica/métodos , Células HeLa , Humanos , Microscopía Fluorescente , Modelos Biológicos , Neoplasias/genética , Neoplasias/metabolismo , Proteínas/genética , Proteínas/metabolismo , ARN Interferente Pequeño/genética , ARN Interferente Pequeño/metabolismo , Transducción de Señal
14.
J Biol Chem ; 286(6): 4173-85, 2011 Feb 11.
Artículo en Inglés | MEDLINE | ID: mdl-21123182

RESUMEN

There is growing evidence that tyrosine phosphatases display an intrinsic enzymatic preference for the sequence context flanking the target phosphotyrosines. On the other hand, substrate selection in vivo is decisively guided by the enzyme-substrate connectivity in the protein interaction network. We describe here a system wide strategy to infer physiological substrates of protein-tyrosine phosphatases. Here we integrate, by a Bayesian model, proteome wide evidence about in vitro substrate preference, as determined by a novel high-density peptide chip technology, and "closeness" in the protein interaction network. This allows to rank candidate substrates of the human PTP1B phosphatase. Ultimately a variety of in vitro and in vivo approaches were used to verify the prediction that the tyrosine phosphorylation levels of five high-ranking substrates, PLC-γ1, Gab1, SHP2, EGFR, and SHP1, are indeed specifically modulated by PTP1B. In addition, we demonstrate that the PTP1B-mediated dephosphorylation of Gab1 negatively affects its EGF-induced association with the phosphatase SHP2. The dissociation of this signaling complex is accompanied by a decrease of ERK MAP kinase phosphorylation and activation.


Asunto(s)
Sistema de Señalización de MAP Quinasas/fisiología , Proteína Tirosina Fosfatasa no Receptora Tipo 1/metabolismo , Proteoma/metabolismo , Proteínas Adaptadoras Transductoras de Señales/metabolismo , Línea Celular , Receptores ErbB/metabolismo , Quinasas MAP Reguladas por Señal Extracelular/metabolismo , Humanos , Fosfolipasa C gamma/metabolismo , Fosforilación/fisiología , Análisis por Matrices de Proteínas/métodos , Proteína Tirosina Fosfatasa no Receptora Tipo 11/metabolismo , Proteína Tirosina Fosfatasa no Receptora Tipo 6/metabolismo , Especificidad por Sustrato/fisiología
15.
PLoS Biol ; 7(10): e1000218, 2009 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-19841731

RESUMEN

SH3 domains are peptide recognition modules that mediate the assembly of diverse biological complexes. We scanned billions of phage-displayed peptides to map the binding specificities of the SH3 domain family in the budding yeast, Saccharomyces cerevisiae. Although most of the SH3 domains fall into the canonical classes I and II, each domain utilizes distinct features of its cognate ligands to achieve binding selectivity. Furthermore, we uncovered several SH3 domains with specificity profiles that clearly deviate from the two canonical classes. In conjunction with phage display, we used yeast two-hybrid and peptide array screening to independently identify SH3 domain binding partners. The results from the three complementary techniques were integrated using a Bayesian algorithm to generate a high-confidence yeast SH3 domain interaction map. The interaction map was enriched for proteins involved in endocytosis, revealing a set of SH3-mediated interactions that underlie formation of protein complexes essential to this biological pathway. We used the SH3 domain interaction network to predict the dynamic localization of several previously uncharacterized endocytic proteins, and our analysis suggests a novel role for the SH3 domains of Lsb3p and Lsb4p as hubs that recruit and assemble several endocytic complexes.


Asunto(s)
Endocitosis , Regulación Fúngica de la Expresión Génica , Modelos Moleculares , Proteínas de Saccharomyces cerevisiae/química , Proteínas de Saccharomyces cerevisiae/metabolismo , Saccharomyces cerevisiae/metabolismo , Dominios Homologos src , Algoritmos , Teorema de Bayes , Proteínas Portadoras/química , Proteínas Portadoras/metabolismo , Ligandos , Proteínas de Microfilamentos/química , Proteínas de Microfilamentos/metabolismo , Biblioteca de Péptidos , Unión Proteica , Mapeo de Interacción de Proteínas/métodos , Proteínas de Saccharomyces cerevisiae/genética , Técnicas del Sistema de Dos Híbridos
16.
Nucleic Acids Res ; 38(Database issue): D532-9, 2010 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-19897547

RESUMEN

MINT (http://mint.bio.uniroma2.it/mint) is a public repository for molecular interactions reported in peer-reviewed journals. Since its last report, MINT has grown considerably in size and evolved in scope to meet the requirements of its users. The main changes include a more precise definition of the curation policy and the development of an enhanced and user-friendly interface to facilitate the analysis of the ever-growing interaction dataset. MINT has adopted the PSI-MI standards for the annotation and for the representation of molecular interactions and is a member of the IMEx consortium.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Bases de Datos de Ácidos Nucleicos , Mapeo de Interacción de Proteínas , Animales , Biología Computacional/tendencias , Bases de Datos de Proteínas , Receptores ErbB/metabolismo , Genoma Viral , Humanos , Almacenamiento y Recuperación de la Información/métodos , Internet , Lenguajes de Programación , Unión Proteica , Estructura Terciaria de Proteína , Programas Informáticos
17.
Front Mol Biosci ; 9: 893256, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35664677

RESUMEN

Some inherited or somatically-acquired gene variants are observed significantly more frequently in the genome of cancer cells. Although many of these cannot be confidently classified as driver mutations, they may contribute to shaping a cell environment that favours cancer onset and development. Understanding how these gene variants causally affect cancer phenotypes may help developing strategies for reverting the disease phenotype. Here we focus on variants of genes whose products have the potential to modulate metabolism to support uncontrolled cell growth. Over recent months our team of expert curators has undertaken an effort to annotate in the database SIGNOR 1) metabolic pathways that are deregulated in cancer and 2) interactions connecting oncogenes and tumour suppressors to metabolic enzymes. In addition, we refined a recently developed graph analysis tool that permits users to infer causal paths leading from any human gene to modulation of metabolic pathways. The tool grounds on a human signed and directed network that connects ∼8400 biological entities such as proteins and protein complexes via causal relationships. The network, which is based on more than 30,000 published causal links, can be downloaded from the SIGNOR website. In addition, as SIGNOR stores information on drugs or other chemicals targeting the activity of many of the genes in the network, the identification of likely functional paths offers a rational framework for exploring new therapeutic strategies that revert the disease phenotype.

18.
Cell Death Discov ; 8(1): 16, 2022 Jan 10.
Artículo en Inglés | MEDLINE | ID: mdl-35013135

RESUMEN

Repurposing of drugs for new therapeutic use has received considerable attention for its potential to limit time and cost of drug development. Here we present a new strategy to identify chemicals that are likely to promote a desired phenotype. We used data from the Connectivity Map (CMap) to produce a ranked list of drugs according to their potential to activate transcription factors that mediate myeloid differentiation of leukemic progenitor cells. To validate our strategy, we tested the in vitro differentiation potential of candidate compounds using the HL-60 human cell line as a myeloid differentiation model. Ten out of 22 compounds, which were ranked high in the inferred list, were confirmed to promote significant differentiation of HL-60. These compounds may be considered candidate for differentiation therapy. The method that we have developed is versatile and it can be adapted to different drug repurposing projects.

19.
Proteomics ; 11(1): 128-43, 2011 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-21182200

RESUMEN

Large-scale interaction studies contribute the largest fraction of protein interactions information in databases. However, co-purification of non-specific or indirect ligands, often results in data sets that are affected by a considerable number of false positives. For the fraction of interactions mediated by short linear peptides, we present here a combined experimental and computational strategy for ranking the reliability of the inferred partners. We apply this strategy to the family of 14-3-3 domains. We have first characterized the recognition specificity of this domain family, largely confirming the results of previous analyses, while revealing new features of the preferred sequence context of 14-3-3 phospho-peptide partners. Notably, a proline next to the carboxy side of the phospho-amino acid functions as a potent inhibitor of 14-3-3 binding. The position-specific information about residue preference was encoded in a scoring matrix and two regular expressions. The integration of these three features in a single predictive model outperforms publicly available prediction tools. Next we have combined, by a naïve Bayesian approach, these "peptide features" with "protein features", such as protein co-expression and co-localization. Our approach provides an orthogonal reliability assessment and maps with high confidence the 14-3-3 peptide target on the partner proteins.


Asunto(s)
Proteínas 14-3-3/metabolismo , Biología Computacional/métodos , Péptidos/metabolismo , Mapeo de Interacción de Proteínas/métodos , Saccharomyces cerevisiae/metabolismo , Humanos , Fosfopéptidos/metabolismo , Unión Proteica
20.
BMC Bioinformatics ; 12 Suppl 8: S8, 2011 Oct 03.
Artículo en Inglés | MEDLINE | ID: mdl-22151178

RESUMEN

BACKGROUND: The vast amount of data published in the primary biomedical literature represents a challenge for the automated extraction and codification of individual data elements. Biological databases that rely solely on manual extraction by expert curators are unable to comprehensively annotate the information dispersed across the entire biomedical literature. The development of efficient tools based on natural language processing (NLP) systems is essential for the selection of relevant publications, identification of data attributes and partially automated annotation. One of the tasks of the Biocreative 2010 Challenge III was devoted to the evaluation of NLP systems developed to identify articles for curation and extraction of protein-protein interaction (PPI) data. RESULTS: The Biocreative 2010 competition addressed three tasks: gene normalization, article classification and interaction method identification. The BioGRID and MINT protein interaction databases both participated in the generation of the test publication set for gene normalization, annotated the development and test sets for article classification, and curated the test set for interaction method classification. These test datasets served as a gold standard for the evaluation of data extraction algorithms. CONCLUSION: The development of efficient tools for extraction of PPI data is a necessary step to achieve full curation of the biomedical literature. NLP systems can in the first instance facilitate expert curation by refining the list of candidate publications that contain PPI data; more ambitiously, NLP approaches may be able to directly extract relevant information from full-text articles for rapid inspection by expert curators. Close collaboration between biological databases and NLP systems developers will continue to facilitate the long-term objectives of both disciplines.


Asunto(s)
Minería de Datos , Bases de Datos de Proteínas , Genes , Procesamiento de Lenguaje Natural , Algoritmos , Minería de Datos/normas , Humanos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA