Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 26
Filtrar
1.
Brief Bioinform ; 25(2)2024 Jan 22.
Artículo en Inglés | MEDLINE | ID: mdl-38436560

RESUMEN

RNA is a complex macromolecule that plays central roles in the cell. While it is well known that its structure is directly related to its functions, understanding and predicting RNA structures is challenging. Assessing the real or predictive quality of a structure is also at stake with the complex 3D possible conformations of RNAs. Metrics have been developed to measure model quality while scoring functions aim at assigning quality to guide the discrimination of structures without a known and solved reference. Throughout the years, many metrics and scoring functions have been developed, and no unique assessment is used nowadays. Each developed assessment method has its specificity and might be complementary to understanding structure quality. Therefore, to evaluate RNA 3D structure predictions, it would be important to calculate different metrics and/or scoring functions. For this purpose, we developed RNAdvisor, a comprehensive automated software that integrates and enhances the accessibility of existing metrics and scoring functions. In this paper, we present our RNAdvisor tool, as well as state-of-the-art existing metrics, scoring functions and a set of benchmarks we conducted for evaluating them. Source code is freely available on the EvryRNA platform: https://evryrna.ibisc.univ-evry.fr.


Asunto(s)
Benchmarking , ARN , Modelos Estructurales , ARN/genética , Programas Informáticos
2.
Brief Bioinform ; 24(4)2023 07 20.
Artículo en Inglés | MEDLINE | ID: mdl-37337745

RESUMEN

RNAs can interact with other molecules in their environment, such as ions, proteins or other RNAs, to form complexes with important biological roles. The prediction of the structure of these complexes is therefore an important issue and a difficult task. We are interested in RNA complexes composed of several (more than two) interacting RNAs. We show how available knowledge on the considered RNAs can help predict their secondary structure. We propose an interactive tool for the prediction of RNA complexes, called C-RCPRed, that considers user knowledge and probing data (which can be generated experimentally or artificially). C-RCPred is based on a multi-objective optimization algorithm. Through an extensive benchmarking procedure, which includes state-of-the-art methods, we show the efficiency of the multi-objective approach and the positive impact of considering user knowledge and probing data on the prediction results. C-RCPred is freely available as an open-source program and web server on the EvryRNA website (https://evryrna.ibisc.univ-evry.fr).


Asunto(s)
ARN , Programas Informáticos , ARN/química , Análisis de Secuencia de ARN/métodos , Algoritmos , Estructura Secundaria de Proteína , Conformación de Ácido Nucleico
3.
Nucleic Acids Res ; 51(W1): W281-W288, 2023 07 05.
Artículo en Inglés | MEDLINE | ID: mdl-37158254

RESUMEN

Recent advances have shown that some biologically active non-coding RNAs (ncRNAs) are actually translated into polypeptides that have a physiological function as well. This paradigm shift requires adapted computational methods to predict this new class of 'bifunctional RNAs'. Previously, we developed IRSOM, an open-source algorithm to classify non-coding and coding RNAs. Here, we use the binary statistical model of IRSOM as a ternary classifier, called IRSOM2, to identify bifunctional RNAs as a rejection of the two other classes. We present its easy-to-use web interface, which allows users to perform predictions on large datasets of RNA sequences in a short time, to re-train the model with their own data, and to visualize and analyze the classification results thanks to the implementation of self-organizing maps (SOM). We also propose a new benchmark of experimentally validated RNAs that play both protein-coding and non-coding roles, in different organisms. Thus, IRSOM2 showed promising performance in detecting these bifunctional transcripts among ncRNAs of different types, such as circRNAs and lncRNAs (in particular those of shorter lengths). The web server is freely available on the EvryRNA platform: https://evryrna.ibisc.univ-evry.fr.


Asunto(s)
Algoritmos , Biología Computacional , Simulación por Computador , ARN , ARN Largo no Codificante/química , Análisis de Secuencia de ARN/métodos , Biología Computacional/instrumentación , Biología Computacional/métodos , ARN/química , ARN/clasificación , Internet
4.
Nucleic Acids Res ; 50(W1): W732-W738, 2022 07 05.
Artículo en Inglés | MEDLINE | ID: mdl-35580056

RESUMEN

Understanding the functions and origins of proteins requires splitting these macromolecules into fragments that could be independent in terms of folding, activity, or evolution. For that purpose, structural domains are the typical level of analysis, but shorter segments, such as subdomains and supersecondary structures, are insightful as well. Here, we propose SWORD2, a web server for exploring how an input protein structure may be decomposed into 'Protein Units' that can be hierarchically assembled to delimit structural domains. For each partitioning solution, the relevance of the identified substructures is estimated through different measures. This multilevel analysis is achieved by integrating our previous work on domain delineation, 'protein peeling' and model quality assessment. We hope that SWORD2 will be useful to biologists searching for key regions in their proteins of interest and to bioinformaticians building datasets of protein structures. The web server is freely available online: https://www.dsimb.inserm.fr/SWORD2.


Asunto(s)
Proteínas , Programas Informáticos , Proteínas/química , Computadores , Conformación Proteica , Internet
5.
J Cell Mol Med ; 27(15): 2228-2238, 2023 08.
Artículo en Inglés | MEDLINE | ID: mdl-37415307

RESUMEN

Down syndrome is the most common chromosomal abnormality in humans. Patients with Down syndrome have hematologic disorders, including mild to moderate thrombocytopenia. In case of Down syndrome, thrombocytopenia is not associated with bleeding, and it remains poorly characterized regarding molecular mechanisms. We investigated the effects of overexpression of Dyrk1A, an important factor contributing to some major Down syndrome phenotypes, on platelet number and bleeding in mice. Mice overexpressing Dyrk1A have a decrease in platelet number by 20%. However, bleeding time was found to be reduced by 50%. The thrombocytopenia and the decreased bleeding time observed were not associated to an abnormal platelet receptors expression, to a defect of platelet activation by ADP, thrombin or convulxin, to the presence of activated platelets in the circulation or to an abnormal half-life of the platelets. To propose molecular mechanisms explaining this discrepancy, we performed a network analysis of Dyrk1A interactome and demonstrated that Dyrk1A, fibronectin and fibrinogen interact indirectly through two distinct clusters of proteins. Moreover, in mice overexpressing Dyrk1A, increased plasma fibronectin and fibrinogen levels were found, linked to an increase of the hepatic fibrinogen production. Our results indicate that overexpression of Dyrk1A in mice induces decreased bleeding consistent with increased plasma fibronectin and fibrinogen levels, revealing a new role of Dyrk1A depending on its indirect interaction with these two proteins.


Asunto(s)
Síndrome de Down , Trombocitopenia , Animales , Humanos , Ratones , Plaquetas/metabolismo , Síndrome de Down/metabolismo , Fibrinógeno/metabolismo , Fibronectinas/metabolismo , Hemorragia/metabolismo , Trombocitopenia/metabolismo , Quinasas DyrK
6.
Nucleic Acids Res ; 49(W1): W277-W284, 2021 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-33978743

RESUMEN

The InterEvDock3 protein docking server exploits the constraints of evolution by multiple means to generate structural models of protein assemblies. The server takes as input either several sequences or 3D structures of proteins known to interact. It returns a set of 10 consensus candidate complexes, together with interface predictions to guide further experimental validation interactively. Three key novelties were implemented in InterEvDock3 to help obtain more reliable models: users can (i) generate template-based structural models of assemblies using close and remote homologs of known 3D structure, detected through an automated search protocol, (ii) select the assembly models most consistent with contact maps from external methods that implement covariation-based contact prediction with or without deep learning and (iii) exploit a novel coevolution-based scoring scheme at atomic level, which leads to significantly higher free docking success rates. The performance of the server was validated on two large free docking benchmark databases, containing respectively 230 unbound targets (Weng dataset) and 812 models of unbound targets (PPI4DOCK dataset). Its effectiveness has also been proven on a number of challenging examples. The InterEvDock3 web interface is available at http://bioserv.rpbs.univ-paris-diderot.fr/services/InterEvDock3/.


Asunto(s)
Simulación del Acoplamiento Molecular , Conformación Proteica , Programas Informáticos , Subunidades de Proteína/química , Homología de Secuencia de Aminoácido , Homología Estructural de Proteína
7.
Nucleic Acids Res ; 49(W1): W567-W572, 2021 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-33963857

RESUMEN

Proteo3Dnet is a web server dedicated to the analysis of mass spectrometry interactomics experiments. Given a flat list of proteins, its aim is to organize it in terms of structural interactions to provide a clearer overview of the data. This is achieved using three means: (i) the search for interologs with resolved structure available in the protein data bank, including cross-species remote homology search, (ii) the search for possibly weaker interactions mediated through Short Linear Motifs as predicted by ELM-a unique feature of Proteo3Dnet, (iii) the search for protein-protein interactions physically validated in the BioGRID database. The server then compiles this information and returns a graph of the identified interactions and details about the different searches. The graph can be interactively explored to understand the way the core complexes identified could interact. It can also suggest undetected partners to the experimentalists, or specific cases of conditionally exclusive binding. The interest of Proteo3Dnet, previously demonstrated for the difficult cases of the proteasome and pragmin complexes data is, here, illustrated in the context of yeast precursors to the small ribosomal subunits and the smaller interactome of 14-3-3zeta frequent interactors. The Proteo3Dnet web server is accessible at http://bioserv.rpbs.univ-paris-diderot.fr/services/Proteo3Dnet/.


Asunto(s)
Conformación Proteica , Mapeo de Interacción de Proteínas/métodos , Programas Informáticos , Proteínas 14-3-3/metabolismo , Internet , Espectrometría de Masas , Dominios y Motivos de Interacción de Proteínas , Mapas de Interacción de Proteínas , Proteómica , Subunidades Ribosómicas Pequeñas de Eucariotas/metabolismo
8.
Nucleic Acids Res ; 47(W1): W423-W428, 2019 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-31114872

RESUMEN

Loop regions in protein structures often have crucial roles, and they are much more variable in sequence and structure than other regions. In homology modeling, this leads to larger deviations from the homologous templates, and loop modeling of homology models remains an open problem. To address this issue, we have previously developed the DaReUS-Loop protocol, leading to significant improvement over existing methods. Here, a DaReUS-Loop web server is presented, providing an automated platform for modeling or remodeling loops in the context of homology models. This is the first web server accepting a protein with up to 20 loop regions, and modeling them all in parallel. It also provides a prediction confidence level that corresponds to the expected accuracy of the loops. DaReUS-Loop facilitates the analysis of the results through its interactive graphical interface and is freely available at http://bioserv.rpbs.univ-paris-diderot.fr/services/DaReUS-Loop/.


Asunto(s)
Modelos Moleculares , Programas Informáticos , Homología Estructural de Proteína , Internet
9.
J Proteome Res ; 19(7): 2807-2820, 2020 07 02.
Artículo en Inglés | MEDLINE | ID: mdl-32338910

RESUMEN

Protein-protein interactions play a major role in the molecular machinery of life, and various techniques such as AP-MS are dedicated to their identification. However, those techniques return lists of proteins devoid of organizational structure, not detailing which proteins interact with which others. Proposing a hierarchical view of the interactions between the members of the flat list becomes highly tedious for large data sets when done by hand. To help hierarchize this data, we introduce a new bioinformatics protocol that integrates information of the multimeric protein 3D structures available in the Protein Data Bank using remote homology detection, as well as information related to Short Linear Motifs and interaction data from the BioGRID. We illustrate on two unrelated use-cases of different complexity how our approach can be useful to decipher the network of interactions hidden in the list of input proteins, and how it provides added value compared to state-of-the-art resources such as Interactome3D or STRING. Particularly, we show the added value of using homology detection to distinguish between orthologs and paralogs, and to distinguish between core obligate and more facultative interactions. We also demonstrate the potential of considering interactions occurring through Short Linear Motifs.


Asunto(s)
Mapas de Interacción de Proteínas , Proteómica , Biología Computacional , Bases de Datos de Proteínas , Mapeo de Interacción de Proteínas , Proteínas/genética , Proteínas/metabolismo
10.
Nucleic Acids Res ; 46(D1): D454-D458, 2018 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-29136213

RESUMEN

Knottins, or inhibitor cystine knots (ICKs), are ultra-stable miniproteins with multiple applications in drug design and medical imaging. These widespread and functionally diverse proteins are characterized by the presence of three interwoven disulfide bridges in their structure, which form a unique pseudoknot. Since 2004, the KNOTTIN database (www.dsimb.inserm.fr/KNOTTIN/) has been gathering standardized information about knottin sequences, structures, functions and evolution. The website also provides access to bibliographic data and to computational tools that have been specifically developed for ICKs. Here, we present a major upgrade of our database, both in terms of data content and user interface. In addition to the new features, this article describes how KNOTTIN has seen its size multiplied over the past ten years (since its last publication), notably with the recent inclusion of predicted ICKs structures. Finally, we report how our web resource has proved usefulness for the researchers working on ICKs, and how the new version of the KNOTTIN website will continue to serve this active community.


Asunto(s)
Miniproteínas Nodales de Cistina/química , Bases de Datos de Proteínas/historia , Modelos Moleculares , Interfaz Usuario-Computador , Secuencia de Aminoácidos , Gráficos por Computador , Miniproteínas Nodales de Cistina/clasificación , Miniproteínas Nodales de Cistina/genética , Miniproteínas Nodales de Cistina/metabolismo , Disulfuros , Expresión Génica , Historia del Siglo XXI , Humanos , Internet , Ligandos , Conformación Proteica en Hélice alfa , Conformación Proteica en Lámina beta , Alineación de Secuencia , Análisis de Secuencia de Proteína
11.
Bioinformatics ; 32(16): 2548-50, 2016 08 15.
Artículo en Inglés | MEDLINE | ID: mdl-27153644

RESUMEN

UNLABELLED: : The experimental determination of membrane protein orientation within the lipid bilayer is extremely challenging, such that computational methods are most often the only solution. Moreover, obtaining all-atom 3D structures of membrane proteins is also technically difficult, and many of the available data are either experimental low-resolution structures or theoretical models, whose structural quality needs to be evaluated. Here, to address these two crucial problems, we propose OREMPRO, a web server capable of both (i) positioning α-helical and ß-sheet transmembrane domains in the lipid bilayer and (ii) assessing their structural quality. Most importantly, OREMPRO uses the sole alpha carbon coordinates, which makes it the only web server compatible with both high and low structural resolutions. Finally, OREMPRO is also interesting in its ability to process coarse-grained protein models, by using coordinates of backbone beads in place of alpha carbons. AVAILABILITY AND IMPLEMENTATION: http://www.dsimb.inserm.fr/OREMPRO/ CONTACT: : guillaume.postic@univ-paris-diderot.fr or jean-christophe.gelly@univ-paris-diderot.fr SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Computadores , Proteínas de la Membrana , Programas Informáticos , Internet , Modelos Moleculares , Proteínas
12.
Bioinformatics ; 31(23): 3782-9, 2015 Dec 01.
Artículo en Inglés | MEDLINE | ID: mdl-26254434

RESUMEN

MOTIVATION: Template-based modeling, the most successful approach for predicting protein 3D structure, often requires detecting distant evolutionary relationships between the target sequence and proteins of known structure. Developed for this purpose, fold recognition methods use elaborate strategies to exploit evolutionary information, mainly by encoding amino acid sequence into profiles. Since protein structure is more conserved than sequence, the inclusion of structural information can improve the detection of remote homology. RESULTS: Here, we present ORION, a new fold recognition method based on the pairwise comparison of hybrid profiles that contain evolutionary information from both protein sequence and structure. Our method uses the 16-state structural alphabet Protein Blocks, which provides an accurate 1D description of protein structure local conformations. ORION systematically outperforms PSI-BLAST and HHsearch on several benchmarks, including target sequences from the modeling competitions CASP8, 9 and 10, and detects ∼10% more templates at fold and superfamily SCOP levels. AVAILABILITY: Software freely available for download at http://www.dsimb.inserm.fr/orion/. CONTACT: jean-christophe.gelly@univ-paris-diderot.fr. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Algoritmos , Pliegue de Proteína , Evolución Molecular , Modelos Moleculares , Conformación Proteica , Proteínas/química , Proteínas/genética , Análisis de Secuencia de Proteína/métodos , Programas Informáticos
13.
RNA ; 18(1): 155-65, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22114320

RESUMEN

Processive RNases are unable to degrade efficiently very short oligonucleotides, and they are complemented by specific enzymes, nanoRNases, that assist in this process. We previously identified NrnA (YtqI) from Bacillus subtilis as a bifunctional protein with the ability to degrade nanoRNA (RNA oligos ≤5 nucleotides) and to dephosphorylate 3'-phosphoadenosine 5'-phosphate (pAp) to AMP. While the former activity is analogous to that of oligoribonuclease (Orn) from Escherichia coli, the latter corresponds to CysQ. NrnA homologs are widely present in bacterial and archaeal genomes. They are found preferably in genomes that lack Orn or CysQ homologs. Here, we characterize NrnA homologs from important human pathogens, Mpn140 from Mycoplasma pneumoniae, and Rv2837c from Mycobacterium tuberculosis. Like NrnA, these enzymes degrade nanoRNA and dephosphorylate pAp in vitro. However, they show dissimilar preferences for specific nanoRNA substrate lengths. Whereas NrnA prefers RNA 3-mers with a 10-fold higher specific activity compared to 5-mers, Rv2837c shows a preference for nanoRNA of a different length, namely, 2-mers. Mpn140 degrades Cy5-labeled nanoRNA substrates in vitro with activities varying within one order of magnitude as follows: 5-mer>4-mer>3-mer>2-mer. In agreement with these in vitro activities, both Rv2837c and Mpn140 can complement the lack of their functional counterparts in E. coli: CysQ and Orn. The NrnA homolog from Streptococcus mutans, SMU.1297, was previously shown to hydrolyze pAp and to complement an E. coli cysQ mutant. Here, we show that SMU.1297 can complement an E. coli orn(-) mutant, suggesting that having both pAp-phosphatase and nanoRNase activity is a common feature of NrnA homologs.


Asunto(s)
Exorribonucleasas/química , Mycobacterium tuberculosis/enzimología , Mycoplasma pneumoniae/enzimología , Escherichia coli/genética , Exorribonucleasas/genética , Prueba de Complementación Genética , MicroARNs/química , Mutación , Monoéster Fosfórico Hidrolasas/genética , Proteínas Recombinantes/química , Proteínas Recombinantes/genética , Streptococcus mutans/enzimología , Streptococcus mutans/genética
14.
NAR Genom Bioinform ; 6(2): lqae048, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38745991

RESUMEN

RNAs are essential molecules involved in numerous biological functions. Understanding RNA functions requires the knowledge of their 3D structures. Computational methods have been developed for over two decades to predict the 3D conformations from RNA sequences. These computational methods have been widely used and are usually categorised as either ab initio or template-based. The performances remain to be improved. Recently, the rise of deep learning has changed the sight of novel approaches. Deep learning methods are promising, but their adaptation to RNA 3D structure prediction remains difficult. In this paper, we give a brief review of the ab initio, template-based and novel deep learning approaches. We highlight the different available tools and provide a benchmark on nine methods using the RNA-Puzzles dataset. We provide an online dashboard that shows the predictions made by benchmarked methods, freely available on the EvryRNA platform: https://evryrna.ibisc.univ-evry.fr/evryrna/state_of_the_rnart/.

15.
Infect Immun ; 79(4): 1428-39, 2011 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-21245269

RESUMEN

Francisella tularensis is a highly infectious bacterium causing the zoonotic disease tularemia. This facultative intracellular bacterium replicates in vivo mainly inside macrophages and therefore has developed strategies to resist this stressful environment. Here, we identified a novel genetic locus that is important for stress resistance and intracellular survival of F. tularensis. In silico and transcriptional analyses suggest that this locus (genes FTL_0200 to FTL_0209 in the live vaccine strain [LVS]) constitutes an operon controlled by the alternative sigma factor σ³². The first gene, FTL_0200, encodes a putative AAA+ ATPase of the MoxR subfamily. Insertion mutagenesis into genes FTL_0200, FTL_0205, and FTL_0206 revealed a role for the locus in both intracellular multiplication and in vivo survival of F. tularensis. Deletion of gene FTL_0200 led to a mutant bacterium with increased vulnerability to various stress conditions, including oxidative and pH stresses. Proteomic analyses revealed a pleiotropic impact of the ΔFTL_0200 deletion, supporting a role as a chaperone for FTL_0200. This is the first report of a role for a MoxR family member in bacterial pathogenesis. This class of proteins is remarkably conserved among pathogenic species and may thus constitute a novel player in bacterial virulence.


Asunto(s)
Francisella tularensis/genética , Francisella tularensis/patogenicidad , Genes Bacterianos/genética , Chaperonas Moleculares/genética , Estrés Fisiológico/genética , Secuencia de Aminoácidos , Proteínas Bacterianas/genética , Proteínas Bacterianas/metabolismo , Southern Blotting , Humanos , Macrófagos/metabolismo , Macrófagos/microbiología , Chaperonas Moleculares/metabolismo , Datos de Secuencia Molecular , Mutagénesis Sitio-Dirigida , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Alineación de Secuencia , Espectrometría de Masa por Láser de Matriz Asistida de Ionización Desorción , Tularemia/genética , Tularemia/metabolismo , Virulencia/genética
16.
Comput Struct Biotechnol J ; 19: 2618-2625, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34025948

RESUMEN

The recent breakthrough in the field of protein structure prediction shows the relevance of using knowledge-based based scoring functions in combination with a low-resolution 3D representation of protein macromolecules. The choice of not using all atoms is barely supported by any data in the literature, and is mostly motivated by empirical and practical reasons, such as the computational cost of assessing the numerous folds of the protein conformational space. Here, we present a comprehensive study, carried on a large and balanced benchmark of predicted protein structures, to see how different types of structural representations rank in either accuracy or calculation speed, and which ones offer the best compromise between these two criteria. We tested ten representations, including low-resolution, high-resolution, and coarse-grained approaches. We also investigated the generalization of the findings to other formalisms than the widely-used "potential of mean force" (PMF) method. Thus, we observed that representing protein structures by their ß carbons-combined or not with Cα-provides the best speed-accuracy trade-off, when using a "total information gain" scoring function. For statistical PMFs, using MARTINI backbone and side-chains beads is the best option. Finally, we also demonstrated the necessity of training the reference state on all atom types, and of including the Cα atoms of glycine residues, in a Cß-based representation.

17.
BMC Genomics ; 11: 625, 2010 Nov 10.
Artículo en Inglés | MEDLINE | ID: mdl-21067590

RESUMEN

BACKGROUND: Regulation of bacterial gene expression by small RNAs (sRNAs) have proved to be important for many biological processes. Francisella tularensis is a highly pathogenic Gram-negative bacterium that causes the disease tularaemia in humans and animals. Relatively little is known about the regulatory networks existing in this organism that allows it to survive in a wide array of environments and no sRNA regulators have been identified so far. RESULTS: We have used a combination of experimental assays and in silico prediction to identify sRNAs in F. tularensis strain LVS. Using a cDNA cloning and sequencing approach we have shown that F. tularensis expresses homologues of several sRNAs that are well-conserved among diverse bacteria. We have also discovered two abundant putative sRNAs that share no sequence similarity or conserved genomic context with any previously annotated regulatory transcripts. Deletion of either of these two loci led to significant changes in the expression of several mRNAs that likely include the cognate target(s) of these sRNAs. Deletion of these sRNAs did not, however, significantly alter F. tularensis growth under various stress conditions in vitro, its replication in murine cells, or its ability to induce disease in a mouse model of F. tularensis infection. We also conducted a genome-wide in silico search for intergenic loci that suggests F. tularensis encodes several other sRNAs in addition to the sRNAs found in our experimental screen. CONCLUSION: Our findings suggest that F. tularensis encodes a significant number of non-coding regulatory RNAs, including members of well conserved families of structural and housekeeping RNAs and other poorly conserved transcripts that may have evolved more recently to help F. tularensis deal with the unique and diverse set of environments with which it must contend.


Asunto(s)
Francisella tularensis/genética , ARN Bacteriano/análisis , ARN Bacteriano/genética , Animales , Vacunas Bacterianas/inmunología , Secuencia de Bases , Northern Blotting , Clonación Molecular , Biología Computacional , ADN Complementario/genética , Francisella tularensis/inmunología , Perfilación de la Expresión Génica , Regulación Bacteriana de la Expresión Génica , Espacio Intracelular/microbiología , Macrófagos/microbiología , Ratones , Datos de Secuencia Molecular , Mutación/genética , Conformación de Ácido Nucleico , Análisis de Secuencia por Matrices de Oligonucleótidos , Transporte de ARN/genética , ARN Bacteriano/química , Reproducibilidad de los Resultados , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Vacunas Atenuadas/inmunología
18.
Comput Struct Biotechnol J ; 18: 2228-2236, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-32837711

RESUMEN

For three decades now, knowledge-based scoring functions that operate through the "potential of mean force" (PMF) approach have continuously proven useful for studying protein structures. Although these statistical potentials are not to be confused with their physics-based counterparts of the same name-i.e. PMFs obtained by molecular dynamics simulations-their particular success in assessing the native-like character of protein structure predictions has lead authors to consider the computed scores as approximations of the free energy. However, this physical justification is a matter of controversy since the beginning. Alternative interpretations based on Bayes' theorem have been proposed, but the misleading formalism that invokes the inverse Boltzmann law remains recurrent in the literature. In this article, we present a conceptually new method for ranking protein structure models by quality, which is (i) independent of any physics-based explanation and (ii) relevant to statistics and to a general definition of information gain. The theoretical development described in this study provides new insights into how statistical PMFs work, in comparison with our approach. To prove the concept, we have built interatomic distance-dependent scoring functions, based on the former and new equations, and compared their performance on an independent benchmark of 60,000 protein structures. The results demonstrate that our new formalism outperforms statistical PMFs in evaluating the quality of protein structural decoys. Therefore, this original type of score offers a possibility to improve the success of statistical PMFs in the various fields of structural biology where they are applied. The open-source code is available for download at https://gitlab.rpbs.univ-paris-diderot.fr/src/ig-score.

19.
Biochimie ; 151: 37-41, 2018 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-29857183

RESUMEN

Evaluating the model quality of protein structures that evolve in environments with particular physicochemical properties requires scoring functions that are adapted to their specific residue compositions and/or structural characteristics. Thus, computational methods developed for structures from the cytosol cannot work properly on membrane or secreted proteins. Here, we present MyPMFs, an easy-to-use tool that allows users to train statistical potentials of mean force (PMFs) on the protein structures of their choice, with all parameters being adjustable. We demonstrate its use by creating an accurate statistical potential for transmembrane protein domains. We also show its usefulness to study the influence of the physical environment on residue interactions within protein structures. Our open-source software is freely available for download at https://github.com/bibip-impmc/mypmfs.


Asunto(s)
Biología Computacional/métodos , Modelos Moleculares , Conformación Proteica , Proteínas/química , Aminoácidos/química
20.
Database (Oxford) ; 2017(1)2017 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-28365741

RESUMEN

Knowing the position of protein structures within the membrane is crucial for fundamental and applied research in the field of molecular biology. Only few web resources propose coordinate files of oriented transmembrane proteins, and these exclude predicted structures, although they represent the largest part of the available models. In this article, we present TMPL (http://www.dsimb.inserm.fr/TMPL/), a database of transmembrane protein structures (α-helical and ß-sheet) positioned in the lipid bilayer. It is the first database to include theoretical models of transmembrane protein structures, making it a large repository with more than 11 000 entries. The TMPL database also contains experimentally solved protein structures, which are available as either atomistic or coarse-grained models. A unique feature of TMPL is the possibility for users to update the database by uploading, through an intuitive web interface, the membrane assignments they can obtain with our recent OREMPRO web server.


Asunto(s)
Bases de Datos de Proteínas , Membrana Dobles de Lípidos , Proteínas de la Membrana/química , Modelos Moleculares , Proteínas de la Membrana/genética , Dominios Proteicos , Estructura Secundaria de Proteína
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA