Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 25
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
Nucleic Acids Res ; 2024 Apr 08.
Artigo em Inglês | MEDLINE | ID: mdl-38587206

RESUMO

Alignment of 3D molecular structures involves overlaying their sets of atoms in space in such a way as to minimize the distance between the corresponding atoms. The purpose of this procedure is usually to analyze and assess structural similarity on a global (e.g. evaluating predicted 3D models and clustering structures) or a local level (e.g. searching for common substructures). Although the idea of alignment is simple, combinatorial algorithms that implement it require considerable computational resources, even when processing relatively small structures. In this paper, we introduce RNAhugs, a web server for custom and flexible alignment of 3D RNA structures. Using two efficient heuristics, GEOS and GENS, it finds the longest corresponding fragments within 3D structures that may differ in sizes-given in the PDB or PDBx/mmCIF formats-that manage to align with user-specified accuracy (i.e. with an RMSD not exceeding a cutoff value given as an input parameter). A distinctive advantage of the system lies in its ability to process multi-model files and compare the results of 1-25 alignments in a single task. RNAhugs has an intuitive interface and is publicly available at https://rnahugs.cs.put.poznan.pl/.

2.
Proteins ; 91(12): 1790-1799, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-37615316

RESUMO

As CASP15 participants, in the new category of 3D RNA structure prediction, we applied expert modeling with the support of our proprietary system RNAComposer. Although RNAComposer is primarily known as an automated web server, its features allow it to be used interactively, for example, for homology-based modeling or assembling models from user-provided structural elements. In the paper, we present various scenarios of applying the system to predict the 3D RNA structures that we employed. Their combination with expert input, comparative analysis of models, and routines to select representative resultant structures form a ready-for-reuse workflow. With selected examples, we demonstrate its application for the in silico modeling of natural and synthetic RNA molecules targeted in CASP15.


Assuntos
RNA , Software , Humanos , RNA/química , Conformação de Ácido Nucleico , Modelos Moleculares , Simulação por Computador
3.
Proteins ; 91(12): 1550-1557, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-37306011

RESUMO

Prediction categories in the Critical Assessment of Structure Prediction (CASP) experiments change with the need to address specific problems in structure modeling. In CASP15, four new prediction categories were introduced: RNA structure, ligand-protein complexes, accuracy of oligomeric structures and their interfaces, and ensembles of alternative conformations. This paper lists technical specifications for these categories and describes their integration in the CASP data management system.


Assuntos
Biologia Computacional , Proteínas , Conformação Proteica , Proteínas/química , Modelos Moleculares , Ligantes
4.
Bioinformatics ; 39(5)2023 05 04.
Artigo em Inglês | MEDLINE | ID: mdl-37166444

RESUMO

MOTIVATION: Tertiary structure alignment is one of the main challenges in the computer-aided comparative study of molecular structures. Its aim is to optimally overlay the 3D shapes of two or more molecules in space to find the correspondence between their nucleotides. Alignment is the starting point for most algorithms that assess structural similarity or find common substructures. Thus, it has applications in solving a variety of bioinformatics problems, e.g. in the search for structural patterns, structure clustering, identifying structural redundancy, and evaluating the prediction accuracy of 3D models. To date, several tools have been developed to align 3D structures of RNA. However, most of them are not applicable to arbitrarily large structures and do not allow users to parameterize the optimization algorithm. RESULTS: We present two customizable heuristics for flexible alignment of 3D RNA structures, geometric search (GEOS), and genetic algorithm (GENS). They work in sequence-dependent/independent mode and find the suboptimal alignment of expected quality (below a predefined RMSD threshold). We compare their performance with those of state-of-the-art methods for aligning RNA structures. We show the results of quantitative and qualitative tests run for all of these algorithms on benchmark sets of RNA structures. AVAILABILITY AND IMPLEMENTATION: Source codes for both heuristics are hosted at https://github.com/RNApolis/rnahugs.


Assuntos
RNA , Software , RNA/química , Heurística , Algoritmos , Conformação de Ácido Nucleico
5.
Brief Bioinform ; 24(3)2023 05 19.
Artigo em Inglês | MEDLINE | ID: mdl-37096592

RESUMO

Since the 1980s, dozens of computational methods have addressed the problem of predicting RNA secondary structure. Among them are those that follow standard optimization approaches and, more recently, machine learning (ML) algorithms. The former were repeatedly benchmarked on various datasets. The latter, on the other hand, have not yet undergone extensive analysis that could suggest to the user which algorithm best fits the problem to be solved. In this review, we compare 15 methods that predict the secondary structure of RNA, of which 6 are based on deep learning (DL), 3 on shallow learning (SL) and 6 control methods on non-ML approaches. We discuss the ML strategies implemented and perform three experiments in which we evaluate the prediction of (I) representatives of the RNA equivalence classes, (II) selected Rfam sequences and (III) RNAs from new Rfam families. We show that DL-based algorithms (such as SPOT-RNA and UFold) can outperform SL and traditional methods if the data distribution is similar in the training and testing set. However, when predicting 2D structures for new RNA families, the advantage of DL is no longer clear, and its performance is inferior or equal to that of SL and non-ML methods.


Assuntos
Aprendizado de Máquina , RNA , Humanos , RNA/genética , RNA/química , Algoritmos , Benchmarking
6.
Int J Mol Sci ; 23(17)2022 Aug 25.
Artigo em Inglês | MEDLINE | ID: mdl-36077037

RESUMO

RNA is a unique biomolecule that is involved in a variety of fundamental biological functions, all of which depend solely on its structure and dynamics. Since the experimental determination of crystal RNA structures is laborious, computational 3D structure prediction methods are experiencing an ongoing and thriving development. Such methods can lead to many models; thus, it is necessary to build comparisons and extract common structural motifs for further medical or biological studies. Here, we introduce a computational pipeline dedicated to reference-free high-throughput comparative analysis of 3D RNA structures. We show its application in the RNA-Puzzles challenge, in which five participating groups attempted to predict the three-dimensional structures of 5'- and 3'-untranslated regions (UTRs) of the SARS-CoV-2 genome. We report the results of this puzzle and discuss the structural motifs obtained from the analysis. All simulated models and tools incorporated into the pipeline are open to scientific and academic use.


Assuntos
COVID-19 , RNA , Regiões 3' não Traduzidas , Humanos , Conformação de Ácido Nucleico , RNA/química , SARS-CoV-2
7.
Bioinformatics ; 38(17): 4200-4205, 2022 09 02.
Artigo em Inglês | MEDLINE | ID: mdl-35809063

RESUMO

MOTIVATION: Knowledge of the 3D structure of RNA supports discovering its functions and is crucial for designing drugs and modern therapeutic solutions. Thus, much attention is devoted to experimental determination and computational prediction targeting the global fold of RNA and its local substructures. The latter include multi-branched loops-functionally significant elements that highly affect the spatial shape of the entire molecule. Unfortunately, their computational modeling constitutes a weak point of structural bioinformatics. A remedy for this is in collecting these motifs and analyzing their features. RESULTS: RNAloops is a self-updating database that stores multi-branched loops identified in the PDB-deposited RNA structures. A description of each loop includes angular data-planar and Euler angles computed between pairs of adjacent helices to allow studying their mutual arrangement in space. The system enables search and analysis of multiloops, presents their structure details numerically and visually, and computes data statistics. AVAILABILITY AND IMPLEMENTATION: RNAloops is freely accessible at https://rnaloops.cs.put.poznan.pl. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
RNA , Software , RNA/química , Conformação de Ácido Nucleico , Análise de Sequência de RNA , Bases de Dados Factuais
8.
Bioinformatics ; 38(14): 3668-3670, 2022 07 11.
Artigo em Inglês | MEDLINE | ID: mdl-35674373

RESUMO

MOTIVATION: The development of algorithms dedicated to RNA three-dimensional (3D) structures contributes to the demand for training, testing and benchmarking data. A reliable source of such data derived from computational prediction is the RNA-Puzzles repository. In contrast, the largest resource with experimentally determined structures is the Protein Data Bank. However, files in this archive often contain other molecular data in addition to the RNA structure itself, which-to be used by RNA processing algorithms-should be removed. RESULTS: RNAsolo is a self-updating database dedicated to RNA bioinformatics. It systematically collects experimentally determined RNA 3D structures stored in the PDB, cleans them from non-RNA chains, and groups them into equivalence classes. It allows users to download various subsets of data-clustered by resolution, source, data format, etc.-for further processing and analysis with a single click. AVAILABILITY AND IMPLEMENTATION: The repository is publicly available at https://rnasolo.cs.put.poznan.pl.


Assuntos
RNA , Software , RNA/química , Conformação de Ácido Nucleico , Biologia Computacional , Bases de Dados de Proteínas
9.
Nucleic Acids Res ; 50(W1): W663-W669, 2022 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-35349710

RESUMO

Advances in experimental and computational techniques enable the exploration of large and complex RNA 3D structures. These, in turn, reveal previously unstudied properties and motifs not characteristic for small molecules with simple architectures. Examples include entanglements of structural elements in RNA molecules and knot-like folds discovered, among others, in the genomes of RNA viruses. Recently, we presented the first classification of entanglements, determined by their topology and the type of entangled structural elements. Here, we introduce RNAspider - a web server to automatically identify, classify, and visualize primary and higher-order entanglements in RNA tertiary structures. The program applies to evaluate RNA 3D models obtained experimentally or by computational prediction. It supports the analysis of uncommon topologies in the pseudoknotted RNA structures. RNAspider is implemented as a publicly available tool with a user-friendly interface and can be freely accessed at https://rnaspider.cs.put.poznan.pl/.


Assuntos
RNA , Software , RNA/química , Conformação de Ácido Nucleico , Análise de Sequência de RNA
10.
RNA ; 28(2): 250-262, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-34819324

RESUMO

In silico prediction is a well-established approach to derive a general shape of an RNA molecule based on its sequence or secondary structure. This paper reports an analysis of the stereochemical quality of the RNA three-dimensional models predicted using dedicated computer programs. The stereochemistry of 1052 RNA 3D structures, including 1030 models predicted by fully automated and human-guided approaches within 22 RNA-Puzzles challenges and reference structures, is analyzed. The evaluation is based on standards of RNA stereochemistry that the Protein Data Bank requires from deposited experimental structures. Deviations from standard bond lengths and angles, planarity, or chirality are quantified. A reduction in the number of such deviations should help in the improvement of RNA 3D structure modeling approaches.


Assuntos
Simulação de Dinâmica Molecular/normas , RNA/química , Animais , Humanos , Conformação de Ácido Nucleico
11.
Nucleic Acids Res ; 49(17): 9625-9632, 2021 09 27.
Artigo em Inglês | MEDLINE | ID: mdl-34432024

RESUMO

Computational methods to predict RNA 3D structure have more and more practical applications in molecular biology and medicine. Therefore, it is crucial to intensify efforts to improve the accuracy and quality of predicted three-dimensional structures. A significant role in this is played by the RNA-Puzzles initiative that collects, evaluates, and shares RNAs built computationally within currently nearly 30 challenges. RNA-Puzzles datasets, subjected to multi-criteria analysis, allow revealing the strengths and weaknesses of computer prediction methods. Here, we study the issue of entangled RNA fragments in the predicted RNA 3D structure models. By entanglement, we mean an arrangement of two structural elements such that one of them passes through the other. We propose the classification of entanglements driven by their topology and components. It distinguishes two general classes, interlaces and lassos, and subclasses characterized by element types-loops, dinucleotide steps, open single-stranded fragments-and puncture multiplicity. Our computational pipeline for entanglement detection, applied for 1,017 non-redundant models from RNA-Puzzles, has shown the frequency of different entanglements and allowed identifying 138 structures with intersected assemblies.


Assuntos
Modelos Moleculares , RNA/química , Biologia Computacional , Conformação de Ácido Nucleico
12.
RNA ; 26(8): 982-995, 2020 08.
Artigo em Inglês | MEDLINE | ID: mdl-32371455

RESUMO

RNA-Puzzles is a collective endeavor dedicated to the advancement and improvement of RNA 3D structure prediction. With agreement from crystallographers, the RNA structures are predicted by various groups before the publication of the crystal structures. We now report the prediction of 3D structures for six RNA sequences: four nucleolytic ribozymes and two riboswitches. Systematic protocols for comparing models and crystal structures are described and analyzed. In these six puzzles, we discuss (i) the comparison between the automated web servers and human experts; (ii) the prediction of coaxial stacking; (iii) the prediction of structural details and ligand binding; (iv) the development of novel prediction methods; and (v) the potential improvements to be made. We show that correct prediction of coaxial stacking and tertiary contacts is essential for the prediction of RNA architecture, while ligand binding modes can only be predicted with low resolution and simultaneous prediction of RNA structure with accurate ligand binding still remains out of reach. All the predicted models are available for the future development of force field parameters and the improvement of comparison and assessment tools.


Assuntos
Aptâmeros de Nucleotídeos/química , RNA Catalítico/química , RNA/química , Sequência de Bases , Ligantes , Conformação de Ácido Nucleico , Riboswitch/genética
13.
Nucleic Acids Res ; 48(2): 576-588, 2020 01 24.
Artigo em Inglês | MEDLINE | ID: mdl-31799609

RESUMO

Significant improvements have been made in the efficiency and accuracy of RNA 3D structure prediction methods during the succeeding challenges of RNA-Puzzles, a community-wide effort on the assessment of blind prediction of RNA tertiary structures. The RNA-Puzzles contest has shown, among others, that the development and validation of computational methods for RNA fold prediction strongly depend on the benchmark datasets and the structure comparison algorithms. Yet, there has been no systematic benchmark set or decoy structures available for the 3D structure prediction of RNA, hindering the standardization of comparative tests in the modeling of RNA structure. Furthermore, there has not been a unified set of tools that allows deep and complete RNA structure analysis, and at the same time, that is easy to use. Here, we present RNA-Puzzles toolkit, a computational resource including (i) decoy sets generated by different RNA 3D structure prediction methods (raw, for-evaluation and standardized datasets), (ii) 3D structure normalization, analysis, manipulation, visualization tools (RNA_format, RNA_normalizer, rna-tools) and (iii) 3D structure comparison metric tools (RNAQUA, MCQ4Structures). This resource provides a full list of computational tools as well as a standard RNA 3D structure prediction assessment protocol for the community.


Assuntos
Biologia Computacional , Conformação de Ácido Nucleico , RNA/química , Software , Algoritmos , Benchmarking , RNA/genética
14.
Bioinformatics ; 35(1): 152-155, 2019 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-29985979

RESUMO

Motivation: In the study of 3D RNA structure, information about non-canonical interactions between nucleobases is increasingly important. Specialized databases support investigation of this issue based on experimental data, and several programs can annotate non-canonical base pairs in the RNA 3D structure. However, predicting the extended RNA secondary structure which describes both canonical and non-canonical interactions remains difficult. Results: Here, we present RNAvista that allows predicting an extended RNA secondary structure from sequence or from the list enumerating canonical base pairs only. RNAvista is implemented as a publicly available webserver with user-friendly interface. It runs on all major web browsers. Availability and implementation: http://rnavista.cs.put.poznan.pl.


Assuntos
Pareamento de Bases , Conformação de Ácido Nucleico , RNA/química , Software , Biologia Computacional
15.
BMC Bioinformatics ; 19(1): 304, 2018 Aug 22.
Artigo em Inglês | MEDLINE | ID: mdl-30134831

RESUMO

BACKGROUND: Computational RNA 3D structure prediction and modeling are rising as complementary approaches to high-resolution experimental techniques for structure determination. They often apply to substitute or complement them. Recently, researchers' interests have directed towards in silico methods to fit, remodel and refine RNA tertiary structure models. Their power lies in a problem-specific exploration of RNA conformational space and efficient optimization procedures. The aim is to improve the accuracy of models obtained either computationally or experimentally. RESULTS: Here, we present RNAfitme, a versatile webserver tool for remodeling of nucleobase- and nucleoside residue conformations in the fixed-backbone RNA 3D structures. Our approach makes use of dedicated libraries that define RNA conformational space. They have been built upon torsional angle characteristics of PDB-deposited RNA structures. RNAfitme can be applied to reconstruct full-atom model of RNA from its backbone; remodel user-selected nucleobase/nucleoside residues in a given RNA structure; predict RNA 3D structure based on the sequence and the template of a homologous molecule of the same size; refine RNA 3D model by reducing steric clashes indicated during structure quality assessment. RNAfitme is a publicly available tool with an intuitive interface. It is freely accessible at http://rnafitme.cs.put.poznan.pl/ CONCLUSIONS: RNAfitme has been applied in various RNA 3D remodeling scenarios for several types of input data. Computational experiments proved its efficiency, accuracy, and usefulness in the processing of RNAs of any size. Fidelity of RNAfitme predictions has been thoroughly tested for RNA 3D structures determined experimentally and modeled in silico.


Assuntos
Internet , Conformação de Ácido Nucleico , Nucleosídeos/genética , RNA/química , RNA/genética , Software , Algoritmos , Sequência de Bases , Glutamina/química , Modelos Moleculares , Motivos de Nucleotídeos , RNA de Transferência/química , RNA de Transferência/genética , Análise de Sequência de RNA
16.
Nucleic Acids Res ; 46(W1): W30-W35, 2018 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-29718468

RESUMO

In the field of RNA structural biology and bioinformatics, an access to correctly annotated RNA structure is of crucial importance, especially in the secondary and 3D structure predictions. RNApdbee webserver, introduced in 2014, primarily aimed to address the problem of RNA secondary structure extraction from the PDB files. Its new version, RNApdbee 2.0, is a highly advanced multifunctional tool for RNA structure annotation, revealing the relationship between RNA secondary and 3D structure given in the PDB or PDBx/mmCIF format. The upgraded version incorporates new algorithms for recognition and classification of high-ordered pseudoknots in large RNA structures. It allows analysis of isolated base pairs impact on RNA structure. It can visualize RNA secondary structures-including that of quadruplexes-with depiction of non-canonical interactions. It also annotates motifs to ease identification of stems, loops and single-stranded fragments in the input RNA structure. RNApdbee 2.0 is implemented as a publicly available webserver with an intuitive interface and can be freely accessed at http://rnapdbee.cs.put.poznan.pl/.


Assuntos
Biologia Computacional , Internet , RNA/genética , Software , Algoritmos , Pareamento de Bases/genética , Bases de Dados de Ácidos Nucleicos , Conformação de Ácido Nucleico , Estrutura Secundária de Proteína/genética , RNA/química
17.
Bioinformatics ; 34(8): 1304-1312, 2018 04 15.
Artigo em Inglês | MEDLINE | ID: mdl-29236971

RESUMO

Motivation: Understanding the formation, architecture and roles of pseudoknots in RNA structures are one of the most difficult challenges in RNA computational biology and structural bioinformatics. Methods predicting pseudoknots typically perform this with poor accuracy, often despite experimental data incorporation. Existing bioinformatic approaches differ in terms of pseudoknots' recognition and revealing their nature. A few ways of pseudoknot classification exist, most common ones refer to a genus or order. Following the latter one, we propose new algorithms that identify pseudoknots in RNA structure provided in BPSEQ format, determine their order and encode in dot-bracket-letter notation. The proposed encoding aims to illustrate the hierarchy of RNA folding. Results: New algorithms are based on dynamic programming and hybrid (combining exhaustive search and random walk) approaches. They evolved from elementary algorithm implemented within the workflow of RNA FRABASE 1.0, our database of RNA structure fragments. They use different scoring functions to rank dissimilar dot-bracket representations of RNA structure. Computational experiments show an advantage of new methods over the others, especially for large RNA structures. Availability and implementation: Presented algorithms have been implemented as new functionality of RNApdbee webserver and are ready to use at http://rnapdbee.cs.put.poznan.pl. Contact: mszachniuk@cs.put.poznan.pl. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Algoritmos , Conformação de Ácido Nucleico , Dobramento de RNA , RNA/metabolismo , Análise de Sequência de RNA/métodos , Biologia Computacional/métodos , Bases de Dados Factuais , Humanos , Modelos Moleculares , RNA/química
18.
RNA ; 23(5): 655-672, 2017 05.
Artigo em Inglês | MEDLINE | ID: mdl-28138060

RESUMO

RNA-Puzzles is a collective experiment in blind 3D RNA structure prediction. We report here a third round of RNA-Puzzles. Five puzzles, 4, 8, 12, 13, 14, all structures of riboswitch aptamers and puzzle 7, a ribozyme structure, are included in this round of the experiment. The riboswitch structures include biological binding sites for small molecules (S-adenosyl methionine, cyclic diadenosine monophosphate, 5-amino 4-imidazole carboxamide riboside 5'-triphosphate, glutamine) and proteins (YbxF), and one set describes large conformational changes between ligand-free and ligand-bound states. The Varkud satellite ribozyme is the most recently solved structure of a known large ribozyme. All puzzles have established biological functions and require structural understanding to appreciate their molecular mechanisms. Through the use of fast-track experimental data, including multidimensional chemical mapping, and accurate prediction of RNA secondary structure, a large portion of the contacts in 3D have been predicted correctly leading to similar topologies for the top ranking predictions. Template-based and homology-derived predictions could predict structures to particularly high accuracies. However, achieving biological insights from de novo prediction of RNA 3D structures still depends on the size and complexity of the RNA. Blind computational predictions of RNA structures already appear to provide useful structural information in many cases. Similar to the previous RNA-Puzzles Round II experiment, the prediction of non-Watson-Crick interactions and the observed high atomic clash scores reveal a notable need for an algorithm of improvement. All prediction models and assessment results are available at http://ahsoka.u-strasbg.fr/rnapuzzles/.


Assuntos
RNA Catalítico/química , Riboswitch , Aminoimidazol Carboxamida/química , Aminoimidazol Carboxamida/metabolismo , Aptâmeros de Nucleotídeos/química , Aptâmeros de Nucleotídeos/metabolismo , Fosfatos de Dinucleosídeos/metabolismo , Endorribonucleases/química , Endorribonucleases/metabolismo , Glutamina/química , Glutamina/metabolismo , Ligantes , Modelos Moleculares , Conformação de Ácido Nucleico , RNA Catalítico/metabolismo , Ribonucleotídeos/química , Ribonucleotídeos/metabolismo , S-Adenosilmetionina/química , S-Adenosilmetionina/metabolismo
19.
Acta Biochim Pol ; 63(4): 737-744, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-27741327

RESUMO

RNAComposer is a fully automated, web-interfaced system for RNA 3D structure prediction, freely available at http://rnacomposer.cs.put.poznan.pl/ and http://rnacomposer.ibch.poznan.pl/. Its main components are: manually curated database of RNA 3D structure elements, highly efficient computational engine and user-friendly web application. In this paper, we demonstrate how the latest additions to the system allow the user to significantly affect the process of 3D model composition on several computational levels. Although in general our method is based on the knowledge of secondary structure topology, currently the RNAComposer offers a choice of six incorporated programs for secondary structure prediction. It also allows to apply a conditional search in the database of 3D structure elements and introduce user-provided elements into the final 3D model. This new functionality contributes to a significant improvement of the predicted 3D model reliability and it facilitates a better model adjustment to the experimental data. This is exemplified based on the RNAComposer application for modelling of the 3D structures of precursors of the miR160 family members.


Assuntos
MicroRNAs/química , Software , Animais , Sequência de Bases , Humanos , Modelos Moleculares , Conformação de Ácido Nucleico , Análise de Sequência de RNA
20.
BMC Bioinformatics ; 17: 383, 2016 Sep 17.
Artigo em Inglês | MEDLINE | ID: mdl-27639380

RESUMO

BACKGROUND: Structural alignment of proteins is one of the most challenging problems in molecular biology. The tertiary structure of a protein strictly correlates with its function and computationally predicted structures are nowadays a main premise for understanding the latter. However, computationally derived 3D models often exhibit deviations from the native structure. A way to confirm a model is a comparison with other structures. The structural alignment of a pair of proteins can be defined with the use of a concept of protein descriptors. The protein descriptors are local substructures of protein molecules, which allow us to divide the original problem into a set of subproblems and, consequently, to propose a more efficient algorithmic solution. In the literature, one can find many applications of the descriptors concept that prove its usefulness for insight into protein 3D structures, but the proposed approaches are presented rather from the biological perspective than from the computational or algorithmic point of view. Efficient algorithms for identification and structural comparison of descriptors can become crucial components of methods for structural quality assessment as well as tertiary structure prediction. RESULTS: In this paper, we propose a new combinatorial model and new polynomial-time algorithms for the structural alignment of descriptors. The model is based on the maximum-size assignment problem, which we define here and prove that it can be solved in polynomial time. We demonstrate suitability of this approach by comparison with an exact backtracking algorithm. Besides a simplification coming from the combinatorial modeling, both on the conceptual and complexity level, we gain with this approach high quality of obtained results, in terms of 3D alignment accuracy and processing efficiency. CONCLUSIONS: All the proposed algorithms were developed and integrated in a computationally efficient tool descs-standalone, which allows the user to identify and structurally compare descriptors of biological molecules, such as proteins and RNAs. Both PDB (Protein Data Bank) and mmCIF (macromolecular Crystallographic Information File) formats are supported. The proposed tool is available as an open source project stored on GitHub ( https://github.com/mantczak/descs-standalone ).


Assuntos
Proteínas/química , Alinhamento de Sequência/métodos , Algoritmos , Sequência de Aminoácidos , Bases de Dados de Proteínas , Modelos Moleculares , Fatores de Tempo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA