RESUMO
ChannelsDB 2.0 is an updated database providing structural information about the position, geometry and physicochemical properties of protein channels-tunnels and pores-within deposited biomacromolecular structures from PDB and AlphaFoldDB databases. The newly deposited information originated from several sources. Firstly, we included data calculated using a popular CAVER tool to complement the data obtained using original MOLE tool for detection and analysis of protein tunnels and pores. Secondly, we added tunnels starting from cofactors within the AlphaFill database to enlarge the scope of the database to protein models based on Uniprot. This has enlarged available channel annotations â¼4.6 times as of 1 September 2023. The database stores information about geometrical features, e.g. length and radius, and physico-chemical properties based on channel-lining amino acids. The stored data are interlinked with the available UniProt mutation annotation data. ChannelsDB 2.0 provides an excellent resource for deep analysis of the role of biomacromolecular tunnels and pores. The database is available free of charge: https://channelsdb2.biodata.ceitec.cz.
Assuntos
Bases de Dados de Proteínas , Proteínas , Software , Aminoácidos , Proteínas/química , Conformação ProteicaRESUMO
Recombinant proteins play pivotal roles in numerous applications including industrial biocatalysts or therapeutics. Despite the recent progress in computational protein structure prediction, protein solubility and reduced aggregation propensity remain challenging attributes to design. Identification of aggregation-prone regions is essential for understanding misfolding diseases or designing efficient protein-based technologies, and as such has a great socio-economic impact. Here, we introduce AggreProt, a user-friendly webserver that automatically exploits an ensemble of deep neural networks to predict aggregation-prone regions (APRs) in protein sequences. Trained on experimentally evaluated hexapeptides, AggreProt compares to or outperforms state-of-the-art algorithms on two independent benchmark datasets. The server provides per-residue aggregation profiles along with information on solvent accessibility and transmembrane propensity within an intuitive interface with interactive sequence and structure viewers for comprehensive analysis. We demonstrate AggreProt efficacy in predicting differential aggregation behaviours in proteins on several use cases, which emphasize its potential for guiding protein engineering strategies towards decreased aggregation propensity and improved solubility. The webserver is freely available and accessible at https://loschmidt.chemi.muni.cz/aggreprot/.
Assuntos
Internet , Agregados Proteicos , Software , Engenharia de Proteínas/métodos , Algoritmos , Proteínas/química , Proteínas/genética , Redes Neurais de Computação , Dobramento de Proteína , Solubilidade , Conformação ProteicaRESUMO
Thermostable proteins find their use in numerous biomedical and biotechnological applications. However, the computational design of stable proteins often results in single-point mutations with a limited effect on protein stability. However, the construction of stable multiple-point mutants can prove difficult due to the possibility of antagonistic effects between individual mutations. FireProt protocol enables the automated computational design of highly stable multiple-point mutants. FireProt 2.0 builds on top of the previously published FireProt web, retaining the original functionality and expanding it with several new stabilization strategies. FireProt 2.0 integrates the AlphaFold database and the homology modeling for structure prediction, enabling calculations starting from a sequence. Multiple-point designs are constructed using the Bron-Kerbosch algorithm minimizing the antagonistic effect between the individual mutations. Users can newly limit the FireProt calculation to a set of user-defined mutations, run a saturation mutagenesis of the whole protein or select rigidifying mutations based on B-factors. Evolution-based back-to-consensus strategy is complemented by ancestral sequence reconstruction. FireProt 2.0 is significantly faster and a reworked graphical user interface broadens the tool's availability even to users with older hardware. FireProt 2.0 is freely available at http://loschmidt.chemi.muni.cz/fireprotweb.
Assuntos
Algoritmos , Proteínas , Proteínas/genética , Proteínas/química , Mutação , Estabilidade Proteica , InternetRESUMO
PredictONCO 1.0 is a unique web server that analyzes effects of mutations on proteins frequently altered in various cancer types. The server can assess the impact of mutations on the protein sequential and structural properties and apply a virtual screening to identify potential inhibitors that could be used as a highly individualized therapeutic approach, possibly based on the drug repurposing. PredictONCO integrates predictive algorithms and state-of-the-art computational tools combined with information from established databases. The user interface was carefully designed for the target specialists in precision oncology, molecular pathology, clinical genetics and clinical sciences. The tool summarizes the effect of the mutation on protein stability and function and currently covers 44 common oncological targets. The binding affinities of Food and Drug Administration/ European Medicines Agency -approved drugs with the wild-type and mutant proteins are calculated to facilitate treatment decisions. The reliability of predictions was confirmed against 108 clinically validated mutations. The server provides a fast and compact output, ideal for the often time-sensitive decision-making process in oncology. Three use cases of missense mutations, (i) K22A in cyclin-dependent kinase 4 identified in melanoma, (ii) E1197K mutation in anaplastic lymphoma kinase 4 identified in lung carcinoma and (iii) V765A mutation in epidermal growth factor receptor in a patient with congenital mismatch repair deficiency highlight how the tool can increase levels of confidence regarding the pathogenicity of the variants and identify the most effective inhibitors. The server is available at https://loschmidt.chemi.muni.cz/predictonco.
Assuntos
Melanoma , Medicina de Precisão , Humanos , Reprodutibilidade dos Testes , Biologia Computacional , Mutação , Proteínas , Aprendizado de MáquinaRESUMO
SUMMARY: Protein design requires information about how mutations affect protein stability. Many web-based predictors are available for this purpose, yet comparing them or using them en masse is difficult. Here, we present BenchStab, a console tool/Python package for easy and quick execution of 19 predictors and result collection on a list of mutants. Moreover, the tool is easily extensible with additional predictors. We created an independent dataset derived from the FireProtDB and evaluated 24 different prediction methods. AVAILABILITY AND IMPLEMENTATION: BenchStab is an open-source Python package available at https://github.com/loschmidt/BenchStab with a detailed README and example usage at https://loschmidt.chemi.muni.cz/benchstab. The BenchStab dataset is available on Zenodo: https://zenodo.org/records/10637728.
Assuntos
Internet , Software , Estabilidade Proteica , Proteínas/química , Biologia Computacional/métodos , Bases de Dados de ProteínasRESUMO
SUMMARY: Access pathways in enzymes are crucial for the passage of substrates and products of catalysed reactions. The process can be studied by computational means with variable degrees of precision. Our in-house approximative method CaverDock provides a fast and easy way to set up and run ligand binding and unbinding calculations through protein tunnels and channels. Here we introduce pyCaverDock, a Python3 API designed to improve user experience with the tool and further facilitate the ligand transport analyses. The API enables users to simplify the steps needed to use CaverDock, from automatizing setup processes to designing screening pipelines. AVAILABILITY AND IMPLEMENTATION: pyCaverDock API is implemented in Python 3 and is freely available with detailed documentation and practical examples at https://loschmidt.chemi.muni.cz/caverdock/.
Assuntos
Proteínas , Software , LigantesRESUMO
The importance of the quantitative description of protein unfolding and aggregation for the rational design of stability or understanding the molecular basis of protein misfolding diseases is well established. Protein thermostability is typically assessed by calorimetric or spectroscopic techniques that monitor different complementary signals during unfolding. The CalFitter webserver has already proved integral to deriving invaluable energy parameters by global data analysis. Here, we introduce CalFitter 2.0, which newly incorporates singular value decomposition (SVD) of multi-wavelength spectral datasets into the global fitting pipeline. Processed time- or temperature-evolved SVD components can now be fitted together with other experimental data types. Moreover, deconvoluted basis spectra provide spectral fingerprints of relevant macrostates populated during unfolding, which greatly enriches the information gains of the CalFitter output. The SVD analysis is fully automated in a highly interactive module, providing access to the results to users without any prior knowledge of the underlying mathematics. Additionally, a novel data uploading wizard has been implemented to facilitate rapid and easy uploading of multiple datasets. Together, the newly introduced changes significantly improve the user experience, making this software a unique, robust, and interactive platform for the analysis of protein thermal denaturation data. The webserver is freely accessible at https://loschmidt.chemi.muni.cz/calfitter.
Assuntos
Desdobramento de Proteína , Proteínas , Proteínas/química , Software , Temperatura , Desnaturação ProteicaRESUMO
The transplantation of loops between structurally related proteins is a compelling method to improve the activity, specificity and stability of enzymes. However, despite the interest of loop regions in protein engineering, the available methods of loop-based rational protein design are scarce. One particular difficulty related to loop engineering is the unique dynamism that enables them to exert allosteric control over the catalytic function of enzymes. Thus, when engaging in a transplantation effort, such dynamics in the context of protein structure need consideration. A second practical challenge is identifying successful excision points for the transplantation or grafting. Here, we present LoopGrafter (https://loschmidt.chemi.muni.cz/loopgrafter/), a web server that specifically guides in the loop grafting process between structurally related proteins. The server provides a step-by-step interactive procedure in which the user can successively identify loops in the two input proteins, calculate their geometries, assess their similarities and dynamics, and select a number of loops to be transplanted. All possible different chimeric proteins derived from any existing recombination point are calculated, and 3D models for each of them are constructed and energetically evaluated. The obtained results can be interactively visualized in a user-friendly graphical interface and downloaded for detailed structural analyses.
Assuntos
Proteínas , Software , Modelos Moleculares , Proteínas/genética , Proteínas/química , Engenharia de Proteínas , InternetRESUMO
The development of microbial products for cancer treatment has been in the spotlight in recent years. In order to accelerate the lengthy and expensive drug development process, in silico screening tools are systematically employed, especially during the initial discovery phase. Moreover, considering the steadily increasing number of molecules approved by authorities for commercial use, there is a demand for faster methods to repurpose such drugs. Here we present a review on virtual screening web tools, such as publicly available databases of molecular targets and libraries of ligands, with the aim to facilitate the discovery of potential anticancer drugs based on microbial products. We provide an entry-level step-by-step description of the workflow for virtual screening of microbial metabolites with known protein targets, as well as two practical examples using freely available web tools. The first case presents a virtual screening study of drugs developed from microbial products using Caver Web, a web tool that performs docking along a tunnel. The second case comprises a comparative analysis between a wild type isocitrate dehydrogenase 1 and a mutant that results in cancer, using the recently developed web tool PredictSNPOnco. In summary, this review provides the basic and essential background information necessary for virtual screening experiments, which may accelerate the discovery of novel anticancer drugs.
Assuntos
Antineoplásicos , Humanos , Ligantes , Antineoplásicos/farmacologia , Antineoplásicos/uso terapêuticoRESUMO
There is a great interest in increasing proteins' stability to widen their usability in numerous biomedical and biotechnological applications. However, native proteins cannot usually withstand the harsh industrial environment, since they are evolved to function under mild conditions. Ancestral sequence reconstruction is a well-established method for deducing the evolutionary history of genes. Besides its applicability to discover the most probable evolutionary ancestors of the modern proteins, ancestral sequence reconstruction has proven to be a useful approach for the design of highly stable proteins. Recently, several computational tools were developed, which make the ancestral reconstruction algorithms accessible to the community, while leaving the most crucial steps of the preparation of the input data on users' side. FireProtASR aims to overcome this obstacle by constructing a fully automated workflow, allowing even the unexperienced users to obtain ancestral sequences based on a sequence query as the only input. FireProtASR is complemented with an interactive, easy-to-use web interface and is freely available at https://loschmidt.chemi.muni.cz/fireprotasr/.
Assuntos
Algoritmos , Bases de Dados de Proteínas , Evolução Molecular , Proteínas/genética , Análise de Sequência de Proteína , Software , Biologia Computacional , Alinhamento de SequênciaRESUMO
The majority of naturally occurring proteins have evolved to function under mild conditions inside the living organisms. One of the critical obstacles for the use of proteins in biotechnological applications is their insufficient stability at elevated temperatures or in the presence of salts. Since experimental screening for stabilizing mutations is typically laborious and expensive, in silico predictors are often used for narrowing down the mutational landscape. The recent advances in machine learning and artificial intelligence further facilitate the development of such computational tools. However, the accuracy of these predictors strongly depends on the quality and amount of data used for training and testing, which have often been reported as the current bottleneck of the approach. To address this problem, we present a novel database of experimental thermostability data for single-point mutants FireProtDB. The database combines the published datasets, data extracted manually from the recent literature, and the data collected in our laboratory. Its user interface is designed to facilitate both types of the expected use: (i) the interactive explorations of individual entries on the level of a protein or mutation and (ii) the construction of highly customized and machine learning-friendly datasets using advanced searching and filtering. The database is freely available at https://loschmidt.chemi.muni.cz/fireprotdb.
Assuntos
Biologia Computacional/métodos , Bases de Dados de Proteínas , Aprendizado de Máquina/estatística & dados numéricos , Mutação Puntual , Proteínas/química , Conjuntos de Dados como Assunto , Internet , Modelos Moleculares , Anotação de Sequência Molecular , Estabilidade Proteica , Proteínas/genética , SoftwareRESUMO
The choroid plexus (ChP) produces and is bathed in the cerebrospinal fluid (CSF), which in aging and Alzheimer's disease (AD) shows extensive proteomic alterations including evidence of inflammation. Considering inflammation hampers functions of the involved tissues, the CSF abnormalities reported in these conditions are suggestive of ChP injury. Indeed, several studies document ChP damage in aging and AD, which nevertheless remains to be systematically characterized. We here report that the changes elicited in the CSF by AD are consistent with a perturbed aging process and accompanied by aberrant accumulation of inflammatory signals and metabolically active proteins in the ChP. Magnetic resonance imaging (MRI) imaging shows that these molecular aberrancies correspond to significant remodeling of ChP in AD, which correlates with aging and cognitive decline. Collectively, our preliminary post-mortem and in vivo findings reveal a repertoire of ChP pathologies indicative of its dysfunction and involvement in the pathogenesis of AD. HIGHLIGHTS: Cerebrospinal fluid changes associated with aging are perturbed in Alzheimer's disease Paradoxically, in Alzheimer's disease, the choroid plexus exhibits increased cytokine levels without evidence of inflammatory activation or infiltrates In Alzheimer's disease, increased choroid plexus volumes correlate with age and cognitive performance.
Assuntos
Doença de Alzheimer , Humanos , Doença de Alzheimer/patologia , Plexo Corióideo/metabolismo , Plexo Corióideo/patologia , Proteômica , Envelhecimento , InflamaçãoRESUMO
Stroke burden is substantially increasing but current therapeutic drugs are still far from ideal. Here we highlight the vast potential of staphylokinase as an efficient, fibrin-selective, inexpensive, and evolvable thrombolytic agent. The emphasis is escalated by new recent findings. Staphylokinase nonimmunogenic variant was proven noninferior to alteplase in a clinical trial, with decreased risk of intracranial hemorrhage and the advantage of single bolus administration. Furthermore, our detailed kinetic analysis revealed a new staphylokinase limiting bottleneck whose elimination might provide up to 1000-fold higher activity than the clinically approved alteplase. This knowledge of limitations unlocks new possibilities for improvements that are now achievable by the community of protein engineers who have the required expertise and are ready to transform staphylokinase into a powerful molecule. Collectively, the noninferiority and safety of nonimmunogenic staphylokinase together with the newly identified effectivity limitation make staphylokinase a perfect candidate for further exploration, modification, and advancement to make it the next-generation widely accessible thrombolytic drug effectively treating stroke all around the world, including middle- and low-income countries.
Assuntos
Fibrinolíticos , Acidente Vascular Cerebral , Fibrina , Fibrinolíticos/uso terapêutico , Humanos , Cinética , Metaloendopeptidases/metabolismo , Metaloendopeptidases/uso terapêutico , Acidente Vascular Cerebral/tratamento farmacológico , Terapia Trombolítica , Ativador de Plasminogênio Tecidual/uso terapêuticoRESUMO
MOTIVATION: Poor protein solubility hinders the production of many therapeutic and industrially useful proteins. Experimental efforts to increase solubility are plagued by low success rates and often reduce biological activity. Computational prediction of protein expressibility and solubility in Escherichia coli using only sequence information could reduce the cost of experimental studies by enabling prioritization of highly soluble proteins. RESULTS: A new tool for sequence-based prediction of soluble protein expression in E.coli, SoluProt, was created using the gradient boosting machine technique with the TargetTrack database as a training set. When evaluated against a balanced independent test set derived from the NESG database, SoluProt's accuracy of 58.5% and AUC of 0.62 exceeded those of a suite of alternative solubility prediction tools. There is also evidence that it could significantly increase the success rate of experimental protein studies. SoluProt is freely available as a standalone program and a user-friendly webserver at https://loschmidt.chemi.muni.cz/soluprot/. AVAILABILITY AND IMPLEMENTATION: https://loschmidt.chemi.muni.cz/soluprot/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
RESUMO
Millions of protein sequences are being discovered at an incredible pace, representing an inexhaustible source of biocatalysts. Despite genomic databases growing exponentially, classical biochemical characterization techniques are time-demanding, cost-ineffective and low-throughput. Therefore, computational methods are being developed to explore the unmapped sequence space efficiently. Selection of putative enzymes for biochemical characterization based on rational and robust analysis of all available sequences remains an unsolved problem. To address this challenge, we have developed EnzymeMiner-a web server for automated screening and annotation of diverse family members that enables selection of hits for wet-lab experiments. EnzymeMiner prioritizes sequences that are more likely to preserve the catalytic activity and are heterologously expressible in a soluble form in Escherichia coli. The solubility prediction employs the in-house SoluProt predictor developed using machine learning. EnzymeMiner reduces the time devoted to data gathering, multi-step analysis, sequence prioritization and selection from days to hours. The successful use case for the haloalkane dehalogenase family is described in a comprehensive tutorial available on the EnzymeMiner web page. EnzymeMiner is a universal tool applicable to any enzyme family that provides an interactive and easy-to-use web interface freely available at https://loschmidt.chemi.muni.cz/enzymeminer/.
Assuntos
Enzimas/química , Software , Biocatálise , Estabilidade Enzimática , Enzimas/metabolismo , Hidrolases/química , Análise de Sequência de Proteína , Homologia de Sequência de Aminoácidos , SolubilidadeRESUMO
Rational enzyme design presents a major challenge that has not been overcome by computational approaches. One of the key challenges is the difficulty in assessing the magnitude of the maximum possible catalytic activity. In an attempt to overcome this challenge, we introduce a strategy that takes an active enzyme (assuming that its activity is close to the maximum possible activity), design mutations that reduce the catalytic activity, and then try to restore that catalysis by mutating other residues. Here we take as a test case the enzyme haloalkane dehalogenase (DhlA), with a 1,2-dichloroethane substrate. We start by demonstrating our ability to reproduce the results of single mutations. Next, we design mutations that reduce the enzyme activity and finally design double mutations that are aimed at restoring the activity. Using the computational predictions as a guide, we conduct an experimental study that confirms our prediction in one case and leads to inconclusive results in another case with 1,2-dichloroethane as substrate. Interestingly, one of our predicted double mutants catalyzes dehalogenation of 1,2-dibromoethane more efficiently than the wild-type enzyme.
Assuntos
Simulação por Computador , Dicloretos de Etileno/química , Hidrolases/química , Modelos Químicos , Modelos Moleculares , Domínio Catalítico , Especificidade por SubstratoRESUMO
Alzheimer's disease (AD) is a chronic neurodegenerative disease associated with the overproduction and accumulation of amyloid-ß peptide and hyperphosphorylation of tau proteins in the brain. Despite extensive research on the amyloid-based mechanism of AD pathogenesis, the underlying cause of AD is not fully understood. No disease-modifying therapies currently exist, and numerous clinical trials have failed to demonstrate any benefits. The recent discovery that the amyloid-ß peptide has antimicrobial activities supports the possibility of an infectious aetiology of AD and suggests that amyloid-ß plaque formation might be induced by infection. AD patients have a weakened blood-brain barrier and immune system and are thus at elevated risk of microbial infections. Such infections can cause chronic neuroinflammation, production of the antimicrobial amyloid-ß peptide, and neurodegeneration. Various pathogens, including viruses, bacteria, fungi, and parasites have been associated with AD. Most research in this area has focused on individual pathogens, with herpesviruses and periodontal bacteria being most frequently implicated. The purpose of this review is to highlight the potential role of multi-pathogen infections in AD. Recognition of the potential coexistence of multiple pathogens and biofilms in AD's aetiology may stimulate the development of novel approaches to its diagnosis and treatment. Multiple diagnostic tests could be applied simultaneously to detect major pathogens, followed by anti-microbial treatment using antiviral, antibacterial, antifungal, and anti-biofilm agents.
Assuntos
Doença de Alzheimer/microbiologia , Doença de Alzheimer/tratamento farmacológico , Animais , Anti-Infecciosos/uso terapêutico , Antivirais/farmacologia , Antivirais/uso terapêutico , Biofilmes/efeitos dos fármacos , HumanosRESUMO
Caver Web 1.0 is a web server for comprehensive analysis of protein tunnels and channels, and study of the ligands' transport through these transport pathways. Caver Web is the first interactive tool allowing both the analyses within a single graphical user interface. The server is built on top of the abundantly used tunnel detection tool Caver 3.02 and CaverDock 1.0 enabling the study of the ligand transport. The program is easy-to-use as the only required inputs are a protein structure for a tunnel identification and a list of ligands for the transport analysis. The automated guidance procedures assist the users to set up the calculation in a way to obtain biologically relevant results. The identified tunnels, their properties, energy profiles and trajectories for ligands' passages can be calculated and visualized. The tool is very fast (2-20 min per job) and is applicable even for virtual screening purposes. Its simple setup and comprehensive graphical user interface make the tool accessible for a broad scientific community. The server is freely available at https://loschmidt.chemi.muni.cz/caverweb.
Assuntos
Algoritmos , Proteínas de Transporte/química , Biologia Computacional/métodos , Interface Usuário-Computador , Sequência de Aminoácidos , Animais , Benchmarking , Sítios de Ligação , Proteínas de Transporte/metabolismo , Humanos , Internet , Ligantes , Simulação de Acoplamento Molecular , Ligação Proteica , Domínios e Motivos de Interação entre Proteínas , Estrutura Quaternária de Proteína , Estrutura Terciária de ProteínaRESUMO
Haloalkane dehalogenases (EC 3.8.1.5) play an important role in hydrolytic degradation of halogenated compounds, resulting in a halide ion, a proton, and an alcohol. They are used in biocatalysis, bioremediation, and biosensing of environmental pollutants and also for molecular tagging in cell biology. The method of ancestral sequence reconstruction leads to prediction of sequences of ancestral enzymes allowing their experimental characterization. Based on the sequences of modern haloalkane dehalogenases from the subfamily II, the most common ancestor of thoroughly characterized enzymes LinB from Sphingobium japonicum UT26 and DmbA from Mycobacterium bovis 5033/66 was in silico predicted, recombinantly produced and structurally characterized. The ancestral enzyme AncLinB-DmbA was crystallized using the sitting-drop vapor-diffusion method, yielding rod-like crystals that diffracted X-rays to 1.5 Å resolution. Structural comparison of AncLinB-DmbA with their closely related descendants LinB and DmbA revealed some differences in overall structure and tunnel architecture. Newly prepared AncLinB-DmbA has the highest active site cavity volume and the biggest entrance radius on the main tunnel in comparison to descendant enzymes. Ancestral sequence reconstruction is a powerful technique to study molecular evolution and design robust proteins for enzyme technologies.
Assuntos
Hidrolases/química , Mycobacterium bovis/enzimologia , Sphingomonadaceae/enzimologia , Sítios de Ligação , Domínio Catalítico , Cristalografia por Raios X/métodos , Evolução Molecular , Hidrolases/metabolismo , Hidrólise , Modelos Moleculares , Engenharia de Proteínas/métodos , Análise de Sequência de Proteína/métodosRESUMO
MOTIVATION: Protein tunnels and channels are key transport pathways that allow ligands to pass between proteins' external and internal environments. These functionally important structural features warrant detailed attention. It is difficult to study the ligand binding and unbinding processes experimentally, while molecular dynamics simulations can be time-consuming and computationally demanding. RESULTS: CaverDock is a new software tool for analysing the ligand passage through the biomolecules. The method uses the optimized docking algorithm of AutoDock Vina for ligand placement docking and implements a parallel heuristic algorithm to search the space of possible trajectories. The duration of the simulations takes from minutes to a few hours. Here we describe the implementation of the method and demonstrate CaverDock's usability by: (i) comparison of the results with other available tools, (ii) determination of the robustness with large ensembles of ligands and (iii) the analysis and comparison of the ligand trajectories in engineered tunnels. Thorough testing confirms that CaverDock is applicable for the fast analysis of ligand binding and unbinding in fundamental enzymology and protein engineering. AVAILABILITY AND IMPLEMENTATION: User guide and binaries for Ubuntu are freely available for non-commercial use at https://loschmidt.chemi.muni.cz/caverdock/. The web implementation is available at https://loschmidt.chemi.muni.cz/caverweb/. The source code is available upon request. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.