Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 20
Filtrar
1.
Nucleic Acids Res ; 49(W1): W559-W566, 2021 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-34019657

RESUMO

The experimental characterization and computational prediction of protein structures has become increasingly rapid and precise. However, the analysis of protein structures often requires researchers to use several software packages or web servers, which complicates matters. To provide long-established structural analyses in a modern, easy-to-use interface, we implemented ProteinTools, a web server toolkit for protein structure analysis. ProteinTools gathers four applications so far, namely the identification of hydrophobic clusters, hydrogen bond networks, salt bridges, and contact maps. In all cases, the input data is a PDB identifier or an uploaded structure, whereas the output is an interactive dynamic web interface. Thanks to the modular nature of ProteinTools, the addition of new applications will become an easy task. Given the current need to have these tools in a single, fast, and interpretable interface, we believe that ProteinTools will become an essential toolkit for the wider protein research community. The web server is available at https://proteintools.uni-bayreuth.de.


Assuntos
Conformação Proteica , Proteínas/química , Software , Aminoácidos/química , Ligação de Hidrogênio , Interações Hidrofóbicas e Hidrofílicas , Modelos Moleculares
2.
Bioinformatics ; 37(19): 3182-3189, 2021 Oct 11.
Artigo em Inglês | MEDLINE | ID: mdl-33901273

RESUMO

MOTIVATION: Duplication and recombination of protein fragments have led to the highly diverse protein space that we observe today. By mimicking this natural process, the design of protein chimeras via fragment recombination has proven experimentally successful and has opened a new era for the design of customizable proteins. The in silico building of structural models for these chimeric proteins, however, remains a manual task that requires a considerable degree of expertise and is not amenable for high-throughput studies. Energetic and structural analysis of the designed proteins often require the use of several tools, each with their unique technical difficulties and available in different programming languages or web servers. RESULTS: We implemented a Python package that enables automated, high-throughput design of chimeras and their structural analysis. First, it fetches evolutionarily conserved fragments from a built-in database (also available at fuzzle.uni-bayreuth.de). These relationships can then be represented via networks or further selected for chimera construction via recombination. Designed chimeras or natural proteins are then scored and minimized with the Charmm and Amber forcefields and their diverse structural features can be analyzed at ease. Here, we showcase Protlego's pipeline by exploring the relationships between the P-loop and Rossmann superfolds, building and characterizing their offspring chimeras. We believe that Protlego provides a powerful new tool for the protein design community. AVAILABILITY AND IMPLEMENTATION: Protlego runs on the Linux platform and is freely available at (https://hoecker-lab.github.io/protlego/) with tutorials and documentation. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

3.
J Chem Inf Model ; 55(10): 2200-5, 2015 Oct 26.
Artigo em Inglês | MEDLINE | ID: mdl-26376295

RESUMO

Novel bioactive molecules can be rationally designed by growing and linking small fragments. Because fragments are fast and promiscuous, it is common to have contradictory hit results between different experimental screening techniques. Here, we simultaneously determine fragment binding poses, affinities, and kinetics on a focused library of 42 fragments against the serine protease factor Xa using multimillisecond molecular dynamics simulations. We predict experimental poses of 12 over 15 S1 crystal structures, and affinities are recovered for 4 out of 6. A kinetic map of protein cavities is computed in terms of on- and off-rates as well as insights into secondary ligand poses. The results suggest that the approach can be useful to recapitulate discordant fragment screening data.


Assuntos
Fator Xa/química , Simulação de Dinâmica Molecular , Fragmentos de Peptídeos/química , Bibliotecas de Moléculas Pequenas/química , Bioensaio , Domínio Catalítico , Humanos , Ligação Proteica , Termodinâmica
4.
J Chem Inf Model ; 53(12): 3123-6, 2013 Dec 23.
Artigo em Inglês | MEDLINE | ID: mdl-24219403

RESUMO

Extracellular S468R mutation of the epidermal growth factor receptor (EGFR) was recently identified as the cause of resistance to cetuximab, a widely used drug in colorectal cancer treatment. Here, we have determined the binding free energies of cetuximab's Fab V(H)-V(L) domains and endogenous EGF ligand to wild type and S468R EGFR by high-throughput molecular dynamics. This work provides a possible mechanism of resistance in terms of increased competition, an hypothesis that can be further validated experimentally.


Assuntos
Anticorpos Monoclonais Humanizados/química , Antineoplásicos/química , Receptores ErbB/química , Simulação de Dinâmica Molecular , Anticorpos de Domínio Único/química , Cetuximab , Neoplasias Colorretais/tratamento farmacológico , Resistencia a Medicamentos Antineoplásicos/genética , Fator de Crescimento Epidérmico/química , Receptores ErbB/antagonistas & inibidores , Receptores ErbB/genética , Humanos , Ligantes , Mutação , Neuregulina-1/química , Termodinâmica , Fator de Crescimento Transformador alfa/química
5.
Artigo em Inglês | MEDLINE | ID: mdl-37848247

RESUMO

Recent advancements in specialized large-scale architectures for training images and language have profoundly impacted the field of computer vision and natural language processing (NLP). Language models, such as the recent ChatGPT and GPT-4, have demonstrated exceptional capabilities in processing, translating, and generating human language. These breakthroughs have also been reflected in protein research, leading to the rapid development of numerous new methods in a short time, with unprecedented performance. Several of these models have been developed with the goal of generating sequences in novel regions of the protein space. In this work, we provide an overview of the use of protein generative models, reviewing (1) language models for the design of novel artificial proteins, (2) works that use non-transformer architectures, and (3) applications in directed evolution approaches.

6.
Comput Struct Biotechnol J ; 21: 238-250, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36544476

RESUMO

The process of designing biomolecules, in particular proteins, is witnessing a rapid change in available tooling and approaches, moving from design through physicochemical force fields, to producing plausible, complex sequences fast via end-to-end differentiable statistical models. To achieve conditional and controllable protein design, researchers at the interface of artificial intelligence and biology leverage advances in natural language processing (NLP) and computer vision techniques, coupled with advances in computing hardware to learn patterns from growing biological databases, curated annotations thereof, or both. Once learned, these patterns can be used to provide novel insights into mechanistic biology and the design of biomolecules. However, navigating and understanding the practical applications for the many recent protein design tools is complex. To facilitate this, we 1) document recent advances in deep learning (DL) assisted protein design from the last three years, 2) present a practical pipeline that allows to go from de novo-generated sequences to their predicted properties and web-powered visualization within minutes, and 3) leverage it to suggest a generated protein sequence which might be used to engineer a biosynthetic gene cluster to produce a molecular glue-like compound. Lastly, we discuss challenges and highlight opportunities for the protein design field.

7.
ACS Synth Biol ; 12(10): 2812-2818, 2023 10 20.
Artigo em Inglês | MEDLINE | ID: mdl-37703075

RESUMO

Epitopes are specific regions on an antigen's surface that the immune system recognizes. Epitopes are usually protein regions on foreign immune-stimulating entities such as viruses and bacteria, and in some cases, endogenous proteins may act as antigens. Identifying epitopes is crucial for accelerating the development of vaccines and immunotherapies. However, mapping epitopes in pathogen proteomes is challenging using conventional methods. Screening artificial neoepitope libraries against antibodies can overcome this issue. Here, we applied conventional sequence analysis and methods inspired in natural language processing to reveal specific sequence patterns in the linear epitopes deposited in the Immune Epitope Database (www.iedb.org) that can serve as building blocks for the design of universal epitope libraries. Our results reveal that amino acid frequency in annotated linear epitopes differs from that in the human proteome. Aromatic residues are overrepresented, while the presence of cysteines is practically null in epitopes. Byte pair encoding tokenization shows high frequencies of tryptophan in tokens of 5, 6, and 7 amino acids, corroborating the findings of the conventional sequence analysis. These results can be applied to reduce the diversity of linear epitope libraries by orders of magnitude.


Assuntos
Vírus , Humanos , Epitopos/genética , Sequência de Aminoácidos , Mapeamento de Epitopos/métodos , Proteoma , Aminoácidos
8.
Nat Commun ; 13(1): 4348, 2022 07 27.
Artigo em Inglês | MEDLINE | ID: mdl-35896542

RESUMO

Protein design aims to build novel proteins customized for specific purposes, thereby holding the potential to tackle many environmental and biomedical problems. Recent progress in Transformer-based architectures has enabled the implementation of language models capable of generating text with human-like capabilities. Here, motivated by this success, we describe ProtGPT2, a language model trained on the protein space that generates de novo protein sequences following the principles of natural ones. The generated proteins display natural amino acid propensities, while disorder predictions indicate that 88% of ProtGPT2-generated proteins are globular, in line with natural sequences. Sensitive sequence searches in protein databases show that ProtGPT2 sequences are distantly related to natural ones, and similarity networks further demonstrate that ProtGPT2 is sampling unexplored regions of protein space. AlphaFold prediction of ProtGPT2-sequences yields well-folded non-idealized structures with embodiments and large loops and reveals topologies not captured in current structure databases. ProtGPT2 generates sequences in a matter of seconds and is freely available.


Assuntos
Idioma , Proteínas , Sequência de Aminoácidos , Aminoácidos , Bases de Dados de Proteínas , Humanos , Proteínas/química , Proteínas/genética
10.
Structure ; 29(5): 433-443.e4, 2021 05 06.
Artigo em Inglês | MEDLINE | ID: mdl-33406388

RESUMO

Periplasmic binding proteins (PBPs) are ubiquitous receptors in gram-negative bacteria. They sense solutes and play key roles in nutrient uptake. Escherichia coli's putrescine receptor PotF has been reported to bind putrescine and spermidine. We reveal that several similar biogenic polyamines are recognized by PotF. Using isothermal titration calorimetry paired with X-ray crystallography of the different complexes, we unveil PotF's binding modes in detail. The binding site for PBPs is located between two lobes that undergo a large conformational change upon ligand recognition. Hence, analyzing the influence of ligands on complex formation is crucial. Therefore, we solved crystal structures of an open and closed apo state and used them as a basis for molecular dynamics simulations. In addition, we accessed structural behavior in solution for all complexes by 1H-15N HSQC NMR spectroscopy. This combined analysis provides a robust framework for understanding ligand binding for future developments in drug design and protein engineering.


Assuntos
Proteínas de Escherichia coli/química , Proteínas Periplásmicas de Ligação/química , Receptores de Amina Biogênica/química , Sítios de Ligação , Proteínas de Escherichia coli/metabolismo , Ligantes , Proteínas Periplásmicas de Ligação/metabolismo , Poliaminas/química , Poliaminas/metabolismo , Ligação Proteica , Receptores de Amina Biogênica/metabolismo
11.
Front Mol Biosci ; 8: 715972, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34485385

RESUMO

Modern proteins have been shown to share evolutionary relationships via subdomain-sized fragments. The assembly of such fragments through duplication and recombination events led to the complex structures and functions we observe today. We previously implemented a pipeline that identified more than 1,000 of these fragments that are shared by different protein folds and developed a web interface to analyze and search for them. This resource named Fuzzle helps structural and evolutionary biologists to identify and analyze conserved parts of a protein but it also provides protein engineers with building blocks for example to design proteins by fragment combination. Here, we describe a new version of this web resource that was extended to include ligand information. This addition is a significant asset to the database since now protein fragments that bind specific ligands can be identified and analyzed. Often the mode of ligand binding is conserved in proteins thereby supporting a common evolutionary origin. The same can now be explored for subdomain-sized fragments within this database. This ligand binding information can also be used in protein engineering to graft binding pockets into other protein scaffolds or to transfer functional sites via recombination of a specific fragment. Fuzzle 2.0 is freely available at https://fuzzle.uni-bayreuth.de/2.0.

12.
J Mol Biol ; 432(13): 3898-3914, 2020 06 12.
Artigo em Inglês | MEDLINE | ID: mdl-32330481

RESUMO

Natural evolution has generated an impressively diverse protein universe via duplication and recombination from a set of protein fragments that served as building blocks. The application of these concepts to the design of new proteins using subdomain-sized fragments from different folds has proven to be experimentally successful. To better understand how evolution has shaped our protein universe, we performed an all-against-all comparison of protein domains representing all naturally existing folds and identified conserved homologous protein fragments. Overall, we found more than 1000 protein fragments of various lengths among different folds through similarity network analysis. These fragments are present in very different protein environments and represent versatile building blocks for protein design. These data are available in our web server called F(old P)uzzle (fuzzle.uni-bayreuth.de), which allows to individually filter the dataset and create customized networks for folds of interest. We believe that our results serve as an invaluable resource for structural and evolutionary biologists and as raw material for the design of custom-made proteins.


Assuntos
Evolução Molecular , Dobramento de Proteína , Proteínas/química , Biologia Computacional , Internet , Modelos Moleculares , Domínios Proteicos/genética , Engenharia de Proteínas/tendências , Proteínas/genética , Proteínas/ultraestrutura , Homologia de Sequência de Aminoácidos , Software
14.
Sci Rep ; 9(1): 6076, 2019 Apr 10.
Artigo em Inglês | MEDLINE | ID: mdl-30967561

RESUMO

A correction to this article has been published and is linked from the HTML and PDF versions of this paper. The error has not been fixed in the paper.

15.
Curr Opin Chem Biol ; 47: 67-76, 2018 12.
Artigo em Inglês | MEDLINE | ID: mdl-30248579

RESUMO

The design of tailor-made enzymes is a major goal in biochemical research that can result in wide-range applications and will lead to a better understanding of how proteins fold and function. In this review we highlight recent advances in enzyme and small molecule binder design. A focus is placed on novel strategies for the design of scaffolds, developments in computational methods, and recent applications of these techniques on receptors, sensors, and enzymes. Further, the integration of computational and experimental methodologies is discussed. The outlined examples of designed enzymes and binders for various purposes highlight the importance of this topic and underline the need for tailor-made proteins.


Assuntos
Enzimas/química , Modelos Moleculares , Engenharia de Proteínas , Dobramento de Proteína
16.
Sci Rep ; 8(1): 897, 2018 01 17.
Artigo em Inglês | MEDLINE | ID: mdl-29343833

RESUMO

The recent increase in the number of X-ray crystal structures of G-protein coupled receptors (GPCRs) has been enabling for structure-based drug design (SBDD) efforts. These structures have revealed that GPCRs are highly dynamic macromolecules whose function is dependent on their intrinsic flexibility. Unfortunately, the use of static structures to understand ligand binding can potentially be misleading, especially in systems with an inherently high degree of conformational flexibility. Here, we show that docking a set of dopamine D3 receptor compounds into the existing eticlopride-bound dopamine D3 receptor (D3R) X-ray crystal structure resulted in poses that were not consistent with results obtained from site-directed mutagenesis experiments. We overcame the limitations of static docking by using large-scale high-throughput molecular dynamics (MD) simulations and Markov state models (MSMs) to determine an alternative pose consistent with the mutation data. The new pose maintains critical interactions observed in the D3R/eticlopride X-ray crystal structure and suggests that a cryptic pocket forms due to the shift of a highly conserved residue, F6.52. Our study highlights the importance of GPCR dynamics to understand ligand binding and provides new opportunities for drug discovery.


Assuntos
Receptores de Dopamina D3/antagonistas & inibidores , Receptores Acoplados a Proteínas G/química , Receptores Acoplados a Proteínas G/metabolismo , Animais , Sítios de Ligação/fisiologia , Linhagem Celular , Cristalografia por Raios X/métodos , Humanos , Ligantes , Simulação de Acoplamento Molecular/métodos , Simulação de Dinâmica Molecular , Mutagênese Sítio-Dirigida/métodos , Ligação Proteica/fisiologia , Salicilamidas/química , Salicilamidas/metabolismo , Células Sf9
17.
Nat Biotechnol ; 40(2): 171-172, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-35075248
18.
Mol Inform ; 35(6-7): 216-26, 2016 07.
Artigo em Inglês | MEDLINE | ID: mdl-27492236

RESUMO

Over the last years, researchers have increasingly become interested in measuring and understanding drugs' binding kinetics, namely the time in which drug and its target associate and dissociate. Historically, drug discovery programs focused on the optimization of target affinity as a proxy of in-vivo efficacy. However, often the efficacy of a ligand is not appropriately described by the in-vitro measured drug-receptor affinity, but rather depends on the lifetime of the in-vivo drug-receptor interaction. In this review we review recent works that highlight the importance of binding kinetics, molecular determinants for rational optimization and the recent emergence of computational methods as powerful tools in measuring and understanding binding kinetics.


Assuntos
Avaliação Pré-Clínica de Medicamentos , Animais , Humanos , Cinética , Termodinâmica
19.
Sci Rep ; 6: 30275, 2016 07 21.
Artigo em Inglês | MEDLINE | ID: mdl-27440438

RESUMO

Molecular recognition is rarely a two-body protein-ligand problem, as it often involves the dynamic interplay of multiple molecules that together control the binding process. Myo-inositol monophosphatase (IMPase), a drug target for bipolar disorder, depends on 3 Mg(2+) ions as cofactor for its catalytic activity. Although the crystallographic pose of the pre-catalytic complex is well characterized, the binding process by which substrate, cofactor and protein cooperate is essentially unknown. Here, we have characterized cofactor and substrate cooperative binding by means of large-scale molecular dynamics. Our study showed the first and second Mg(2+) ions identify the binding pocket with fast kinetics whereas the third ion presents a much higher energy barrier. Substrate binding can occur in cooperation with cofactor, or alone to a binary or ternary cofactor-IMPase complex, although the last scenario occurs several orders of magnitude faster. Our atomic description of the three-body mechanism offers a particularly challenging example of pathway reconstruction, and may prove particularly useful in realistic contexts where water, ions, cofactors or other entities cooperate and modulate the binding process.


Assuntos
Coenzimas/química , Inositol/química , Magnésio/química , Monoéster Fosfórico Hidrolases/química , Motivos de Aminoácidos , Sítios de Ligação , Cátions Bivalentes , Coenzimas/metabolismo , Humanos , Inositol/metabolismo , Cinética , Magnésio/metabolismo , Simulação de Dinâmica Molecular , Monoéster Fosfórico Hidrolases/metabolismo , Ligação Proteica , Conformação Proteica em alfa-Hélice , Conformação Proteica em Folha beta , Domínios e Motivos de Interação entre Proteínas , Especificidade por Substrato , Termodinâmica
20.
Clin Cancer Res ; 21(9): 2157-66, 2015 May 01.
Artigo em Inglês | MEDLINE | ID: mdl-25623215

RESUMO

PURPOSE: Patients with colorectal cancer who respond to the anti-EGFR antibody cetuximab often develop resistance within several months of initiating therapy. To design new lines of treatment, the molecular landscape of resistant tumors must be ascertained. We investigated the role of mutations in the EGFR signaling axis on the acquisition of resistance to cetuximab in patients and cellular models. EXPERIMENTAL DESIGN: Tissue samples were obtained from 37 patients with colorectal cancer who became refractory to cetuximab. Colorectal cancer cells sensitive to cetuximab were treated until resistant derivatives emerged. Mutational profiling of biopsies and cell lines was performed. Structural modeling and functional analyses were performed to causally associate the alleles to resistance. RESULTS: The genetic profile of tumor specimens obtained after cetuximab treatment revealed the emergence of a complex pattern of mutations in EGFR, KRAS, NRAS, BRAF, and PIK3CA genes, including two novel EGFR ectodomain mutations (R451C and K467T). Mutational profiling of cetuximab-resistant cells recapitulated the molecular landscape observed in clinical samples and revealed three additional EGFR alleles: S464L, G465R, and I491M. Structurally, these mutations are located in the cetuximab-binding region, except for the R451C mutant. Functionally, EGFR ectodomain mutations prevent binding to cetuximab but a subset is permissive for interaction with panitumumab. CONCLUSIONS: Colorectal tumors evade EGFR blockade by constitutive activation of downstream signaling effectors and through mutations affecting receptor-antibody binding. Both mechanisms of resistance may occur concomitantly. Our data have implications for designing additional lines of therapy for patients with colorectal cancer who relapse upon treatment with anti-EGFR antibodies.


Assuntos
Neoplasias Colorretais/genética , Resistencia a Medicamentos Antineoplásicos/genética , Genes erbB-1/genética , Mutação , Antineoplásicos/uso terapêutico , Western Blotting , Linhagem Celular Tumoral , Cetuximab/uso terapêutico , Neoplasias Colorretais/tratamento farmacológico , Análise Mutacional de DNA , Espaço Extracelular/genética , Citometria de Fluxo , Humanos , Reação em Cadeia da Polimerase em Tempo Real
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA