Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
1.
Nucleic Acids Res ; 48(D1): D277-D287, 2020 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-31612957

RESUMEN

DNAproDB (https://dnaprodb.usc.edu) is a web-based database and structural analysis tool that offers a combination of data visualization, data processing and search functionality that improves the speed and ease with which researchers can analyze, access and visualize structural data of DNA-protein complexes. In this paper, we report significant improvements made to DNAproDB since its initial release. DNAproDB now supports any DNA secondary structure from typical B-form DNA to single-stranded DNA to G-quadruplexes. We have updated the structure of our data files to support complex DNA conformations, multiple DNA-protein complexes within a DNAproDB entry and model indexing for analysis of ensemble data. Support for chemically modified residues and nucleotides has been significantly improved along with the addition of new structural features, improved structural moiety assignment and use of more sequence-based annotations. We have redesigned our report pages and search forms to support these enhancements, and the DNAproDB website has been improved to be more responsive and user-friendly. DNAproDB is now integrated with the Nucleic Acid Database, and we have increased our coverage of available Protein Data Bank entries. Our database now contains 95% of all available DNA-protein complexes, making our tools for analysis of these structures accessible to a broad community.


Asunto(s)
Proteínas de Unión al ADN/química , ADN/química , Bases de Datos Genéticas , Internet , Conformación de Ácido Nucleico , Conformación Proteica , Programas Informáticos , Interfaz Usuario-Computador
2.
Nucleic Acids Res ; 46(5): 2636-2647, 2018 03 16.
Artículo en Inglés | MEDLINE | ID: mdl-29390080

RESUMEN

Recognition of DNA by proteins depends on DNA sequence and structure. Often unanswered is whether the structure of naked DNA persists in a protein-DNA complex, or whether protein binding changes DNA shape. While X-ray structures of protein-DNA complexes are numerous, the structure of naked cognate DNA is seldom available experimentally. We present here an experimental and computational analysis pipeline that uses hydroxyl radical cleavage to map, at single-nucleotide resolution, DNA minor groove width, a recognition feature widely exploited by proteins. For 11 protein-DNA complexes, we compared experimental maps of naked DNA minor groove width with minor groove width measured from X-ray co-crystal structures. Seven sites had similar minor groove widths as naked DNA and when bound to protein. For four sites, part of the DNA in the complex had the same structure as naked DNA, and part changed structure upon protein binding. We compared the experimental map with minor groove patterns of DNA predicted by two computational approaches, DNAshape and ORChID2, and found good but not perfect concordance with both. This experimental approach will be useful in mapping structures of DNA sequences for which high-resolution structural data are unavailable. This approach allows probing of protein family-dependent readout mechanisms.


Asunto(s)
Proteínas de Unión al ADN/metabolismo , ADN/química , Sitios de Unión , ADN/metabolismo , Modelos Moleculares , Conformación de Ácido Nucleico , Nucleótidos/química , Unión Proteica
3.
Nucleic Acids Res ; 45(W1): W89-W97, 2017 07 03.
Artículo en Inglés | MEDLINE | ID: mdl-28431131

RESUMEN

Many biological processes are mediated by complex interactions between DNA and proteins. Transcription factors, various polymerases, nucleases and histones recognize and bind DNA with different levels of binding specificity. To understand the physical mechanisms that allow proteins to recognize DNA and achieve their biological functions, it is important to analyze structures of DNA-protein complexes in detail. DNAproDB is a web-based interactive tool designed to help researchers study these complexes. DNAproDB provides an automated structure-processing pipeline that extracts structural features from DNA-protein complexes. The extracted features are organized in structured data files, which are easily parsed with any programming language or viewed in a browser. We processed a large number of DNA-protein complexes retrieved from the Protein Data Bank and created the DNAproDB database to store this data. Users can search the database by combining features of the DNA, protein or DNA-protein interactions at the interface. Additionally, users can upload their own structures for processing privately and securely. DNAproDB provides several interactive and customizable tools for creating visualizations of the DNA-protein interface at different levels of abstraction that can be exported as high quality figures. All functionality is documented and freely accessible at http://dnaprodb.usc.edu.


Asunto(s)
Proteínas de Unión al ADN/química , ADN/química , Bases de Datos de Ácidos Nucleicos , Bases de Datos de Proteínas , Interfaz Usuario-Computador , Animales , Gráficos por Computador , ADN/genética , ADN/metabolismo , Proteínas de Unión al ADN/genética , Proteínas de Unión al ADN/metabolismo , Humanos , Almacenamiento y Recuperación de la Información , Modelos Moleculares , Conformación de Ácido Nucleico , Estructura Secundaria de Proteína
4.
Nucleic Acids Res ; 45(22): 12877-12887, 2017 Dec 15.
Artículo en Inglés | MEDLINE | ID: mdl-29165643

RESUMEN

Uncovering the mechanisms that affect the binding specificity of transcription factors (TFs) is critical for understanding the principles of gene regulation. Although sequence-based models have been used successfully to predict TF binding specificities, we found that including DNA shape information in these models improved their accuracy and interpretability. Previously, we developed a method for modeling DNA binding specificities based on DNA shape features extracted from Monte Carlo (MC) simulations. Prediction accuracies of our models, however, have not yet been compared to accuracies of models incorporating DNA shape information extracted from X-ray crystallography (XRC) data or Molecular Dynamics (MD) simulations. Here, we integrated DNA shape information extracted from MC or MD simulations and XRC data into predictive models of TF binding and compared their performance. Models that incorporated structural information consistently showed improved performance over sequence-based models regardless of data source. Furthermore, we derived and validated nine additional DNA shape features beyond our original set of four features. The expanded repertoire of 13 distinct DNA shape features, including six intra-base pair and six inter-base pair parameters and minor groove width, is available in our R/Bioconductor package DNAshapeR and enables a comprehensive structural description of the double helix on a genome-wide scale.


Asunto(s)
Algoritmos , Biología Computacional/métodos , ADN/química , Estudio de Asociación del Genoma Completo/métodos , Factores de Transcripción/química , Secuencia de Bases , Cristalografía por Rayos X , ADN/genética , ADN/metabolismo , Simulación de Dinámica Molecular , Método de Montecarlo , Conformación de Ácido Nucleico , Unión Proteica , Reproducibilidad de los Resultados , Factores de Transcripción/metabolismo
5.
Biochemistry ; 56(29): 3745-3753, 2017 07 25.
Artículo en Inglés | MEDLINE | ID: mdl-28644006

RESUMEN

FOXA2, a member of the forkhead family of transcription factors, plays essential roles in liver development and bile acid homeostasis. In this study, we report a 2.8 Å co-crystal structure of the FOXA2 DNA-binding domain (FOXA2-DBD) bound to a DNA duplex containing a forkhead consensus binding site (GTAAACA). The FOXA2-DBD adopts the canonical winged-helix fold, with helix H3 and wing 1 regions mainly mediating the DNA recognition. Although the wing 2 region was not defined in the structure, isothermal titration calorimetry assays suggested that this region was required for optimal DNA binding. Structure comparison with the FOXA3-DBD bound to DNA revealed more major groove contacts and fewer minor groove contacts in the FOXA2 structure than in the FOXA3 structure. Structure comparison with the FOXO1-DBD bound to DNA showed that different forkhead proteins could induce different DNA conformations upon binding to identical DNA sequences. Our findings provide the structural basis for FOXA2 protein binding to a consensus forkhead site and elucidate how members of the forkhead protein family bind different DNA sites.


Asunto(s)
ADN/química , Factor Nuclear 3-beta del Hepatocito/química , Motivos de Nucleótidos , Cristalografía por Rayos X , ADN/metabolismo , Factor Nuclear 3-beta del Hepatocito/metabolismo , Factor Nuclear 3-gamma del Hepatocito/química , Factor Nuclear 3-gamma del Hepatocito/metabolismo , Humanos , Unión Proteica , Dominios Proteicos , Homología Estructural de Proteína
6.
Nano Lett ; 13(6): 2851-6, 2013 Jun 12.
Artículo en Inglés | MEDLINE | ID: mdl-23713705

RESUMEN

We present a direct atom-by-atom chemical identification of the nanostructures and defects of topological insulators (TIs) with a state-of-the-art atomic mapping technology. Combining this technique and density function theory calculations, we identify and explain the layer chemistry evolution of Bi(2)Te(3­x)Se(x) ternary TIs. We also reveal a long neglected but crucially important extended defect found to be universally present in Bi(2)Te(3) films, the seven-layer Bi(3)Te(4) nanolamella acceptors. Intriguingly, this defect is found to locally pull down the conduction band, leading to local n-type conductivity, despite being an acceptor which pins the Fermi energy near the valence band maximum. This nanolamella may explain inconsistencies in measured conduction type as well as open up a new route to manipulate bulk carrier concentration. Our work may pave the way to more thoroughly understand and tailor the nature of the bulk, as well as secure controllable bulk states for future applications in quantum computing and dissipationless devices.

7.
bioRxiv ; 2024 Mar 02.
Artículo en Inglés | MEDLINE | ID: mdl-38529493

RESUMEN

The recognition and binding of nucleic acids (NAs) by proteins depends upon complementary chemical, electrostatic and geometric properties of the protein-NA binding interface. Structural models of protein-NA complexes provide insights into these properties but are scarce relative to models of unbound proteins. We present a deep learning approach for predicting protein-NA binding given the apo structure of a protein (PNAbind). Our method utilizes graph neural networks to encode spatial distributions of physicochemical and geometric properties of the protein molecular surface that are predictive of NA binding. Using global physicochemical encodings, our models predict the overall binding function of a protein and can discriminate between specificity for DNA or RNA binding. We show that such predictions made on protein structures modeled with AlphaFold2 can be used to gain mechanistic understanding of chemical and structural features that determine NA recognition. Using local encodings, our models predict the location of NA binding sites at the level of individual binding residues. Binding site predictions were validated against benchmark datasets, achieving AUROC scores in the range of 0.92-0.95. We applied our models to the HIV-1 restriction factor APOBEC3G and show that our predictions are consistent with experimental RNA binding data.

8.
J Mol Biol ; : 168546, 2024 Mar 18.
Artículo en Inglés | MEDLINE | ID: mdl-38508301

RESUMEN

IHMCIF (github.com/ihmwg/IHMCIF) is a data information framework that supports archiving and disseminating macromolecular structures determined by integrative or hybrid modeling (IHM), and making them Findable, Accessible, Interoperable, and Reusable (FAIR). IHMCIF is an extension of the Protein Data Bank Exchange/macromolecular Crystallographic Information Framework (PDBx/mmCIF) that serves as the framework for the Protein Data Bank (PDB) to archive experimentally determined atomic structures of biological macromolecules and their complexes with one another and small molecule ligands (e.g., enzyme cofactors and drugs). IHMCIF serves as the foundational data standard for the PDB-Dev prototype system, developed for archiving and disseminating integrative structures. It utilizes a flexible data representation to describe integrative structures that span multiple spatiotemporal scales and structural states with definitions for restraints from a variety of experimental methods contributing to integrative structural biology. The IHMCIF extension was created with the benefit of considerable community input and recommendations gathered by the Worldwide Protein Data Bank (wwPDB) Task Force for Integrative or Hybrid Methods (wwpdb.org/task/hybrid). Herein, we describe the development of IHMCIF to support evolving methodologies and ongoing advancements in integrative structural biology. Ultimately, IHMCIF will facilitate the unification of PDB-Dev data and tools with the PDB archive so that integrative structures can be archived and disseminated through PDB.

9.
bioRxiv ; 2023 Dec 16.
Artículo en Inglés | MEDLINE | ID: mdl-38293168

RESUMEN

Predicting specificity in protein-DNA interactions is a challenging yet essential task for understanding gene regulation. Here, we present Deep Predictor of Binding Specificity (DeepPBS), a geometric deep-learning model designed to predict binding specificity across protein families based on protein-DNA structures. The DeepPBS architecture allows investigation of different family-specific recognition patterns. DeepPBS can be applied to predicted structures, and can aid in the modeling of protein-DNA complexes. DeepPBS is interpretable and can be used to calculate protein heavy atom-level importance scores, demonstrated as a case-study on p53-DNA interface. When aggregated at the protein residue level, these scores conform well with alanine scanning mutagenesis experimental data. The inference time for DeepPBS is sufficiently fast for analyzing simulation trajectories, as demonstrated on a molecular-dynamics simulation of a Drosophila Hox-DNA tertiary complex with its cofactor. DeepPBS and its corresponding data resources offer a foundation for machine-aided protein-DNA interaction studies, guiding experimental choices and complex design, as well as advancing our understanding of molecular interactions.

10.
J Mol Biol ; 432(19): 5499-5508, 2020 09 04.
Artículo en Inglés | MEDLINE | ID: mdl-32681840

RESUMEN

MEF2 and NKX2-5 transcription factors interact with each other in cardiogenesis and are necessary for normal heart formation. Despite evidence suggesting that these two transcription factors function synergistically and possibly through direct physical interactions, molecular mechanisms by which they interact are not clear. Here we determined the crystal structures of ternary complexes of MEF2 and NKX2-5 bound to myocardin enhancer DNA in two crystal forms. These crystal structures are the first example of human MADS-box/homeobox ternary complex structures involved in cardiogenesis. Our structures reveal two possible modes of interactions between MEF2 and NKX2-5: MEF2 and NKX bind to adjacent DNA sites to recognize DNA in cis; and MEF2 and NKX bind to different DNA strands to interact with each other in trans via a conserved protein-protein interface observed in both crystal forms. Disease-related mutations are mapped to the observed protein-protein interface. Our structural studies provide a starting point to understand and further study the molecular mechanisms of the interactions between MEF2 and NKX2.5 and their roles in cardiogenesis.


Asunto(s)
ADN/metabolismo , Proteína Homeótica Nkx-2.5/metabolismo , Secuencia de Aminoácidos , Sitios de Unión , Cristalografía por Rayos X , ADN/química , Proteína Homeótica Nkx-2.5/química , Humanos , Factores de Transcripción MEF2/química , Factores de Transcripción MEF2/metabolismo , Simulación del Acoplamiento Molecular , Unión Proteica , Dominios y Motivos de Interacción de Proteínas , Mapas de Interacción de Proteínas
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA