Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 93
Filtrar
1.
Bioinformatics ; 38(15): 3833-3834, 2022 08 02.
Artigo em Inglês | MEDLINE | ID: mdl-35695777

RESUMO

MOTIVATION: The role of metals in shaping and functioning of RNA is a well-established fact, and the understanding of that through the analysis of structural data has biological relevance. Often metal ions bind to one or more atoms of the nucleobase of an RNA. This fact becomes more interesting when such bases form a base pair with any other base. Furthermore, when metal ions bind to any residue of an RNA, the secondary structural features of the residue (helix, loop, unpaired, etc.) are also biologically important. The available metal-binding-related software tools cannot address such type-specific queries. RESULTS: To fill this limitation, we have designed a software tool, called MetBP that meets the goal. This tool is a stand-alone command-line-based tool and has no dependency on the other existing software. It accepts a structure file in mmCIF or PDB format and computes the base pairs and thereafter reports all metals that bind to one or more nucleotides that form pairs with another. It reports binding distance, angles along with base pair stability. It also reports several other important aspects, e.g. secondary structure of the residue in the RNA. MetBP can be used as a generalized metal-binding site detection tool for Proteins and DNA as well. AVAILABILITY AND IMPLEMENTATION: https://github.com/computational-biology/metbp. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
RNA , Software , Pareamento de Bases , RNA/química , Estrutura Secundária de Proteína , Metais , Íons
2.
J Chem Phys ; 159(9)2023 Sep 07.
Artigo em Inglês | MEDLINE | ID: mdl-37655772

RESUMO

DNA deformability and differential hydration are crucial determinants of biological processes ranging from genetic material packaging to gene expression; their associative details, however, remain inadequately understood. Herein, we report investigations of the dynamic and thermodynamic responses of the local hydration of a variety of base pair sequences. Leveraging in silico sampling and our in-house analyses, we first report the local conformational propensity of sequences that are either predisposed toward the canonical A- or B-conformations or are restrained to potential transitory pathways. It is observed that the transition from the unrestrained A-form to the B-form leads to lengthwise structural deformation. The insertion of intermittent -(CG)- base pairs in otherwise homogeneous -(AT)- sequences bears dynamical consequences for the vicinal hydration layer. Calculation of the excess (pair) entropy suggests substantially higher values of hydration water surrounding A conformations over the B- conformations. Applying the Rosenfeld approximation, we project that the diffusivity of water molecules proximal to canonical B conformation is least for the minor groove of the canonical B-conformation. We determine that structure, composition, and conformation specific groove dimension together influence the local hydration characteristics and, therefore, are expected to be important determinants of biological processes.

3.
Proteins ; 90(7): 1390-1412, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35157344

RESUMO

According to the "jigsaw puzzle" model of protein folding, the isomorphism between sequence and structure is substantially determined by the specific geometry of side-chain interactions, within the protein interior. In this work, we have attempted to predict the hydrophobic core of cyclophilin (LdCyp) from Leishmania donovani, utilizing a surface complementarity function, which selects for high goodness of fit between hydrophobic side-chain surfaces, rather in the manner of assembling a three-dimensional jigsaw puzzle. The computational core prediction method implemented here has been tried on two distinct scenarios, on the LdCyp polypeptide chain with native non-core residues and all core residues initially set to alanine, on a poly-glycine polypeptide chain. Molecular dynamics simulations appeared to indicate partial destabilization of the two designed sequences. However, experimental characterization of the designed sequences by circular dichroism (CD) spectroscopy and denaturant (GdmCl) induced unfolding, demonstrated disordered proteins. Stepwise reconstruction of the designed cores by cumulative sequential mutations identified the specific mutation (M122L) as primarily responsible for fold collapse and all design objectives were achieved upon rectifying this mutation. In summary, the study demonstrates regions of the core to contain highly specific (jigsaw puzzle-like) interactions sensitive to any perturbations and a predictive algorithm to identify such regions. A mutation within the core has been identified which exercises an inordinate influence on the global fold, reminiscent of metamorphic proteins. In addition, the computational procedure could predict substantial regions of the core (given main-chain coordinates) without any reference to non-core residues.


Assuntos
Dobramento de Proteína , Proteínas , Dicroísmo Circular , Interações Hidrofóbicas e Hidrofílicas , Simulação de Dinâmica Molecular , Peptídeos
4.
Biopolymers ; 113(7): e23492, 2022 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-35615897

RESUMO

Specific recognition of DNA base sequences by proteins is vital for life-cycles of all organisms. In a large number of crystal structures of protein-DNA complexes, DNA conformation significantly deviates from the canonical B-DNA structure. A key question is whether such alternate conformations exist prior to protein binding and one is selected for complexation or the structure observed is induced by protein binding. Non-canonical base pairs, such as Hoogsteen base pairs, are often observed in crystal structures of protein-DNA complexes. We decided to explore whether the occurrence of such non-canonical base pairs in protein-DNA complexes is induced by the protein or is selected from pre-existing conformations. Detailed quantum chemical calculations with dispersion-corrected density functional theory (DFT-D) indicated that most of the non-canonical base pairs with DNA bases are stable even in the absence of the interacting amino acids. However, the G:G Hoogsteen base pair, which also appears in the telomere structure, appears to be unstable in the absence of other stabilizing agents, such as positively charged amino acids. Thus, the stability of many of the non-canonical base pair containing duplexes may be close to the canonical B-DNA structure and hence energetically accessible in the ground state; suggesting that the selection from pre-existing conformations may be an important mechanism for observed non-canonical base pairs in protein-DNA complexes.


Assuntos
DNA de Forma B , Aminoácidos , Pareamento de Bases , DNA/química , Ligação de Hidrogênio , Conformação de Ácido Nucleico
5.
J Comput Aided Mol Des ; 36(2): 131-140, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-35059942

RESUMO

Base pairing in RNA are significantly rich and versatile due to the potential non-canonical base pairing amongst nucleotides. Not only that, one base in RNA can pair with more than one bases simultaneously. This opens up a new dimension of research to detect such types of base-base pair networks in RNA and to analyze them. Even if a base do not form a pair, it may have significant extent of [Formula: see text]-[Formula: see text] stacking overlap that can stabilize the structures. In this work, we report a software tool, called BPNet, that accepts a mmCIF or PDB file and computes the base-pair/[Formula: see text]-[Formula: see text] contact network components using graph formalism. The software can run on Linux platform in both serial and parallel modes. It generates several information in suitable file formats for visualization of the networks. This paper describes the BPNet software and also presents some interesting results obtained by analyzing several RNA structures by the software to show its effectiveness.


Assuntos
Biologia Computacional , RNA , Pareamento de Bases , Ligação de Hidrogênio , Conformação de Ácido Nucleico , RNA/química
6.
RNA ; 25(5): 573-589, 2019 05.
Artigo em Inglês | MEDLINE | ID: mdl-30792229

RESUMO

Identification and characterization of base-multiplets, which are essentially mediated by base-pairing interactions, can provide insights into the diversity in the structure and dynamics of complex functional RNAs, and thus facilitate hypothesis driven biological research. The necessary nomenclature scheme, an extension of the geometric classification scheme for base-pairs by Leontis and Westhof, is however available only for base-triplets. In the absence of information on topology, this scheme is not applicable to quartets and higher order multiplets. Here we propose a topology-based classification scheme which, in conjunction with a graph-based algorithm, can be used for the automated identification and characterization of higher order base-multiplets in RNA structures. Here, the RNA structure is represented as a graph, where nodes represent nucleotides and edges represent base-pairing connectivity. Sets of connected components (of n nodes) within these graphs constitute subgraphs representing multiplets of "n" nucleotides. The different topological variants of the RNA multiplets thus correspond to different nonisomorphic forms of these subgraphs. To annotate RNA base-multiplets unambiguously, we propose a set of topology-based nomenclature rules for quartets, which are extendable to higher multiplets. We also demonstrate the utility of our approach toward the identification and annotation of higher order RNA multiplets, by investigating the occurrence contexts of selected examples in order to gain insights regarding their probable functional roles.


Assuntos
Algoritmos , Conformação de Ácido Nucleico , Nucleotídeos/química , RNA Bacteriano/química , RNA Fúngico/química , Pareamento de Bases , Biologia Computacional/métodos , Ligação de Hidrogênio , Nucleotídeos/genética , Nucleotídeos/metabolismo , RNA Bacteriano/genética , RNA Bacteriano/metabolismo , RNA Fúngico/genética , RNA Fúngico/metabolismo , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Software , Thermus thermophilus/genética , Thermus thermophilus/metabolismo
7.
J Comput Aided Mol Des ; 34(1): 83-95, 2020 01.
Artigo em Inglês | MEDLINE | ID: mdl-31820194

RESUMO

DNA is an essential target for the treatment of various pathologies, especially cancer. Hence targeting DNA double helix for alteration of its function has been attempted by several ways. Drug-DNA intercalation, one such biophysical process, could not be studied extensively as this requires significant deformation of the receptor DNA. Here we report thorough theoretical investigation of intercalation process in daunomycin-DNA interaction, by performing molecular dynamics simulations of the drug-DNA complexes for various DNA sequences, followed by Free-energy analysis and density functional theory (DFT) based studies to understand the binding preference. The classical energy based analyses indicate that the drug prefers to bind to TC/GA sequence over others. The DFT based energies of supra-molecular complexes are always contaminated with basis set superposition error (BSSE), which can be corrected by counterpoise method. This method is quite effective for systems containing two molecular fragments but is not appropriate for studying interaction between two base pair fragments and the drug intercalated between them. We have adopted an extension of the counterpoise method for BSSE corrected interaction energy calculation. These interaction energies, along with the energy penalty due to un-stacking of the base pairs, also indicate TC/GA sequence is the most preferred sequence for binding.


Assuntos
Antibióticos Antineoplásicos/farmacologia , DNA/química , Daunorrubicina/farmacologia , Substâncias Intercalantes/farmacologia , Teoria da Densidade Funcional , Simulação de Dinâmica Molecular , Conformação de Ácido Nucleico/efeitos dos fármacos , Termodinâmica
8.
Phys Chem Chem Phys ; 21(38): 21549-21560, 2019 Oct 02.
Artigo em Inglês | MEDLINE | ID: mdl-31536074

RESUMO

Repetitive cytosine rich i-motif forming sequences are abundant in the telomere, centromere and promoters of several oncogenes and in some instances are known to regulate transcription and gene expression. The in vivo existence of i-motif structures demands further insight into the factors affecting their formation and stability and development of better understanding of their gene regulatory functions. Most prior studies characterizing the conformational dynamics of i-motifs are based on i-motif forming synthetic constructs. Here, we present a systematic study on the stability and structural properties of biologically relevant i-motifs of telomeric and centromeric repeat fragments. Our results based on molecular dynamics simulations and quantum chemical calculations indicate that along with base pairing interactions within the i-motif core the overall folded conformation is associated with the stable C-HO sugar "zippers" in the narrow grooves and structured water molecules along the wide grooves. The stacked geometry of the hemi-protonated cytosine pairs within the i-motif core is mainly governed by the repulsive base stacking interaction. The loop sequence can affect the structural dynamics of the i-motif by altering the loop motion and backbone conformation. Overall this study provides microscopic insight into the i-motif structure that will be helpful to understand the structural aspect of mechanisms of gene regulation by i-motif DNA.


Assuntos
DNA/química , Substâncias Intercalantes/química , Motivos de Nucleotídeos , Solventes/química , Telômero/química , Pareamento de Bases , Citosina/química , Ligação de Hidrogênio , Simulação de Dinâmica Molecular
9.
Biochim Biophys Acta Gen Subj ; 1862(9): 1883-1892, 2018 09.
Artigo em Inglês | MEDLINE | ID: mdl-29902552

RESUMO

BACKGROUND: Human Chronic and Acute Myeloid Leukemia are myeloproliferative disorders in myeloid lineage of blood cells characterized by accumulation of aberrant white blood cells. In cancer, the anomalous transcriptome includes deregulated expression of non-coding RNAs in conjunction with protein-coding mRNAs in human genome. The coding or non-coding RNA transcripts harboring miRNA-binding sites can converse with and regulate each other by explicitly contending for a limited pool of shared miRNAs and act as competitive endogenous RNAs (ceRNAs). An unifying hypothesis attributing 'modulation of expression of transcripts' in this fashion had been defined as 'competitive endogenous RNA hypothesis'. Network built with ceRNAs evidently offers a platform to elucidate complex regulatory interactions at post-transcriptional level in human cancers. METHODS: Contemplating cancers of human myeloid lineage we constructed ceRNA networks for CML and AML coding and non-coding repertoire utilizing patient sample data. Through functional enrichment analysis we selected the significant functional modules for transcripts being differentially expressed in Blastic phases of each cancer types with respect to Normal. After retrieving free energy of binding and duplex formation of shared miRNAs on ceRNAs, we performed statistical averaging of energy values over the ensemble of populations considering cellular system as in canonical (Iso-thermal) situation. RESULTS AND CONCLUSIONS: We aimed to shed light on 'Sibling Rivalry' in ceRNA partners from the perspective of statistical thermodynamics, identified major cross-talking tracks and ceRNAs influencing transcripts concerned in myeloid cancer systems. GENERAL SIGNIFICANCE: Insights into ceRNA-regulation will shed light on progression and prognosis of human Chronic and Acute Myeloid Leukemia.


Assuntos
Biomarcadores Tumorais/genética , Redes Reguladoras de Genes , Leucemia Mielogênica Crônica BCR-ABL Positiva/genética , Leucemia Mieloide Aguda/genética , MicroRNAs/genética , RNA Longo não Codificante/genética , RNA Mensageiro/genética , Estudos de Casos e Controles , Bases de Dados Factuais , Regulação Neoplásica da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Transcriptoma
10.
Phys Chem Chem Phys ; 20(34): 21934-21948, 2018 Aug 29.
Artigo em Inglês | MEDLINE | ID: mdl-30088497

RESUMO

Metal ions are crucial for folding and function of noncoding RNAs. The fact that RNAs have very specific metal ion binding motifs further implies that contribution of metal ions (like Mg2+) in RNA's folding is not limited to simple compensation of electrostatic repulsions. Rather, their binding to RNA is driven by very specific contextual requirements. Elucidation of such factors is necessary for a comprehensive understanding of the sequence-structure-function paradigm in RNA. In this work, we have studied the consequences of Mg2+ binding on the geometry and stability of different noncanonical base pairs that shape up the complex structural landscape of RNA. Our results show that majority of the Mg2+ bound nucleobases are also part of a base pair. Interestingly, such base pairs belong only to a specific set of base pairing geometries. Out of them, we are able to identify 14 unique cases for which the native base pairing geometries are unstable under gas phase geometry optimization carried out in the absence of Mg2+ binding. Our density functional theory based calculations, performed using dispersion corrected M05-2X functional, suggest that, depending on its mode of binding, Mg2+ can stabilize and even fine tune a number of such base pairing geometries. These findings not only provide insights into how metal ions modulate the structure and dynamics of RNA molecules, they also provide a basis for improving the RNA structure prediction algorithms.


Assuntos
Magnésio/metabolismo , RNA/metabolismo , Pareamento de Bases , Ligação de Hidrogênio , Íons/química , Magnésio/química , Conformação de Ácido Nucleico , Teoria Quântica , RNA/química , Estabilidade de RNA
11.
Biophys J ; 113(2): 277-289, 2017 Jul 25.
Artigo em Inglês | MEDLINE | ID: mdl-28506525

RESUMO

Reverse Watson-Crick G:C basepairs (G:C W:W Trans) occur frequently in different functional RNAs. This is one of the few basepairs whose gas-phase-optimized isolated geometry is inconsistent with the corresponding experimental geometry. Several earlier studies indicate that through post-transcriptional modification, direct protonation, or coordination with Mg2+, accumulation of positive charge near N7 of guanine can stabilize the experimental geometry. Interestingly, recent studies reveal significant variation in the position of putatively bound Mg2+. This, in conjunction with recently raised doubts regarding some of the Mg2+ assignments near the imino nitrogen of guanine, is suggestive of the existence of multiple Mg2+ binding modes for this basepair. Our detailed investigation of Mg2+-bound G:C W:W Trans pairs occurring in high-resolution RNA crystal structures shows that they are found in 14 different contexts, eight of which display Mg2+ binding at the Hoogsteen edge of guanine. Further examination of occurrences in these eight contexts led to the characterization of three different Mg2+ binding modes: 1) direct binding via N7 coordination, 2) direct binding via O6 coordination, and 3) binding via hydrogen-bonding interaction with the first-shell water molecules. In the crystal structures, the latter two modes are associated with a buckled and propeller-twisted geometry of the basepair. Interestingly, respective optimized geometries of these different Mg2+ binding modes (optimized using six different DFT functionals) are consistent with their corresponding experimental geometries. Subsequent interaction energy calculations at the MP2 level, and decomposition of its components, suggest that for G:C W:W Trans , Mg2+ binding can fine tune the basepair geometries without compromising with their stability. Our results, therefore, underline the importance of the mode of binding of Mg2+ ions in shaping RNA structure, folding and function.


Assuntos
Pareamento de Bases/fisiologia , Citosina/metabolismo , Guanina/metabolismo , Magnésio/metabolismo , Dobramento de RNA/fisiologia , RNA/metabolismo , Bactérias , Cátions Bivalentes/química , Cátions Bivalentes/metabolismo , Citosina/química , Guanina/química , Ligação de Hidrogênio , Magnésio/química , Modelos Genéticos , Modelos Moleculares , RNA/química , Estabilidade de RNA/fisiologia , Água/química
12.
J Comput Aided Mol Des ; 31(2): 219-235, 2017 02.
Artigo em Inglês | MEDLINE | ID: mdl-28102461

RESUMO

Comprehensive analyses of structural features of non-canonical base pairs within a nucleic acid double helix are limited by the availability of a small number of three dimensional structures. Therefore, a procedure for model building of double helices containing any given nucleotide sequence and base pairing information, either canonical or non-canonical, is seriously needed. Here we describe a program RNAHelix, which is an updated version of our widely used software, NUCGEN. The program can regenerate duplexes using the dinucleotide step and base pair orientation parameters for a given double helical DNA or RNA sequence with defined Watson-Crick or non-Watson-Crick base pairs. The original structure and the corresponding regenerated structure of double helices were found to be very close, as indicated by the small RMSD values between positions of the corresponding atoms. Structures of several usual and unusual double helices have been regenerated and compared with their original structures in terms of base pair RMSD, torsion angles and electrostatic potentials and very high agreements have been noted. RNAHelix can also be used to generate a structure with a sequence completely different from an experimentally determined one or to introduce single to multiple mutation, but with the same set of parameters and hence can also be an important tool in homology modeling and study of mutation induced structural changes.


Assuntos
Modelos Moleculares , Ácidos Nucleicos/química , Pareamento de Bases , Conformação de Ácido Nucleico , Eletricidade Estática
13.
Phys Chem Chem Phys ; 19(42): 28718-28730, 2017 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-29043327

RESUMO

The G:A basepair, stabilized by hydrogen bonding through the sugar edge of guanine and Hoogsteen edge of adenine in trans orientation (G:A S:HT), appears very frequently in the solved RNA structures and is very stable. We have carried out stacking energy analyses of two sequences, namely C:G W:WC::G:A S:HT and G:C W:WC::G:A S:HT (':' represents basepairing and '::' represents stacking interactions), formed by this non-Watson-Crick basepair, by DFT-D. We have scanned nearly the complete phase space by modeling structures of these sequences using different values of the basepair orientation parameters to determine the most stable conformations. It is found that the most stable conformations are rather far from the most frequently found orientations. We have considered the effect of sugar-phosphate backbone connectivity as an energy penalty arising from deformation of pseudo bond lengths between C1' atoms of successive bases along the strands. Augmentation of stacking energy from DFT-D by this coarse grain energy gives predicted structures extremely similar to the experimentally determined ones. It has been observed that the best stacking with small twist values is associated with positive roll and negative slide values, which are similar to their values in A-RNA structures for most sequences. Among the two base pair steps, C:G W:WC::G:A S:HT appears to be more stable in terms of stacking energy as compared to G:C W:WC::G:A S:HT possibly due to larger stacking overlap in the former one.

14.
Phys Chem Chem Phys ; 19(22): 14781-14792, 2017 Jun 07.
Artigo em Inglês | MEDLINE | ID: mdl-28548177

RESUMO

Most genes are regulated by multiple transcription factors, often assembling into multi-protein complexes in the gene regulatory region. Understanding of the molecular origin of specificity of gene regulatory complex formation in the context of the whole genome is currently inadequate. A phage transcription factor λ-CI forms repressive multi-protein complexes by binding to multiple binding sites in the genome to regulate the lifecycle of the phage. The protein-protein interaction between two DNA-bound λ-CI molecules is stronger when they are bound to the correct pair of binding sites, suggesting allosteric transmission of recognition of correct DNA sequences to the protein-protein interaction interface. Exploration of conformation and dynamics by time-resolved fluorescence anisotropy decay and molecular dynamics suggests a change in protein dynamics to be a crucial factor in mediating allostery. A lattice-based model suggests that DNA-sequence induced allosteric effects could be crucial underlying factors in differentially stabilizing the correct site-specific gene regulatory complexes. We conclude that transcription factors have evolved multiple mechanisms to augment the specificity of DNA-protein interactions in order to achieve an extraordinarily high degree of spatial and temporal specificities of gene regulatory complexes, and DNA-sequence induced allostery plays an important role in the formation of sequence-specific gene regulatory complexes.


Assuntos
Proteínas de Ligação a DNA/química , DNA/química , Conformação Proteica , Fatores de Transcrição , Sequência de Bases , DNA/metabolismo , Polarização de Fluorescência , Simulação de Dinâmica Molecular , Ligação Proteica
15.
Biopolymers ; 105(2): 83-99, 2016 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-26448055

RESUMO

The telomeric G-quadruplexes for their unique structural features are considered as potential anticancer drug targets. These, however, exhibit structural polymorphism as different topology types for the intra-molecular G-quadruplexes from human telomeric G-rich sequences have been reported based on NMR spectroscopy and X-ray crystallography. These techniques provide detailed atomic-level information about the molecule but relative conformational stability of the different topologies remains unsolved. Therefore, to understand the conformational preference, we have carried out quantum chemical calculations on G-quartets; used all-atom molecular dynamics (MD) simulations and steered molecular dynamics (SMD) simulations to characterize the four human telomeric G-quadruplex topologies based on its G-tetrad core-types, viz., parallel, anti-parallel, mixed-(3 + 1)-form1 and mixed-(3 + 1)-form2. We have also studied a non-telomeric sequence along with these telomeric forms giving a comparison between the two G-rich forms. The structural properties such as base pairing, stacking geometry and backbone conformations have been analyzed. The quantum calculations indicate that presence of a sodium ion inside the G-tetrad plane or two potassium ions on both sides of the plane give it an overall planarity which is much needed for good stacking to form a helix. MD simulations indicate that capping of the G-tetrad core by the TTA loops keep the terminal guanine bases away from water. The SMD simulations along with equilibrium MD studies indicate that the parallel and non-telomeric forms are comparatively less stable. We could come to the conclusion that the anti-parallel form and also the mixed-(3 + 1)-form1 topology are most likely to represent the major conformation., 2016. © 2015 Wiley Periodicals, Inc. Biopolymers 105: 83-99, 2016.

16.
Biopolymers ; 105(4): 212-26, 2016 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-26600167

RESUMO

Stacking interaction between the aromatic heterocyclic bases plays an important role in the double helical structures of nucleic acids. Considering the base as rigid body, there are total of 18 degrees of freedom of a dinucleotide step. Some of these parameters show sequence preferences, indicating that the detailed atomic interactions are important in the stacking. Large variants of non-canonical base pairs have been seen in the crystallographic structures of RNA. However, their stacking preferences are not thoroughly deciphered yet from experimental results. The current theoretical approaches use either the rigid body degrees of freedom where the atomic information are lost or computationally expensive all atom simulations. We have used a hybrid simulation approach incorporating Monte-Carlo Metropolis sampling in the hyperspace of 18 stacking parameters where the interaction energies using AMBER-parm99bsc0 and CHARMM-36 force-fields were calculated from atomic positions. We have also performed stacking energy calculations for structures from Monte-Carlo ensemble by Dispersion corrected density functional theory. The available experimental data with Watson-Crick base pairs are compared to establish the validity of the method. Stacking interaction involving A:U and G:C base pairs with non-canonical G:U base pairs also were calculated and showed that these structures were also sequence dependent. This approach could be useful to generate multiscale modeling of nucleic acids in terms of coarse-grained parameters where the atomic interactions are preserved. This method would also be useful to predict structure and dynamics of different base pair steps containing non Watson-Crick base pairs, as found often in the non-coding RNA structures. © 2015 Wiley Periodicals, Inc. Biopolymers 105: 212-226, 2016.


Assuntos
Pareamento de Bases , Ligação de Hidrogênio , Modelos Moleculares , Conformação de Ácido Nucleico , Teoria Quântica
17.
Phys Chem Chem Phys ; 18(31): 21618-28, 2016 Aug 21.
Artigo em Inglês | MEDLINE | ID: mdl-27426617

RESUMO

Recognition of multiple functional DNA sequences by a DNA-binding protein occurs widely in nature. The physico-chemical basis of this phenomenon is not well-understood. The E. coli gal repressor, a gene regulatory protein, binds two homologous but non-identical sixteen basepair sequences in the gal operon and interacts by protein-protein interaction to regulate gene expression. The two sites have nearly equal affinities for the Gal repressor. Spectroscopic studies of the Gal repressor bound to these two different DNA sequences detected significant conformational differences between them. Comprehensive single base-substitution and binding measurements were carried out on the two sequences to understand the nature of the two protein-DNA interfaces. Magnitudes of basepair-protein interaction energy show significant variation between homologous positions of the two DNA sequences. Magnitudes of variation are such that when summed over the whole sequence they largely cancel each other out, thus producing nearly equal net affinity. Modeling suggests significant alterations in the protein-DNA interface in the two complexes, which are consistent with conformational adaptation of the protein to different DNA sequences. The functional role of the two sequences was studied by substitution of one site by the other and vice versa. In both cases, substitution reduces repression in vivo. This suggests that naturally occurring DNA sequence variations play functional roles beyond merely acting as high-affinity anchoring points. We propose that two different pre-existing conformations in the conformational ensemble of the free protein are selected by two different DNA sequences for efficient sequence read-out and the conformational difference of the bound proteins leads to different functional roles.


Assuntos
DNA Bacteriano , Proteínas de Ligação a DNA , Sítios de Ligação , Escherichia coli , Proteínas de Escherichia coli , Conformação de Ácido Nucleico
18.
Biopolymers ; 103(3): 134-47, 2015 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-25257334

RESUMO

Understanding dinucleotide sequence directed structures of nuleic acids and their variability from experimental observation remained ineffective due to unavailability of statistically meaningful data. We have attempted to understand this from energy scan along twist, roll, and slide degrees of freedom which are mostly dependent on dinucleotide sequence using ab initio density functional theory. We have carried out stacking energy analysis in these dinucleotide parameter phase space for all ten unique dinucleotide steps in DNA and RNA using DFT-D by ωB97X-D/6-31G(2d,2p), which appears to satisfactorily explain conformational preferences for AU/AU step in our recent study. We show that values of roll, slide, and twist of most of the dinucleotide sequences in crystal structures fall in the low energy region. The minimum energy regions with large twist values are associated with the roll and slide values of B-DNA, whereas, smaller twist values correspond to higher stability to RNA and A-DNA like conformations. Incorporation of solvent effect by CPCM method could explain the preference shown by some sequences to occur in B-DNA or A-DNA conformations. Conformational preference of BII sub-state in B-DNA is preferentially displayed mainly by pyrimidine-purine steps and partly by purine-purine steps. The purine-pyrimidine steps show largest effect of 5-methyl group of thymine in stacking energy and the introduction of solvent reduces this effect significantly. These predicted structures and variabilities can explain the effect of sequence on DNA and RNA functionality.


Assuntos
DNA/química , Nucleotídeos/química , RNA/química , Pareamento de Bases , Conformação de Ácido Nucleico , Termodinâmica
19.
Biopolymers ; 103(6): 328-38, 2015 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-25652776

RESUMO

Emergence of thousands of crystal structures of noncoding RNA molecules indicates its structural and functional diversity. RNA function is based upon a large variety of structural elements which are specifically assembled in the folded molecules. Along with the canonical Watson-Crick base pairs, different orientations of the bases to form hydrogen-bonded non-canonical base pairs have also been observed in the available RNA structures. Frequencies of occurrences of different non-canonical base pairs in RNA indicate their important role to maintain overall structure and functions of RNA. There are several reports on geometry and energetic stabilities of these non-canonical base pairs. However, their stacking geometry and stacking stability with the neighboring base pairs are not well studied. Among the different non-canonical base pairs, the G:U wobble base pair (G:U W:WC) is most frequently observed in the RNA double helices. Using quantum chemical method and available experimental data set we have studied the stacking geometry of G:U W:WC base pair containing dinucleotide sequences in roll-slide parameters hyperspace for different values of twist. This study indicates that the G:U W:WC base pair can stack well with the canonical base pairs giving rise to large interaction energy. The overall preferred stacking geometry in terms of roll, twist and slide for the eleven possible dinucleotide sequences is seen to be quite dependent on their sequences.


Assuntos
Pareamento de Bases/fisiologia , RNA/química , Ligação de Hidrogênio , Conformação de Ácido Nucleico
20.
J Comput Aided Mol Des ; 29(3): 283-95, 2015 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-25575717

RESUMO

Gene expression generally initiates from recognition of TATA-box binding protein (TBP) to the minor groove of DNA of TATA box sequence where the DNA structure is significantly different from B-DNA. We have carried out molecular dynamics simulation studies of TBP-DNA system to understand how the DNA structure alters for efficient binding. We observed rigid nature of the protein while the DNA of TATA box sequence has an inherent flexibility in terms of bending and minor groove widening. The bending analysis of the free DNA and the TBP bound DNA systems indicate presence of some similar structures. Principal coordinate ordination analysis also indicates some structural features of the protein bound and free DNA are similar. Thus we suggest that the DNA of TATA box sequence regularly oscillates between several alternate structures and the one suitable for TBP binding is induced further by the protein for proper complex formation.


Assuntos
TATA Box , Proteína de Ligação a TATA-Box/química , Proteína de Ligação a TATA-Box/metabolismo , Sítios de Ligação , Simulação de Dinâmica Molecular , Conformação de Ácido Nucleico , Análise de Componente Principal , Estabilidade Proteica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA