Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 51
Filter
Add more filters










Publication year range
1.
Brief Bioinform ; 25(4)2024 May 23.
Article in English | MEDLINE | ID: mdl-38856168

ABSTRACT

Nucleic acid-binding proteins (NABPs), including DNA-binding proteins (DBPs) and RNA-binding proteins (RBPs), play important roles in essential biological processes. To facilitate functional annotation and accurate prediction of different types of NABPs, many machine learning-based computational approaches have been developed. However, the datasets used for training and testing as well as the prediction scopes in these studies have limited their applications. In this paper, we developed new strategies to overcome these limitations by generating more accurate and robust datasets and developing deep learning-based methods including both hierarchical and multi-class approaches to predict the types of NABPs for any given protein. The deep learning models employ two layers of convolutional neural network and one layer of long short-term memory. Our approaches outperform existing DBP and RBP predictors with a balanced prediction between DBPs and RBPs, and are more practically useful in identifying novel NABPs. The multi-class approach greatly improves the prediction accuracy of DBPs and RBPs, especially for the DBPs with ~12% improvement. Moreover, we explored the prediction accuracy of single-stranded DNA binding proteins and their effect on the overall prediction accuracy of NABP predictions.


Subject(s)
Computational Biology , DNA-Binding Proteins , Deep Learning , RNA-Binding Proteins , RNA-Binding Proteins/metabolism , DNA-Binding Proteins/metabolism , Computational Biology/methods , Neural Networks, Computer , Humans
2.
Toxicol Res (Camb) ; 13(3): tfae072, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38737339

ABSTRACT

Lead (Pb) is a nonessential heavy metal, which can cause many health problems. Isochlorogenic acid A (ICAA), a phenolic acid present in tea, fruits, vegetables, coffee, plant-based food products, and various medicinal plants, exerts multiple effects, including anti-oxidant, antiviral, anti-inflammatory and antifibrotic functions. Thus, the purpose of our study was to determine if ICAA could prevent Pb-induced hepatotoxicity in ICR mice. An evaluation was performed on oxidative stress, inflammation and fibrosis, and related signaling. The results indicate that ICAA attenuates Pb-induced abnormal liver function. ICAA reduced liver fibrosis, inflammation and oxidative stress caused by Pb. ICAA abated Pb-induced fibrosis and decreased inflammatory cytokines interleukin-1ß (IL-1ß) and tumor necrosis factor-alpha (TNF-α). ICAA abrogated reductions in activities of superoxide dismutase (SOD), catalase (CAT), and glutathione peroxidase (GPx). Masson staining revealed that ICAA reduced collagen fiber deposition in Pb-induced fibrotic livers. Western blot and immunohistochemistry analyses showed ICAA increased phosphorylated AMP-activated protein kinase (p-AMPK) expression. ICAA also reduced the expression of collagen I, α-smooth muscle actin (α-SMA), phosphorylated extracellular signal-regulated kinase (p-ERK), phosphorylated c-jun N-terminal kinase (p-JNK), p-p38, phosphorylated signal transducer and phosphorylated activator of transcription 3 (p-STAT3), transforming growth factor ß1 (TGF-ß1), and p-Smad2/3 in livers of mice. Overall, ICAA ameliorates Pb-induced hepatitis and fibrosis by inhibiting the AMPK/MAPKs/NF-κB and STAT3/TGF-ß1/Smad2/3 pathways.

3.
Proteins ; 91(8): 1077-1088, 2023 08.
Article in English | MEDLINE | ID: mdl-36978156

ABSTRACT

Computational modeling of protein-DNA complex structures has important implications in biomedical applications such as structure-based, computer aided drug design. A key step in developing methods for accurate modeling of protein-DNA complexes is similarity assessment between models and their reference complex structures. Existing methods primarily rely on distance-based metrics and generally do not consider important functional features of the complexes, such as interface hydrogen bonds that are critical to specific protein-DNA interactions. Here, we present a new scoring function, ComparePD, which takes interface hydrogen bond energy and strength into account besides the distance-based metrics for accurate similarity measure of protein-DNA complexes. ComparePD was tested on two datasets of computational models of protein-DNA complexes generated using docking (classified as easy, intermediate, and difficult cases) and homology modeling methods. The results were compared with PDDockQ, a modified version of DockQ tailored for protein-DNA complexes, as well as the metrics employed by the community-wide experiment CAPRI (Critical Assessment of PRedicted Interactions). We demonstrated that ComparePD provides an improved similarity measure over PDDockQ and the CAPRI classification method by considering both conformational similarity and functional importance of the complex interface. ComparePD identified more meaningful models as compared to PDDockQ for all the cases having different top models between ComparePD and PDDockQ except for one intermediate docking case.


Subject(s)
Protein Interaction Mapping , Proteins , Protein Interaction Mapping/methods , Proteins/chemistry , Protein Binding , Protein Conformation , Hydrogen Bonding , Benchmarking , Algorithms , Computational Biology/methods , Software , Molecular Docking Simulation
4.
Biomolecules ; 12(9)2022 08 26.
Article in English | MEDLINE | ID: mdl-36139026

ABSTRACT

Single-stranded DNA (ssDNA) binding proteins (SSBs) are critical in maintaining genome stability by protecting the transient existence of ssDNA from damage during essential biological processes, such as DNA replication and gene transcription. The single-stranded region of telomeres also requires protection by ssDNA binding proteins from being attacked in case it is wrongly recognized as an anomaly. In addition to their critical roles in genome stability and integrity, it has been demonstrated that ssDNA and SSB-ssDNA interactions play critical roles in transcriptional regulation in all three domains of life and viruses. In this review, we present our current knowledge of the structure and function of SSBs and the structural features for SSB binding specificity. We then discuss the machine learning-based approaches that have been developed for the prediction of SSBs from double-stranded DNA (dsDNA) binding proteins (DSBs).


Subject(s)
DNA, Single-Stranded , DNA-Binding Proteins , DNA/chemistry , DNA-Binding Proteins/metabolism , Genomic Instability , Humans , Machine Learning , Protein Binding
5.
Proteins ; 90(6): 1303-1314, 2022 06.
Article in English | MEDLINE | ID: mdl-35122321

ABSTRACT

Hydrogen bonds play important roles in protein folding and protein-ligand interactions, particularly in specific protein-DNA recognition. However, the distributions of hydrogen bonds, especially hydrogen bond energy (HBE) in different types of protein-ligand complexes, is unknown. Here we performed a comparative analysis of hydrogen bonds among three non-redundant datasets of protein-protein, protein-peptide, and protein-DNA complexes. Besides comparing the number of hydrogen bonds in terms of types and locations, we investigated the distributions of HBE. Our results indicate that while there is no significant difference of hydrogen bonds within protein chains among the three types of complexes, interfacial hydrogen bonds are significantly more prevalent in protein-DNA complexes. More importantly, the interfacial hydrogen bonds in protein-DNA complexes displayed a unique energy distribution of strong and weak hydrogen bonds whereas majority of the interfacial hydrogen bonds in protein-protein and protein-peptide complexes are of predominantly high strength with low energy. Moreover, there is a significant difference in the energy distributions of minor groove hydrogen bonds between protein-DNA complexes with different binding specificity. Highly specific protein-DNA complexes contain more strong hydrogen bonds in the minor groove than multi-specific complexes, suggesting important role of minor groove in specific protein-DNA recognition. These results can help better understand protein-DNA interactions and have important implications in improving quality assessments of protein-DNA complex models.


Subject(s)
DNA , Proteins , DNA/chemistry , Hydrogen Bonding , Ligands , Proteins/chemistry
6.
Sci Rep ; 11(1): 21178, 2021 10 27.
Article in English | MEDLINE | ID: mdl-34707120

ABSTRACT

Insertions and deletions (Indels) represent one of the major variation types in the human genome and have been implicated in diseases including cancer. To study the features of somatic indels in different cancer genomes, we investigated the indels from two large samples of cancer types: invasive breast carcinoma (BRCA) and lung adenocarcinoma (LUAD). Besides mapping somatic indels in both coding and untranslated regions (UTRs) from the cancer whole exome sequences, we investigated the overlap between these indels and transcription factor binding sites (TFBSs), the key elements for regulation of gene expression that have been found in both coding and non-coding sequences. Compared to the germline indels in healthy genomes, somatic indels contain more coding indels with higher than expected frame-shift (FS) indels in cancer genomes. LUAD has a higher ratio of deletions and higher coding and FS indel rates than BRCA. More importantly, these somatic indels in cancer genomes tend to locate in sequences with important functions, which can affect the core secondary structures of proteins and have a bigger overlap with predicted TFBSs in coding regions than the germline indels. The somatic CDS indels are also enriched in highly conserved nucleotides when compared with germline CDS indels.


Subject(s)
Adenocarcinoma of Lung/genetics , Breast Neoplasms/genetics , INDEL Mutation , Lung Neoplasms/genetics , Adenocarcinoma of Lung/metabolism , Breast Neoplasms/metabolism , Female , Gene Expression Regulation, Neoplastic , Humans , Lung Neoplasms/metabolism , Male , Open Reading Frames/genetics , Transcription Factors/metabolism , Untranslated Regions/genetics
7.
NAR Genom Bioinform ; 3(1): lqab006, 2021 Mar.
Article in English | MEDLINE | ID: mdl-33655206

ABSTRACT

Single-stranded DNA-binding proteins (SSBs) play crucial roles in DNA replication, recombination and repair, and serve as key players in the maintenance of genomic stability. While a number of SSBs bind single-stranded DNA (ssDNA) non-specifically, the others recognize and bind specific ssDNA sequences. The mechanisms underlying this binding discrepancy, however, are largely unknown. Here, we present a comparative study of protein-ssDNA interactions by annotating specific and non-specific SSBs and comparing structural features such as DNA-binding propensities and secondary structure types of residues in SSB-ssDNA interactions, protein-ssDNA hydrogen bonding and π-π interactions between specific and non-specific SSBs. Our results suggest that protein side chain-DNA base hydrogen bonds are the major contributors to protein-ssDNA binding specificity, while π-π interactions may mainly contribute to binding affinity. We also found the enrichment of aspartate in the specific SSBs, a key feature in specific protein-double-stranded DNA (dsDNA) interactions as reported in our previous study. In addition, no significant differences between specific and non-specific groups with respect of conformational changes upon ssDNA binding were found, suggesting that the flexibility of SSBs plays a lesser role than that of dsDNA-binding proteins in conferring binding specificity.

8.
BMC Med Genomics ; 13(1): 170, 2020 11 10.
Article in English | MEDLINE | ID: mdl-33167946

ABSTRACT

BACKGROUND: Insertion and deletion (indel) is one of the major variation types in human genomes. Accurate annotation of indels is of paramount importance in genetic variation analysis and investigation of their roles in human diseases. Previous studies revealed a high number of false positives from existing indel calling methods, which limits downstream analyses of the effects of indels on both healthy and disease genomes. In this study, we evaluated seven commonly used general indel calling programs for germline indels and four somatic indel calling programs through comparative analysis to investigate their common features and differences and to explore ways to improve indel annotation accuracy. METHODS: In our comparative analysis, we adopted a more stringent evaluation approach by considering both the indel positions and the indel types (insertion or deletion sequences) between the samples and the reference set. In addition, we applied an efficient way to use a benchmark for improved performance comparisons for the general indel calling programs RESULTS: We found that germline indels in healthy genomes derived by combining several indel calling tools could help remove a large number of false positive indels from individual programs without compromising the number of true positives. The performance comparisons of somatic indel calling programs are more complicated due to the lack of a reliable and comprehensive benchmark. Nevertheless our results revealed large variations among the programs and among cancer types. CONCLUSIONS: While more accurate indel calling programs are needed, we found that the performance for germline indel annotations can be improved by combining the results from several programs. In addition, well-designed benchmarks for both germline and somatic indels are key in program development and evaluations.


Subject(s)
High-Throughput Nucleotide Sequencing , INDEL Mutation , Neoplasms/genetics , Benchmarking , Datasets as Topic , False Positive Reactions , Genome, Human , Germ-Line Mutation , Humans , Molecular Sequence Annotation , Reproducibility of Results , Software
9.
Nucleic Acids Res ; 47(21): 11103-11113, 2019 12 02.
Article in English | MEDLINE | ID: mdl-31665426

ABSTRACT

Knowledge of protein-DNA binding specificity has important implications in understanding DNA metabolism, transcriptional regulation and developing therapeutic drugs. Previous studies demonstrated hydrogen bonds between amino acid side chains and DNA bases play major roles in specific protein-DNA interactions. In this paper, we investigated the roles of individual DNA strands and protein secondary structure types in specific protein-DNA recognition based on side chain-base hydrogen bonds. By comparing the contribution of each DNA strand to the overall binding specificity between DNA-binding proteins with different degrees of binding specificity, we found that highly specific DNA-binding proteins show balanced hydrogen bonding with each of the two DNA strands while multi-specific DNA binding proteins are generally biased towards one strand. Protein-base pair hydrogen bonds, in which both bases of a base pair are involved in forming hydrogen bonds with amino acid side chains, are more prevalent in the highly specific protein-DNA complexes than those in the multi-specific group. Amino acids involved in side chain-base hydrogen bonds favor strand and coil secondary structure types in highly specific DNA-binding proteins while multi-specific DNA-binding proteins prefer helices.


Subject(s)
DNA-Binding Proteins/chemistry , DNA/chemistry , Models, Molecular , Amino Acids/chemistry , Base Pairing , Binding Sites , Hydrogen Bonding , Nucleic Acid Conformation , Protein Structure, Secondary
10.
BMC Bioinformatics ; 19(Suppl 20): 506, 2018 Dec 21.
Article in English | MEDLINE | ID: mdl-30577740

ABSTRACT

BACKGROUND: Atomic details of protein-DNA complexes can provide insightful information for better understanding of the function and binding specificity of DNA binding proteins. In addition to experimental methods for solving protein-DNA complex structures, protein-DNA docking can be used to predict native or near-native complex models. A docking program typically generates a large number of complex conformations and predicts the complex model(s) based on interaction energies between protein and DNA. However, the prediction accuracy is hampered by current approaches to model assessment, especially when docking simulations fail to produce any near-native models. RESULTS: We present here a Support Vector Machine (SVM)-based approach for quality assessment of the predicted transcription factor (TF)-DNA complex models. Besides a knowledge-based protein-DNA interaction potential DDNA3, we applied several structural features that have been shown to play important roles in binding specificity between transcription factors and DNA molecules to quality assessment of complex models. To address the issue of unbalanced positive and negative cases in the training dataset, we applied hard-negative mining, an iterative training process that selects an initial training dataset by combining all of the positive cases and a random sample from the negative cases. Results show that the SVM model greatly improves prediction accuracy (84.2%) over two knowledge-based protein-DNA interaction potentials, orientation potential (60.8%) and DDNA3 (68.4%). The improvement is achieved through reducing the number of false positive predictions, especially for the hard docking cases, in which a docking algorithm fails to produce any near-native complex models. CONCLUSIONS: A learning-based SVM scoring model with structural features for specific protein-DNA binding and an atomic-level protein-DNA interaction potential DDNA3 significantly improves prediction accuracy of complex models by successfully identifying cases without near-native structural models.


Subject(s)
DNA/metabolism , Models, Molecular , Support Vector Machine , Transcription Factors/metabolism , Algorithms , DNA/chemistry , Protein Binding
11.
Sci Rep ; 7(1): 9313, 2017 08 24.
Article in English | MEDLINE | ID: mdl-28839204

ABSTRACT

Insertions and deletions (indels) represent the second most common type of genetic variations in human genomes. Indels can be deleterious and contribute to disease susceptibility as recent genome sequencing projects revealed a large number of indels in various cancer types. In this study, we investigated the possible effects of small coding indels on protein structure and function, and the baseline characteristics of indels in 2504 individuals of 26 populations from the 1000 Genomes Project. We found that each population has a distinct pattern in genes with small indels. Frameshift (FS) indels are enriched in olfactory receptor activity while non-frameshift (NFS) indels are enriched in transcription-related proteins. Structural analysis of NFS indels revealed that they predominantly adopt coil or disordered conformations, especially in proteins with transcription-related NFS indels. These results suggest that the annotated coding indels from the 1000 Genomes Project, while contributing to genetic variations and phenotypic diversity, generally do not affect the core protein structures and have no deleterious effect on essential biological processes. In addition, we found that a number of reference genome annotations might need to be updated due to the high prevalence of annotated homozygous indels in the general population.


Subject(s)
INDEL Mutation , Proteins/genetics , Proteins/metabolism , Biological Variation, Population , Computational Biology , Genome, Human , Humans , Protein Conformation , Proteins/chemistry
12.
BMC Bioinformatics ; 18(1): 342, 2017 Jul 17.
Article in English | MEDLINE | ID: mdl-28715997

ABSTRACT

BACKGROUND: Gene expression is regulated by transcription factors binding to specific target DNA sites. Understanding how and where transcription factors bind at genome scale represents an essential step toward our understanding of gene regulation networks. Previously we developed a structure-based method for prediction of transcription factor binding sites using an integrative energy function that combines a knowledge-based multibody potential and two atomic energy terms. While the method performs well, it is not computationally efficient due to the exponential increase in the number of binding sequences to be evaluated for longer binding sites. In this paper, we present an efficient pentamer algorithm by splitting DNA binding sequences into overlapping fragments along with a simplified integrative energy function for transcription factor binding site prediction. RESULTS: A DNA binding sequence is split into overlapping pentamers (5 base pairs) for calculating transcription factor-pentamer interaction energy. To combine the results from overlapping pentamer scores, we developed two methods, Kmer-Sum and PWM (Position Weight Matrix) stacking, for full-length binding motif prediction. Our results show that both Kmer-Sum and PWM stacking in the new pentamer approach along with a simplified integrative energy function improved transcription factor binding site prediction accuracy and dramatically reduced computation time, especially for longer binding sites. CONCLUSION: Our new fragment-based pentamer algorithm and simplified energy function improve both efficiency and accuracy. To our knowledge, this is the first fragment-based method for structure-based transcription factor binding sites prediction.


Subject(s)
Algorithms , Sequence Analysis, DNA/methods , Transcription Factors/metabolism , Binding Sites , DNA/chemistry , DNA/metabolism , Nucleotide Motifs , Position-Specific Scoring Matrices , Protein Binding
13.
J Org Chem ; 82(4): 1888-1894, 2017 02 17.
Article in English | MEDLINE | ID: mdl-28107007

ABSTRACT

Natural pigment chlorophyll was used as a green photosensitizer for the first time in a visible-light photoredox catalysis for the efficient synthesis of tetrahydroquinolines from N,N-dimethylanilines and maleimides in an air atmosphere. The reaction involves direct cyclization via an sp3 C-H bond functionalization process to afford products in moderate to high yields (61-98%) from a wide range of substrates with a low loading of chlorophyll under mild conditions. This work demonstrates the potential benefits of chlorophyll as photosensitizer in visible light catalysis.

14.
Cladistics ; 33(1): 1-20, 2017 Feb.
Article in English | MEDLINE | ID: mdl-34724757

ABSTRACT

Zika virus was previously considered to cause only a benign infection in humans. Studies of recent outbreaks of Zika virus in the Pacific, South America, Mexico and the Caribbean have associated the virus with severe neuropathology. Viral evolution may be one factor contributing to an apparent change in Zika disease as it spread from Southeast Asia across the Pacific to the Americas. To address this possibility, we have employed computational tools to compare the phylogeny, geography, immunology and RNA structure of Zika virus isolates from Africa, Asia, the Pacific and the Americas. In doing so, we compare and contrast methods and results for tree search and rooting of Zika virus phylogenies. In some phylogenetic analyses we find support for the hypothesis that there is a deep common ancestor between African and Asian clades (the "Asia/Africa" hypothesis). In other phylogenetic analyses, we find that Asian lineages are descendent from African lineages (the "out of Africa" hypothesis). In addition, we identify and evaluate key mutations in viral envelope protein coding and untranslated terminal RNA regions. We find stepwise mutations that have altered both immunological motif sets and regulatory sequence elements. Both of these sets of changes distinguish viruses found in Africa from those in the emergent Asia-Pacific-Americas lineage. These findings support the working hypothesis that mutations acquired by Zika virus in the Pacific and Americas contribute to changes in pathology. These results can inform experiments required to elucidate the role of viral genetic evolution in changes in neuropathology, including microcephaly and other neurological and skeletomuscular issues in infants, and Guillain-Barré syndrome in adults.

15.
Bioinformatics ; 32(12): i306-i313, 2016 06 15.
Article in English | MEDLINE | ID: mdl-27307632

ABSTRACT

UNLABELLED: Transcription factors (TFs) regulate gene expression through binding to specific target DNA sites. Accurate annotation of transcription factor binding sites (TFBSs) at genome scale represents an essential step toward our understanding of gene regulation networks. In this article, we present a structure-based method for computational prediction of TFBSs using a novel, integrative energy (IE) function. The new energy function combines a multibody (MB) knowledge-based potential and two atomic energy terms (hydrogen bond and π interaction) that might not be accurately captured by the knowledge-based potential owing to the mean force nature and low count problem. We applied the new energy function to the TFBS prediction using a non-redundant dataset that consists of TFs from 12 different families. Our results show that the new IE function improves the prediction accuracy over the knowledge-based, statistical potentials, especially for homeodomain TFs, the second largest TF family in mammals. CONTACT: jguo4@uncc.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Transcription Factors/chemistry , Animals , Binding Sites , Computational Biology , DNA-Binding Proteins , Gene Expression Regulation , Protein Binding
16.
Proteins ; 84(8): 1147-61, 2016 08.
Article in English | MEDLINE | ID: mdl-27147539

ABSTRACT

DNA-binding proteins play critical roles in biological processes including gene expression, DNA packaging and DNA repair. They bind to DNA target sequences with different degrees of binding specificity, ranging from highly specific (HS) to nonspecific (NS). Alterations of DNA-binding specificity, due to either genetic variation or somatic mutations, can lead to various diseases. In this study, a comparative analysis of protein-DNA complex structures was carried out to investigate the structural features that contribute to binding specificity. Protein-DNA complexes were grouped into three general classes based on degrees of binding specificity: HS, multispecific (MS), and NS. Our results show a clear trend of structural features among the three classes, including amino acid binding propensities, simple and complex hydrogen bonds, major/minor groove and base contacts, and DNA shape. We found that aspartate is enriched in HS DNA binding proteins and predominately binds to a cytosine through a single hydrogen bond or two consecutive cytosines through bidentate hydrogen bonds. Aromatic residues, histidine and tyrosine, are highly enriched in the HS and MS groups and may contribute to specific binding through different mechanisms. To further investigate the role of protein flexibility in specific protein-DNA recognition, we analyzed the conformational changes between the bound and unbound states of DNA-binding proteins and structural variations. The results indicate that HS and MS DNA-binding domains have larger conformational changes upon DNA-binding and larger degree of flexibility in both bound and unbound states. Proteins 2016; 84:1147-1161. © 2016 Wiley Periodicals, Inc.


Subject(s)
Amino Acids/chemistry , DNA-Binding Proteins/chemistry , DNA/chemistry , Binding Sites , Hydrogen Bonding , Hydrophobic and Hydrophilic Interactions , Models, Molecular , Nucleic Acid Conformation , Protein Binding , Protein Interaction Domains and Motifs , Protein Structure, Secondary , Static Electricity , Thermodynamics
17.
J Biomol Struct Dyn ; 33(10): 2083-93, 2015.
Article in English | MEDLINE | ID: mdl-25495540

ABSTRACT

Transcription factors regulate gene expression through binding to specific DNA sequences. How transcription factors achieve high binding specificity is still not well understood. In this paper, we investigated the role of protein flexibility in protein-DNA-binding specificity by comparative molecular dynamics (MD) simulations. Protein flexibility has been considered as a key factor in molecular recognition, which is intrinsically a dynamic process involving fine structural fitting between binding components. In this study, we performed comparative MD simulations on wild-type and F10V mutant P22 Arc repressor in both free and complex conformations. The F10V mutant has lower DNA-binding specificity though both the bound and unbound main-chain structures between the wild-type and F10V mutant Arc are highly similar. We found that the DNA-binding motif of wild-type Arc is structurally more flexible than the F10V mutant in the unbound state, especially for the six DNA base-contacting residues in each dimer. We demonstrated that the flexible side chains of wild-type Arc lead to a higher DNA-binding specificity through forming more hydrogen bonds with DNA bases upon binding. Our simulations also showed a possible conformational selection mechanism for Arc-DNA binding. These results indicate the important roles of protein flexibility and dynamic properties in protein-DNA-binding specificity.


Subject(s)
Bacteriophage P22/chemistry , DNA, Viral/chemistry , Molecular Dynamics Simulation , Phenylalanine/chemistry , Repressor Proteins/chemistry , Valine/chemistry , Viral Regulatory and Accessory Proteins/chemistry , Amino Acid Motifs , Amino Acid Substitution , Base Sequence , Binding Sites , Humans , Hydrogen Bonding , Molecular Sequence Data , Mutation , Nucleic Acid Conformation , Protein Binding , Protein Multimerization , Protein Structure, Secondary , Protein Structure, Tertiary , Repressor Proteins/genetics , Thermodynamics , Viral Regulatory and Accessory Proteins/genetics
18.
Eur J Med Chem ; 80: 71-82, 2014 Jun 10.
Article in English | MEDLINE | ID: mdl-24763364

ABSTRACT

Semisynthetic analogues of the natural product 1-O-acetylbritannilactone (ABL), a sesquiterpene isolated from the medicinal plant Inula britannica, have been prepared and exhibited significant in vitro cytotoxic activities against four cell lines including three human cancer cell lines (HCT116, HEp-2 and HeLa) and one normal hamster cell line (CHO). Structure-activity relationships indicate that esterification of 6-OH (enhanced lipophilicity) and α-methylene-γ-lactone functionalities play important roles in conferring cytotoxicity. Among the tested compounds, 14 bearing a lauroyl group (12C) at the 6-OH position displayed most potent in vitro cytotoxic activity, with IC50 values between 2.91 and 6.78 µM, comparable to the positive control etoposide (VP-16, IC50 values between 2.13 and 4.79 µM). Moreover, the compound 14 triggered remarkable apoptosis at a low concentration, and induced cell cycle arrest in G2/M phase in HCT116 cells. The biological assays conducted with normal cells (CHO) revealed that all the synthetic compounds are no selective against cancer cell lines tested.


Subject(s)
Antineoplastic Agents/chemical synthesis , Antineoplastic Agents/pharmacology , Inula/chemistry , Lactones/chemical synthesis , Lactones/pharmacology , Animals , Antineoplastic Agents/chemistry , Apoptosis/drug effects , CHO Cells , Cell Cycle/drug effects , Cell Line, Tumor , Chemistry Techniques, Synthetic , Cricetinae , Cricetulus , Humans , Inhibitory Concentration 50 , Lactones/chemistry , Structure-Activity Relationship
19.
Nucleic Acids Res ; 42(7): 4375-90, 2014 Apr.
Article in English | MEDLINE | ID: mdl-24500196

ABSTRACT

The newly developed transcription activator-like effector protein (TALE) and clustered regularly interspaced short palindromic repeats/Cas9 transcription factors (TF) offered a powerful and precise approach for modulating gene expression. In this article, we systematically investigated the potential of these new tools in activating the stringently silenced pluripotency gene Oct4 (Pou5f1) in mouse and human somatic cells. First, with a number of TALEs and sgRNAs targeting various regions in the mouse and human Oct4 promoters, we found that the most efficient TALE-VP64s bound around -120 to -80 bp, while highly effective sgRNAs targeted from -147 to -89-bp upstream of the transcription start sites to induce high activity of luciferase reporters. In addition, we observed significant transcriptional synergy when multiple TFs were applied simultaneously. Although individual TFs exhibited marginal activity to up-regulate endogenous gene expression, optimized combinations of TALE-VP64s could enhance endogenous Oct4 transcription up to 30-fold in mouse NIH3T3 cells and 20-fold in human HEK293T cells. More importantly, the enhancement of OCT4 transcription ultimately generated OCT4 proteins. Furthermore, examination of different epigenetic modifiers showed that histone acetyltransferase p300 could enhance both TALE-VP64 and sgRNA/dCas9-VP64 induced transcription of endogenous OCT4. Taken together, our study suggested that engineered TALE-TF and dCas9-TF are useful tools for modulating gene expression in mammalian cells.


Subject(s)
Octamer Transcription Factor-3/genetics , Transcription Factors/metabolism , Transcriptional Activation , Animals , Cells, Cultured , Gene Silencing , Humans , Mice , Recombinant Fusion Proteins/chemistry , Transcription Factors/genetics , p300-CBP Transcription Factors/metabolism , RNA, Small Untranslated
20.
Methods Mol Biol ; 1084: 239-54, 2014.
Article in English | MEDLINE | ID: mdl-24061925

ABSTRACT

The Distance Constraint Model (DCM) is a computational modeling scheme that uniquely integrates thermodynamic and mechanical descriptions of protein structure. As such, quantitative stability-flexibility relationships (QSFR) that describe the interrelationships of thermodynamics and mechanics can be quickly computed. Using comparative QSFR analyses, we have previously investigated these relationships across a small number of protein orthologs, ranging from two to a dozen [1, 2]. However, our ultimate goal is provide a comprehensive analysis of whole protein families, which requires consideration of many more structures. To that end, we have developed homology modeling and assessment protocols so that we can robustly calculate QSFR properties for proteins without experimentally derived structures. The approach, which is presented here, starts from a large ensemble of potential homology models and uses a clustering algorithm to identify the best models, thus paving the way for a comprehensive QSFR analysis across hundreds of proteins in a protein family.


Subject(s)
Models, Molecular , Proteins/chemistry , Quantitative Structure-Activity Relationship , Animals , Cluster Analysis , Humans , Protein Conformation , Protein Stability , Thermodynamics
SELECTION OF CITATIONS
SEARCH DETAIL
...