Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 28
Filter
Add more filters










Publication year range
1.
bioRxiv ; 2023 Sep 12.
Article in English | MEDLINE | ID: mdl-37745370

ABSTRACT

Motivation: The two strands of the DNA double helix locally and spontaneously separate and recombine in living cells due to the inherent thermal DNA motion.This dynamics results in transient openings in the double helix and is referred to as "DNA breathing" or "DNA bubbles." The propensity to form local transient openings is important in a wide range of biological processes, such as transcription, replication, and transcription factors binding. However, the modeling and computer simulation of these phenomena, have remained a challenge due to the complex interplay of numerous factors, such as, temperature, salt content, DNA sequence, hydrogen bonding, base stacking, and others. Results: We present pyDNA-EPBD, a parallel software implementation of the Extended Peyrard-Bishop- Dauxois (EPBD) nonlinear DNA model that allows us to describe some features of DNA dynamics in detail. The pyDNA-EPBD generates genomic scale profiles of average base-pair openings, base flipping probability, DNA bubble probability, and calculations of the characteristically dynamic length indicating the number of base pairs statistically significantly affected by a single point mutation using the Markov Chain Monte Carlo (MCMC) algorithm.

2.
Physiol Rep ; 11(15): e15742, 2023 08.
Article in English | MEDLINE | ID: mdl-37537137

ABSTRACT

Obesity continues to rise in the juveniles and obese children are more likely to develop metabolic syndrome (MetS) and related cardiovascular disease. Unfortunately, effective prevention and long-term treatment options remain limited. We determined the juvenile cardiac response to MetS in a swine model. Juvenile male swine were fed either an obesogenic diet, to induce MetS, or a lean diet, as a control (LD). Myocardial ischemia was induced with surgically placed ameroid constrictor on the left circumflex artery. Physiological data were recorded and at 22 weeks of age the animals underwent a terminal harvest procedure and myocardial tissue was extracted for total metabolic and proteomic LC/MS-MS, RNA-seq analysis, and data underwent nonnegative matrix factorization for metabolic signatures. Significantly altered in MetS versus. LD were the glycolysis-related metabolites and enzymes. In MetS compared with LD Glycogen synthase 1 (GYS1)-glycogen phosphorylases (PYGM/PYGL) expression disbalance resulted in a loss of myocardial glycogen. Our findings are consistent with the concept that transcriptionally driven myocardial changes in glycogen and glucose metabolism-related enzymes lead to a deficiency of their metabolite products in MetS. This abnormal energy metabolism provides insight into the pathogenesis of the juvenile heart in MetS. This study reveals that MetS and ischemia diminishes ATP availability in the myocardium via altering the glucose-G6P-pyruvate axis at the level of metabolites and gene expression of related enzymes. The observed severe glycogen depletion in MetS coincides with disbalance in expression of GYS1 and both PYGM and PYGL. This altered energy substrate metabolism is a potential target of pharmacological agents for improving juvenile myocardial function in MetS and ischemia.


Subject(s)
Metabolic Syndrome , Pediatric Obesity , Swine , Male , Animals , Metabolic Syndrome/metabolism , Proteomics/methods , Myocardium/metabolism , Glycolysis , Ischemia/metabolism , Disease Models, Animal
3.
Cell Genom ; 2(11): None, 2022 Nov 09.
Article in English | MEDLINE | ID: mdl-36388765

ABSTRACT

Mutational signature analysis is commonly performed in cancer genomic studies. Here, we present SigProfilerExtractor, an automated tool for de novo extraction of mutational signatures, and benchmark it against another 13 bioinformatics tools by using 34 scenarios encompassing 2,500 simulated signatures found in 60,000 synthetic genomes and 20,000 synthetic exomes. For simulations with 5% noise, reflecting high-quality datasets, SigProfilerExtractor outperforms other approaches by elucidating between 20% and 50% more true-positive signatures while yielding 5-fold less false-positive signatures. Applying SigProfilerExtractor to 4,643 whole-genome- and 19,184 whole-exome-sequenced cancers reveals four novel signatures. Two of the signatures are confirmed in independent cohorts, and one of these signatures is associated with tobacco smoking. In summary, this report provides a reference tool for analysis of mutational signatures, a comprehensive benchmarking of bioinformatics tools for extracting signatures, and several novel mutational signatures, including one putatively attributed to direct tobacco smoking mutagenesis in bladder tissues.

4.
Sci Rep ; 12(1): 8539, 2022 May 20.
Article in English | MEDLINE | ID: mdl-35595786

ABSTRACT

Quantum annealers manufactured by D-Wave Systems, Inc., are computational devices capable of finding high-quality heuristic solutions of NP-hard problems. In this contribution, we explore the potential and effectiveness of such quantum annealers for computing Boolean tensor networks. Tensors offer a natural way to model high-dimensional data commonplace in many scientific fields, and representing a binary tensor as a Boolean tensor network is the task of expressing a tensor containing categorical (i.e., [Formula: see text]) values as a product of low dimensional binary tensors. A Boolean tensor network is computed by Boolean tensor decomposition, and it is usually not exact. The aim of such decomposition is to minimize the given distance measure between the high-dimensional input tensor and the product of lower-dimensional (usually three-dimensional) tensors and matrices representing the tensor network. In this paper, we introduce and analyze three general algorithms for Boolean tensor networks: Tucker, Tensor Train, and Hierarchical Tucker networks. The computation of a Boolean tensor network is reduced to a sequence of Boolean matrix factorizations, which we show can be expressed as a quadratic unconstrained binary optimization problem suitable for solving on a quantum annealer. By using a novel method we introduce called parallel quantum annealing, we demonstrate that Boolean tensor's with up to millions of elements can be decomposed efficiently using a DWave 2000Q quantum annealer.

5.
IEEE/ACM Trans Comput Biol Bioinform ; 19(3): 1670-1682, 2022.
Article in English | MEDLINE | ID: mdl-33400654

ABSTRACT

A central challenge in protein modeling research and protein structure prediction in particular is known as decoy selection. The problem refers to selecting biologically-active/native tertiary structures among a multitude of physically-realistic structures generated by template-free protein structure prediction methods. Research on decoy selection is active. Clustering-based methods are popular, but they fail to identify good/near-native decoys on datasets where near-native decoys are severely under-sampled by a protein structure prediction method. Reasonable progress is reported by methods that additionally take into account the internal energy of a structure and employ it to identify basins in the energy landscape organizing the multitude of decoys. These methods, however, incur significant time costs for extracting basins from the landscape. In this paper, we propose a novel decoy selection method based on non-negative matrix factorization. We demonstrate that our method outperforms energy landscape-based methods. In particular, the proposed method addresses both the time cost issue and the challenge of identifying good decoys in a sparse dataset, successfully recognizing near-native decoys for both easy and hard protein targets.


Subject(s)
Algorithms , Proteins , Cluster Analysis , Protein Conformation , Protein Folding , Proteins/chemistry , Proteins/genetics
6.
Sci Rep ; 11(1): 19752, 2021 10 05.
Article in English | MEDLINE | ID: mdl-34611227

ABSTRACT

Although metabolic syndrome (MetS) is linked to an elevated risk of cardiovascular disease (CVD), the cardiac-specific risk mechanism is unknown. Obesity, hypertension, and diabetes (all MetS components) are the most common form of CVD and represent risk factors for worse COVID-19 outcomes compared to their non MetS peers. Here, we use obese Yorkshire pigs as a highly relevant animal model of human MetS, where pigs develop the hallmarks of human MetS and reproducibly mimics the myocardial pathophysiology in patients. Myocardium-specific mass spectroscopy-derived metabolomics, proteomics, and transcriptomics enabled the identity and quality of proteins and metabolites to be investigated in the myocardium to greater depth. Myocardium-specific deregulation of pro-inflammatory markers, propensity for arterial thrombosis, and platelet aggregation was revealed by computational analysis of differentially enriched pathways between MetS and control animals. While key components of the complement pathway and the immune response to viruses are under expressed, key N6-methyladenosin RNA methylation enzymes are largely overexpressed in MetS. Blood tests do not capture the entirety of metabolic changes that the myocardium undergoes, making this analysis of greater value than blood component analysis alone. Our findings create data associations to further characterize the MetS myocardium and disease vulnerability, emphasize the need for a multimodal therapeutic approach, and suggests a mechanism for observed worse outcomes in MetS patients with COVID-19 comorbidity.


Subject(s)
COVID-19/pathology , Disease Susceptibility , Metabolic Syndrome/pathology , Animals , Blood Coagulation Factors/genetics , Blood Coagulation Factors/metabolism , COVID-19/complications , COVID-19/virology , Cyclooxygenase 2/genetics , Cyclooxygenase 2/metabolism , Diet, High-Fat/veterinary , Disease Models, Animal , Humans , Immunity, Innate/genetics , Metabolic Syndrome/complications , Metabolic Syndrome/metabolism , Methyltransferases/genetics , Methyltransferases/metabolism , Myocardium/metabolism , Oxidative Stress/genetics , Platelet Aggregation , Receptors, Purinergic P2Y1/genetics , Receptors, Purinergic P2Y1/metabolism , Renin-Angiotensin System , Risk Factors , SARS-CoV-2/isolation & purification , Swine , Urokinase-Type Plasminogen Activator/genetics , Urokinase-Type Plasminogen Activator/metabolism
8.
Nat Commun ; 11(1): 3096, 2020 06 18.
Article in English | MEDLINE | ID: mdl-32555180

ABSTRACT

Intratumor heterogeneity (ITH) and tumor evolution have been well described for clear cell renal cell carcinomas (ccRCC), but they are less studied for other kidney cancer subtypes. Here we investigate ITH and clonal evolution of papillary renal cell carcinoma (pRCC) and rarer kidney cancer subtypes, integrating whole-genome sequencing and DNA methylation data. In 29 tumors, up to 10 samples from the center to the periphery of each tumor, and metastatic samples in 2 cases, enable phylogenetic analysis of spatial features of clonal expansion, which shows congruent patterns of genomic and epigenomic evolution. In contrast to previous studies of ccRCC, in pRCC, driver gene mutations and most arm-level somatic copy number alterations (SCNAs) are clonal. These findings suggest that a single biopsy would be sufficient to identify the important genetic drivers and that targeting large-scale SCNAs may improve pRCC treatment, which is currently poor. While type 1 pRCC displays near absence of structural variants (SVs), the more aggressive type 2 pRCC and the rarer subtypes have numerous SVs, which should be pursued for prognostic significance.


Subject(s)
Carcinoma, Renal Cell/genetics , Kidney Neoplasms/genetics , DNA Copy Number Variations/genetics , Epigenomics , Germ-Line Mutation/genetics , Humans , Phylogeny
9.
Sci Rep ; 10(1): 3483, 2020 02 26.
Article in English | MEDLINE | ID: mdl-32103083

ABSTRACT

Although metabolic syndrome (MS) is a significant risk of cardiovascular disease (CVD), the cardiac response (MR) to MS remains unclear due to traditional MS models' narrow scope around a limited number of cell-cycle regulation biomarkers and drawbacks of limited human tissue samples. To date, we developed the most comprehensive platform studying MR to MS in a pig model tightly related to human MS criteria. By incorporating comparative metabolomic, transcriptomic, functional analyses, and unsupervised machine learning (UML), we can discover unknown metabolic pathways connections and links on numerous biomarkers across the MS-associated issues in the heart. For the first time, we show severely diminished availability of glycolytic and citric acid cycle (CAC) pathways metabolites, altered expression, GlcNAcylation, and activity of involved enzymes. A notable exception, however, is the excessive succinate accumulation despite reduced succinate dehydrogenase complex iron-sulfur subunit b (SDHB) expression and decreased content of precursor metabolites. Finally, the expression of metabolites and enzymes from the GABA-glutamate, GABA-putrescine, and the glyoxylate pathways significantly increase, suggesting an alternative cardiac means to replenish succinate and malate in MS. Our platform discovers potential therapeutic targets for MS-associated CVD within pathways that were previously unknown to corelate with the disease.


Subject(s)
Energy Metabolism , Metabolic Syndrome/pathology , Metabolome , Metabolomics/methods , Myocardium/metabolism , Animals , Biomarkers/metabolism , Citric Acid Cycle/genetics , Diet, High-Fat , Disease Models, Animal , Glycolysis/genetics , Male , Metabolic Syndrome/metabolism , Risk Factors , Succinate Dehydrogenase/metabolism , Succinic Acid/metabolism , Swine , Unsupervised Machine Learning
10.
PLoS One ; 14(12): e0225857, 2019.
Article in English | MEDLINE | ID: mdl-31790488

ABSTRACT

Although the high-fat-diet-induced metabolic syndrome (MetS) is a precursor of human cardiac pathology, the myocardial metabolic state in MetS is far from clear. The discrepancies in metabolite handling between human and small animal models and the difficulties inherent in obtaining human tissue complicate the identification of the myocardium-specific metabolic response in patients. Here we use the large animal model of swine that develops the hallmark criteria of human MetS. Our comparative metabolomics together with transcriptomics and computational nonnegative matrix factorization (NMF) interpretation of the data exposes significant decline in metabolites related to the fatty acid oxidation, glycolysis, and pentose phosphate pathway. Behind the reversal lies decreased expression of enzymes that operate in the pathways. We showed that diminished glycogen deposition is a metabolic signature of MetS in the pig myocardium. The depletion of glycogen arises from disbalance in expression of genes that break down and synthesize glycogen. We show robust acetoacetate accumulation and activated expression of key enzymes in ketone body formation, catabolism and transporters, suggesting a shift in fuel utilization in MetS. A contrasting enrichment in O-GlcNAcylated proteins uncovers hexosamine pathway and O-GlcNAcase (OGA) expression involvement in the myocardial response to MetS. Although the hexosamine biosynthetic pathway (HBP) activity and the availability of the UDP-GlcNAc substrate in the MetS myocardium is low, the level of O-GlcNacylated proteins is high as the O-GlcNacase is significantly diminished. Our data support the perception of transcriptionally driven myocardial alterations in expression of standard fatty acids, glucose metabolism, glycogen, and ketone body related enzymes and subsequent paucity of their metabolite products in MetS. This aberrant energy metabolism in the MetS myocardium provide insight into the pathogenesis of CVD in MetS.


Subject(s)
Metabolic Networks and Pathways , Metabolic Syndrome/metabolism , Myocardium/metabolism , Animals , Cholesterol, Dietary/adverse effects , Diet , Glycosylation , Male , Metabolome , Metabolomics , N-Acetylglucosaminyltransferases/metabolism , RNA, Messenger/genetics , RNA, Messenger/metabolism , Risk Factors , Swine , Unsupervised Machine Learning , beta-N-Acetylhexosaminidases/metabolism
11.
J Chem Theory Comput ; 15(11): 6343-6357, 2019 Nov 12.
Article in English | MEDLINE | ID: mdl-31476122

ABSTRACT

Phase separation in mixed lipid systems has been extensively studied both experimentally and theoretically because of its biological importance. A detailed description of such complex systems undoubtedly requires novel mathematical frameworks that are capable of decomposing and categorizing the evolution of thousands if not millions of lipids involved in the phenomenon. The interpretation and analysis of molecular dynamics (MD) simulations representing temporal and spatial changes in such systems are still a challenging task. Here, we present an unsupervised machine learning approach based on nonnegative matrix factorization called NMFk that successfully extracts latent (i.e., not directly observable) features from the second layer neighborhood profiles derived from coarse-grained MD simulations of a ternary lipid mixture. Our results demonstrate that NMFk extracts physically meaningful features that uniquely describe the phase separation such as locations and roles of different lipid types, formation of nanodomains, and timescales of lipid segregation.


Subject(s)
Lipids/chemistry , Unsupervised Machine Learning , 1,2-Dipalmitoylphosphatidylcholine/chemistry , Cholesterol/chemistry , Lipid Bilayers/chemistry , Molecular Dynamics Simulation , Phosphatidylcholines/chemistry
12.
J Contam Hydrol ; 220: 66-97, 2019 Jan.
Article in English | MEDLINE | ID: mdl-30528243

ABSTRACT

Unsupervised Machine Learning (ML) is becoming increasingly popular for solving various types of data analytics problems including feature extraction, blind source separation, exploratory analyses, model diagnostics, etc. Here, we have developed a new unsupervised ML method based on Nonnegative Tensor Factorization (NTF) for identification of the original groundwater types (including contaminant sources) present in geochemical mixtures observed in an aquifer. Frequently, groundwater types with different geochemical signatures are related to different background and/or contamination sources. The characterization of groundwater mixing processes is a challenging but very important task critical for any environmental management project aiming to characterize the fate and transport of contaminants in the subsurface and perform contaminant remediation. This task typically requires solving complex inverse models representing groundwater flow and geochemical transport in the aquifer, where the inverse analysis accounts for available site data. Usually, the model is calibrated against the available data characterizing the spatial and temporal distribution of the observed geochemical types. Numerous different geochemical constituents and processes may need to be simulated in these models which further complicates the analyses. Additionally, the application of inverse methods may introduce biases in the analyses through the assumptions made in the model development process. Here, we substitute the model inversion with unsupervised ML analysis. The ML analysis does not make any assumptions about underlying physical and geochemical processes occurring in the aquifer. Our ML methodology, called NTFk, is capable of identifying (1) the unknown number of groundwater types (contaminant sources) present in the aquifer, (2) the original geochemical concentrations (signatures) of these groundwater types and (3) spatial and temporal dynamics in the mixing of these groundwater types. These results are obtained only from the measured geochemical data without any additional site information. In general, the NTFk methodology allows for interpretation of large high-dimensional datasets representing diverse spatial and temporal components such as state variables and velocities. NTFk has been tested on synthetic and real-world site three-dimensional datasets. The NTFk algorithm is designed to work with geochemical data represented in the form of concentrations, ratios (of two constituents; for example, isotope ratios), and delta notations (standard normalized stable isotope ratios).


Subject(s)
Groundwater , Water Pollutants, Chemical , Environmental Monitoring , Isotopes
13.
PLoS One ; 13(12): e0206653, 2018.
Article in English | MEDLINE | ID: mdl-30532243

ABSTRACT

D-Wave quantum annealers represent a novel computational architecture and have attracted significant interest. Much of this interest has focused on the quantum behavior of D-Wave machines, and there have been few practical algorithms that use the D-Wave. Machine learning has been identified as an area where quantum annealing may be useful. Here, we show that the D-Wave 2X can be effectively used as part of an unsupervised machine learning method. This method takes a matrix as input and produces two low-rank matrices as output-one containing latent features in the data and another matrix describing how the features can be combined to approximately reproduce the input matrix. Despite the limited number of bits in the D-Wave hardware, this method is capable of handling a large input matrix. The D-Wave only limits the rank of the two output matrices. We apply this method to learn the features from a set of facial images and compare the performance of the D-Wave to two classical tools. This method is able to learn facial features and accurately reproduce the set of facial images. The performance of the D-Wave shows some promise, but has some limitations. It outperforms the two classical codes in a benchmark when only a short amount of computational time is allowed (200-20,000 microseconds), but these results suggest heuristics that would likely outperform the D-Wave in this benchmark.


Subject(s)
Machine Learning , Models, Theoretical , Quantum Theory
14.
PLoS One ; 13(3): e0193974, 2018.
Article in English | MEDLINE | ID: mdl-29518126

ABSTRACT

Factor analysis is broadly used as a powerful unsupervised machine learning tool for reconstruction of hidden features in recorded mixtures of signals. In the case of a linear approximation, the mixtures can be decomposed by a variety of model-free Blind Source Separation (BSS) algorithms. Most of the available BSS algorithms consider an instantaneous mixing of signals, while the case when the mixtures are linear combinations of signals with delays is less explored. Especially difficult is the case when the number of sources of the signals with delays is unknown and has to be determined from the data as well. To address this problem, in this paper, we present a new method based on Nonnegative Matrix Factorization (NMF) that is capable of identifying: (a) the unknown number of the sources, (b) the delays and speed of propagation of the signals, and (c) the locations of the sources. Our method can be used to decompose records of mixtures of signals with delays emitted by an unknown number of sources in a nondispersive medium, based only on recorded data. This is the case, for example, when electromagnetic signals from multiple antennas are received asynchronously; or mixtures of acoustic or seismic signals recorded by sensors located at different positions; or when a shift in frequency is induced by the Doppler effect. By applying our method to synthetic datasets, we demonstrate its ability to identify the unknown number of sources as well as the waveforms, the delays, and the strengths of the signals. Using Bayesian analysis, we also evaluate estimation uncertainties and identify the region of likelihood where the positions of the sources can be found.


Subject(s)
Factor Analysis, Statistical , Signal Processing, Computer-Assisted , Unsupervised Machine Learning , Algorithms , Bayes Theorem , Datasets as Topic , Fourier Analysis , Markov Chains , Monte Carlo Method
15.
J Contam Hydrol ; 212: 134-142, 2018 05.
Article in English | MEDLINE | ID: mdl-29174719

ABSTRACT

Identification of the original groundwater types present in geochemical mixtures observed in an aquifer is a challenging but very important task. Frequently, some of the groundwater types are related to different infiltration and/or contamination sources associated with various geochemical signatures and origins. The characterization of groundwater mixing processes typically requires solving complex inverse models representing groundwater flow and geochemical transport in the aquifer, where the inverse analysis accounts for available site data. Usually, the model is calibrated against the available data characterizing the spatial and temporal distribution of the observed geochemical types. Numerous different geochemical constituents and processes may need to be simulated in these models which further complicates the analyses. In this paper, we propose a new contaminant source identification approach that performs decomposition of the observation mixtures based on Non-negative Matrix Factorization (NMF) method for Blind Source Separation (BSS), coupled with a custom semi-supervised clustering algorithm. Our methodology, called NMFk, is capable of identifying (a) the unknown number of groundwater types and (b) the original geochemical concentration of the contaminant sources from measured geochemical mixtures with unknown mixing ratios without any additional site information. NMFk is tested on synthetic and real-world site data. The NMFk algorithm works with geochemical data represented in the form of concentrations, ratios (of two constituents; for example, isotope ratios), and delta notations (standard normalized stable isotope ratios).


Subject(s)
Groundwater/chemistry , Supervised Machine Learning , Water Pollutants, Chemical/chemistry , Environmental Monitoring/methods , Isotopes/analysis
16.
Sci Rep ; 7(1): 9731, 2017 08 29.
Article in English | MEDLINE | ID: mdl-28851939

ABSTRACT

The innate flexibility of a DNA sequence is quantified by the Jacobson-Stockmayer's J-factor, which measures the propensity for DNA loop formation. Recent studies of ultra-short DNA sequences revealed a discrepancy of up to six orders of magnitude between experimentally measured and theoretically predicted J-factors. These large differences suggest that, in addition to the elastic moduli of the double helix, other factors contribute to loop formation. Here, we develop a new theoretical model that explores how coherent delocalized phonon-like modes in DNA provide single-stranded "flexible hinges" to assist in loop formation. We combine the Czapla-Swigon-Olson structural model of DNA with our extended Peyrard-Bishop-Dauxois model and, without changing any of the parameters of the two models, apply this new computational framework to 86 experimentally characterized DNA sequences. Our results demonstrate that the new computational framework can predict J-factors within an order of magnitude of experimental measurements for most ultra-short DNA sequences, while continuing to accurately describe the J-factors of longer sequences. Further, we demonstrate that our computational framework can be used to describe the cyclization of DNA sequences that contain a base pair mismatch. Overall, our results support the conclusion that coherent delocalized phonon-like modes play an important role in DNA cyclization.


Subject(s)
DNA/chemistry , Models, Chemical , Nucleic Acid Conformation , Phonons , Algorithms , Base Pairing , Base Sequence , Cyclization
17.
PLoS One ; 11(11): e0165899, 2016.
Article in English | MEDLINE | ID: mdl-27824914

ABSTRACT

We present a dynamical model of drug accumulation in bacteria. The model captures key features in experimental time courses on ofloxacin accumulation: initial uptake; two-phase response; and long-term acclimation. In combination with experimental data, the model provides estimates of import and export rates in each phase, the time of entry into the second phase, and the decrease of internal drug during acclimation. Global sensitivity analysis, local sensitivity analysis, and Bayesian sensitivity analysis of the model provide information about the robustness of these estimates, and about the relative importance of different parameters in determining the features of the accumulation time courses in three different bacterial species: Escherichia coli, Staphylococcus aureus, and Pseudomonas aeruginosa. The results lead to experimentally testable predictions of the effects of membrane permeability, drug efflux and trapping (e.g., by DNA binding) on drug accumulation. A key prediction is that a sudden increase in ofloxacin accumulation in both E. coli and S. aureus is accompanied by a decrease in membrane permeability.


Subject(s)
Anti-Bacterial Agents/pharmacokinetics , Bacteria/metabolism , Models, Biological , Bacteria/drug effects , Bayes Theorem , Cell Membrane Permeability , Escherichia coli/metabolism , Pseudomonas aeruginosa/metabolism , Staphylococcus aureus/metabolism
18.
BMC Bioinformatics ; 17: 68, 2016 Feb 04.
Article in English | MEDLINE | ID: mdl-26846597

ABSTRACT

BACKGROUND: The intrinsic bendability of DNA plays an important role with relevance for myriad of essential cellular mechanisms. The flexibility of a DNA fragment can be experimentally and computationally examined by its propensity for cyclization, quantified by the Jacobson-Stockmayer J factor. In this study, we use a well-established coarse-grained three-dimensional model of DNA and seven distinct sets of experimentally and computationally derived conformational parameters of the double helix to evaluate the role of structural parameters in calculating DNA cyclization. RESULTS: We calculate the cyclization rates of 86 DNA sequences with previously measured J factors and lengths between 57 and 325 bp as well as of 20,000 randomly generated DNA sequences with lengths between 350 and 4000 bp. Our comparison with experimental data is complemented with analysis of simulated data. CONCLUSIONS: Our data demonstrate that all sets of parameters yield very similar results for longer DNA fragments, regardless of the nucleotide sequence, which are in agreement with experimental measurements. However, for DNA fragments shorter than 100 bp, all sets of parameters performed poorly yielding results with several orders of magnitude difference from the experimental measurements. Our data show that DNA cyclization rates calculated using conformational parameters based on nucleosome packaging data are most similar to the experimental measurements. Overall, our study provides a comprehensive large-scale assessment of the role of structural parameters in calculating DNA cyclization rates.


Subject(s)
Biophysical Phenomena , DNA/chemistry , Nucleic Acid Conformation , Cyclization , Humans , Models, Molecular , Pliability
19.
Am J Hum Genet ; 95(6): 744-53, 2014 Dec 04.
Article in English | MEDLINE | ID: mdl-25434007

ABSTRACT

Schizophrenia (SZ) genome-wide association studies (GWASs) have identified common risk variants in >100 susceptibility loci; however, the contribution of rare variants at these loci remains largely unexplored. One of the strongly associated loci spans MIR137 (miR137) and MIR2682 (miR2682), two microRNA genes important for neuronal function. We sequenced ∼6.9 kb MIR137/MIR2682 and upstream regulatory sequences in 2,610 SZ cases and 2,611 controls of European ancestry. We identified 133 rare variants with minor allele frequency (MAF) <0.5%. The rare variant burden in promoters and enhancers, but not insulators, was associated with SZ (p = 0.021 for MAF < 0.5%, p = 0.003 for MAF < 0.1%). A rare enhancer SNP, 1:g.98515539A>T, presented exclusively in 11 SZ cases (nominal p = 4.8 × 10(-4)). We further identified its risk allele T in 2 of 2,434 additional SZ cases, 11 of 4,339 bipolar (BP) cases, and 3 of 3,572 SZ/BP study controls and 1,688 population controls; yielding combined p values of 0.0007, 0.0013, and 0.0001 for SZ, BP, and SZ/BP, respectively. The risk allele T of 1:g.98515539A>T reduced enhancer activity of its flanking sequence by >50% in human neuroblastoma cells, predicting lower expression of MIR137/MIR2682. Both empirical and computational analyses showed weaker transcription factor (YY1) binding by the risk allele. Chromatin conformation capture (3C) assay further indicated that 1:g.98515539A>T influenced MIR137/MIR2682, but not the nearby DPYD or LOC729987. Our results suggest that rare noncoding risk variants are associated with SZ and BP at MIR137/MIR2682 locus, with risk alleles decreasing MIR137/MIR2682 expression.


Subject(s)
Bipolar Disorder/genetics , Gene Expression Regulation/genetics , Genetic Variation , MicroRNAs/genetics , Schizophrenia/genetics , Alleles , Base Sequence , Cell Line, Tumor , Gene Frequency , Genes, Reporter , Genetic Loci , Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Molecular Sequence Data , Polymorphism, Single Nucleotide , Promoter Regions, Genetic/genetics , Risk , Sequence Analysis, DNA
20.
Sci Rep ; 3: 1184, 2013.
Article in English | MEDLINE | ID: mdl-23378916

ABSTRACT

We report that terahertz (THz) irradiation of mouse mesenchymal stem cells (mMSCs) with a single-frequency (SF) 2.52 THz laser or pulsed broadband (centered at 10 THz) source results in irradiation specific heterogenic changes in gene expression. The THz effect depends on irradiation parameters such as the duration and type of THz source, and on the degree of stem cell differentiation. Our microarray survey and RT-PCR experiments demonstrate that prolonged broadband THz irradiation drives mMSCs toward differentiation, while 2-hour irradiation (regardless of THz sources) affects genes transcriptionally active in pluripotent stem cells. The strictly controlled experimental environment indicates minimal temperature changes and the absence of any discernable response to heat shock and cellular stress genes imply a non-thermal response. Computer simulations of the core promoters of two pluripotency markers reveal association between gene upregulation and propensity for DNA breathing. We propose that THz radiation has potential for non-contact control of cellular gene expression.


Subject(s)
Gene Expression Regulation/radiation effects , Mesenchymal Stem Cells/radiation effects , Terahertz Radiation , Animals , Cell Differentiation , Mesenchymal Stem Cells/cytology , Mesenchymal Stem Cells/metabolism , Mice , Oligonucleotide Array Sequence Analysis , Reverse Transcriptase Polymerase Chain Reaction , Time Factors
SELECTION OF CITATIONS
SEARCH DETAIL
...