Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 12 de 12
Filter
1.
Nucleic Acids Res ; 51(19): 10147-10161, 2023 10 27.
Article in English | MEDLINE | ID: mdl-37738140

ABSTRACT

CRISPR-Cas9 tools have transformed genetic manipulation capabilities in the laboratory. Empirical rules-of-thumb have been developed for only a narrow range of model organisms, and mechanistic underpinnings for sgRNA efficiency remain poorly understood. This work establishes a novel feature set and new public resource, produced with quantum chemical tensors, for interpreting and predicting sgRNA efficiency. Feature engineering for sgRNA efficiency is performed using an explainable-artificial intelligence model: iterative Random Forest (iRF). By encoding quantitative attributes of position-specific sequences for Escherichia coli sgRNAs, we identify important traits for sgRNA design in bacterial species. Additionally, we show that expanding positional encoding to quantum descriptors of base-pair, dimer, trimer, and tetramer sequences captures intricate interactions in local and neighboring nucleotides of the target DNA. These features highlight variation in CRISPR-Cas9 sgRNA dynamics between E. coli and H. sapiens genomes. These novel encodings of sgRNAs enhance our understanding of the elaborate quantum biological processes involved in CRISPR-Cas9 machinery.


Subject(s)
CRISPR-Cas Systems , RNA, Guide, CRISPR-Cas Systems , Artificial Intelligence , DNA , Escherichia coli/genetics , Gene Editing , Humans
2.
Front Psychiatry ; 14: 1178633, 2023.
Article in English | MEDLINE | ID: mdl-37599888

ABSTRACT

Introduction: Despite a recent global decrease in suicide rates, death by suicide has increased in the United States. It is therefore imperative to identify the risk factors associated with suicide attempts to combat this growing epidemic. In this study, we aim to identify potential risk factors of suicide attempt using geospatial features in an Artificial intelligence framework. Methods: We use iterative Random Forest, an explainable artificial intelligence method, to predict suicide attempts using data from the Million Veteran Program. This cohort incorporated 405,540 patients with 391,409 controls and 14,131 attempts. Our predictive model incorporates multiple climatic features at ZIP-code-level geospatial resolution. We additionally consider demographic features from the American Community Survey as well as the number of firearms and alcohol vendors per 10,000 people to assess the contributions of proximal environment, access to means, and restraint decrease to suicide attempts. In total 1,784 features were included in the predictive model. Results: Our results show that geographic areas with higher concentrations of married males living with spouses are predictive of lower rates of suicide attempts, whereas geographic areas where males are more likely to live alone and to rent housing are predictive of higher rates of suicide attempts. We also identified climatic features that were associated with suicide attempt risk by age group. Additionally, we observed that firearms and alcohol vendors were associated with increased risk for suicide attempts irrespective of the age group examined, but that their effects were small in comparison to the top features. Discussion: Taken together, our findings highlight the importance of social determinants and environmental factors in understanding suicide risk among veterans.

3.
Comput Struct Biotechnol J ; 20: 3372-3386, 2022.
Article in English | MEDLINE | ID: mdl-35832622

ABSTRACT

Gene-to-gene networks, such as Gene Regulatory Networks (GRN) and Predictive Expression Networks (PEN) capture relationships between genes and are beneficial for use in downstream biological analyses. There exists multiple network inference tools to produce these gene-to-gene networks from matrices of gene expression data. Random Forest-Leave One Out Prediction (RF-LOOP) is a method that has been shown to be efficient at producing these gene-to-gene networks, frequently known as GEne Network Inference with Ensemble of trees (GENIE3). Random Forest can be replaced in this process by iterative Random Forest (iRF), which performs variable selection and boosting. Here we validate that iterative Random Forest-Leave One Out Prediction (iRF-LOOP) produces higher quality networks than GENIE3 (RF-LOOP). We use both synthetic and empirical networks from the Dialogue for Reverse Engineering Assessment and Methods (DREAM) Challenges by Sage Bionetworks, as well as two additional empirical networks created from Arabidopsis thaliana and Populus trichocarpa expression data.

4.
Methods Mol Biol ; 2452: 317-351, 2022.
Article in English | MEDLINE | ID: mdl-35554915

ABSTRACT

The unprecedented scientific achievements in combating the COVID-19 pandemic reflect a global response informed by unprecedented access to data. We now have the ability to rapidly generate a diversity of information on an emerging pathogen and, by using high-performance computing and a systems biology approach, we can mine this wealth of information to understand the complexities of viral pathogenesis and contagion like never before. These efforts will aid in the development of vaccines, antiviral medications, and inform policymakers and clinicians. Here we detail computational protocols developed as SARS-CoV-2 began to spread across the globe. They include pathogen detection, comparative structural proteomics, evolutionary adaptation analysis via network and artificial intelligence methodologies, and multiomic integration. These protocols constitute a core framework on which to build a systems-level infrastructure that can be quickly brought to bear on future pathogens before they evolve into pandemic proportions.


Subject(s)
COVID-19 Drug Treatment , SARS-CoV-2 , Antiviral Agents/pharmacology , Antiviral Agents/therapeutic use , Artificial Intelligence , Humans , Pandemics/prevention & control , Systems Biology
5.
Sci Transl Med ; 14(630): eabj0324, 2022 02 02.
Article in English | MEDLINE | ID: mdl-35108061

ABSTRACT

Skin is composed of diverse cell populations that cooperatively maintain homeostasis. Up-regulation of the nuclear factor κB (NF-κB) pathway may lead to the development of chronic inflammatory disorders of the skin, but its role during the early events remains unclear. Through analysis of single-cell RNA sequencing data via iterative random forest leave one out prediction, an explainable artificial intelligence method, we identified an immunoregulatory role for a unique paired related homeobox-1 (Prx1)+ fibroblast subpopulation. Disruption of Ikkb-NF-κB under homeostatic conditions in these fibroblasts paradoxically induced skin inflammation due to the overexpression of C-C motif chemokine ligand 11 (CCL11; or eotaxin-1) characterized by eosinophil infiltration and a subsequent TH2 immune response. Because the inflammatory phenotype resembled that seen in human atopic dermatitis (AD), we examined human AD skin samples and found that human AD fibroblasts also overexpressed CCL11 and that perturbation of Ikkb-NF-κB in primary human dermal fibroblasts up-regulated CCL11. Monoclonal antibody treatment against CCL11 was effective in reducing the eosinophilia and TH2 inflammation in a mouse model. Together, the murine model and human AD specimens point to dysregulated Prx1+ fibroblasts as a previously unrecognized etiologic factor that may contribute to the pathogenesis of AD and suggest that targeting CCL11 may be a way to treat AD-like skin lesions.


Subject(s)
Dermatitis, Atopic , Animals , Artificial Intelligence , Dermatitis, Atopic/pathology , Fibroblasts/pathology , Immunity , Mice , NF-kappa B/metabolism , Skin/pathology
6.
Lancet Healthy Longev ; 2(10): e639-e650, 2021 10.
Article in English | MEDLINE | ID: mdl-34870254

ABSTRACT

BACKGROUND: Polypharmacy, defined as use of five or more medications concurrently, is associated with adverse health outcomes and people ageing with HIV might be at greater risk than similar uninfected individuals. We aimed to determine whether known pairwise drug interactions (KPDIs) were associated with risk of admission to hospital (hereafter referred to as hospitalisation) and medication count among people ageing with and without HIV after accounting for physiological frailty. METHODS: In this observational study, we collected individual-level data for participants of the Veterans Aging Cohort Study (VACS) with HIV on antiretroviral therapy (ART) and with supressed HIV-1 RNA and people without HIV who were receiving at least one prescription medication, based on active medications in the 2009 fiscal year (ie, Oct 1, 2008, to Sept 30, 2009). We identified KPDIs among these patients by linking prescription fill and refill data with data from DrugBank (version 5.0.11). We collected data on all-cause mortality and hospitalisations between Oct 1, 2009, and March 31, 2019. We compared KPDI counts using random selection and actual patterns of use across medication counts from two to 12. We created a weighted KPDI Index on the basis of the average association of each KPDI with mortality among people ageing without HIV and used nested Cox models stratified by HIV status to estimate the association between medication count and hospitalisation, with incremental adjustments for demographics, physiological frailty, and KPDI Index. FINDINGS: We collected data for 9186 people ageing with HIV and 37 930 individuals without HIV. 45 913 (97·4%) of 47 116 patients were men and the sample was predominantly aged 50-64 years (30 413 [64·6%]). Compared with a random sample of medications, real-world pattern of medication counts and combinations were associated with five-to-six times more KPDIs (eg, for a combination of six medications, KPDI count was 1·09 in the random sample, 5·49 in the HIV-negative population, and 7·13 in the HIV-positive population). For each additional observed medication, people ageing with HIV had approximately 2·94 additional KPDIs and comparators had approximately 2·67 additional KPDIs. Adjustment for demographics, physiological frailty, and KPDI Index reduced the association between medication count and risk of hospitalisation for people ageing with HIV (hazard ratio 1·08 [95% CI 1·07-1·09] reduced to 1·06 [1·05-1·07]) and those without HIV (1·08 [1·07-1·08] reduced to 1·04 [1·03-1·05]). INTERPRETATION: For each additional medication, people ageing with HIV have more drug-drug interactions than those without HIV. Adjusting for known non-ART drug-drug interactions, each additional non-ART medication confers excess risk of hospitalisation for people ageing with HIV. Randomised trials will be needed to determine whether reducing these interactions improves outcomes. FUNDING: National Institutes of Health, National Institute on Alcohol Abuse and Alcoholism, Department of Veterans Affairs Health Services Research & Development, and Office of Research and Development.


Subject(s)
Frailty , HIV Infections , HIV Seropositivity , Aging , Cohort Studies , Female , Hospitalization , Humans , Male , Polypharmacy
7.
Genome Biol ; 21(1): 304, 2020 12 23.
Article in English | MEDLINE | ID: mdl-33357233

ABSTRACT

BACKGROUND: A mechanistic understanding of the spread of SARS-CoV-2 and diligent tracking of ongoing mutagenesis are of key importance to plan robust strategies for confining its transmission. Large numbers of available sequences and their dates of transmission provide an unprecedented opportunity to analyze evolutionary adaptation in novel ways. Addition of high-resolution structural information can reveal the functional basis of these processes at the molecular level. Integrated systems biology-directed analyses of these data layers afford valuable insights to build a global understanding of the COVID-19 pandemic. RESULTS: Here we identify globally distributed haplotypes from 15,789 SARS-CoV-2 genomes and model their success based on their duration, dispersal, and frequency in the host population. Our models identify mutations that are likely compensatory adaptive changes that allowed for rapid expansion of the virus. Functional predictions from structural analyses indicate that, contrary to previous reports, the Asp614Gly mutation in the spike glycoprotein (S) likely reduced transmission and the subsequent Pro323Leu mutation in the RNA-dependent RNA polymerase led to the precipitous spread of the virus. Our model also suggests that two mutations in the nsp13 helicase allowed for the adaptation of the virus to the Pacific Northwest of the USA. Finally, our explainable artificial intelligence algorithm identified a mutational hotspot in the sequence of S that also displays a signature of positive selection and may have implications for tissue or cell-specific expression of the virus. CONCLUSIONS: These results provide valuable insights for the development of drugs and surveillance strategies to combat the current and future pandemics.


Subject(s)
Adaptation, Biological , Evolution, Molecular , Models, Genetic , SARS-CoV-2/genetics , Viral Proteins/genetics , Artificial Intelligence , Genome, Viral , Haplotypes , Mutation , Selection, Genetic
8.
J Neurophysiol ; 123(6): 2285-2296, 2020 06 01.
Article in English | MEDLINE | ID: mdl-32347157

ABSTRACT

This study quantified eight small-molecule neurotransmitters collected simultaneously from prefrontal cortex of C57BL/6J mice (n = 23) during wakefulness and during isoflurane anesthesia (1.3%). Using isoflurane anesthesia as an independent variable enabled evaluation of the hypothesis that isoflurane anesthesia differentially alters concentrations of multiple neurotransmitters and their interactions. Machine learning was applied to reveal higher order interactions among neurotransmitters. Using a between-subjects design, microdialysis was performed during wakefulness and during anesthesia. Concentrations (nM) of acetylcholine, adenosine, dopamine, GABA, glutamate, histamine, norepinephrine, and serotonin in the dialysis samples are reported (means ± SD). Relative to wakefulness, acetylcholine concentration was lower during isoflurane anesthesia (1.254 ± 1.118 vs. 0.401 ± 0.134, P = 0.009), and concentrations of adenosine (29.456 ± 29.756 vs. 101.321 ± 38.603, P < 0.001), dopamine (0.0578 ± 0.0384 vs. 0.113 ± 0.084, P = 0.036), and norepinephrine (0.126 ± 0.080 vs. 0.219 ± 0.066, P = 0.010) were higher during anesthesia. Isoflurane reconfigured neurotransmitter interactions in prefrontal cortex, and the state of isoflurane anesthesia was reliably predicted by prefrontal cortex concentrations of adenosine, norepinephrine, and acetylcholine. A novel finding to emerge from machine learning analyses is that neurotransmitter concentration profiles in mouse prefrontal cortex undergo functional reconfiguration during isoflurane anesthesia. Adenosine, norepinephrine, and acetylcholine showed high feature importance, supporting the interpretation that interactions among these three transmitters may play a key role in modulating levels of cortical and behavioral arousal.NEW & NOTEWORTHY This study discovered that interactions between neurotransmitters in mouse prefrontal cortex were altered during isoflurane anesthesia relative to wakefulness. Machine learning further demonstrated that, relative to wakefulness, higher order interactions among neurotransmitters were disrupted during isoflurane administration. These findings extend to the neurochemical domain the concept that anesthetic-induced loss of wakefulness results from a disruption of neural network connectivity.


Subject(s)
Acetylcholine/metabolism , Adenosine/metabolism , Anesthesia , Anesthetics, Inhalation/pharmacology , Isoflurane/pharmacology , Machine Learning , Nerve Net , Norepinephrine/metabolism , Prefrontal Cortex , Unconsciousness/metabolism , Wakefulness/physiology , Animals , Male , Mice , Mice, Inbred C57BL , Microdialysis , Nerve Net/drug effects , Nerve Net/metabolism , Nerve Net/physiopathology , Prefrontal Cortex/drug effects , Prefrontal Cortex/metabolism , Prefrontal Cortex/physiopathology
9.
Curr Opin Biotechnol ; 61: 217-225, 2020 02.
Article in English | MEDLINE | ID: mdl-32086132

ABSTRACT

Human population growth and accelerated climate change necessitate agricultural improvements using designer crop ideotypes (idealized plants that can grow in niche environments). Diverse and highly skilled research groups must integrate efforts to bridge the gaps needed to achieve international goals toward sustainable agriculture. Given the scale of global agricultural needs and the breadth of multiple types of omics data needed to optimize these efforts, explainable artificial intelligence (AI with a decipherable decision making process that provides a meaningful explanation to humans) and exascale computing (computers that can perform 1018 floating-point operations per second, or exaflops) are crucial. Accurate phenotyping and daily-resolution climatype associations are equally important for refining ideotype production to specific environments at various levels of granularity. We review advances toward tackling technological hurdles to solve multiple United Nations Sustainable Development Goals and discuss a vision to overcome gaps between research and policy.


Subject(s)
Artificial Intelligence , Sustainable Development , Agriculture , Goals , Humans , United Nations
10.
Genes (Basel) ; 10(12)2019 12 02.
Article in English | MEDLINE | ID: mdl-31810264

ABSTRACT

As time progresses and technology improves, biological data sets are continuously increasing in size. New methods and new implementations of existing methods are needed to keep pace with this increase. In this paper, we present a high-performance computing (HPC)-capable implementation of Iterative Random Forest (iRF). This new implementation enables the explainable-AI eQTL analysis of SNP sets with over a million SNPs. Using this implementation, we also present a new method, iRF Leave One Out Prediction (iRF-LOOP), for the creation of Predictive Expression Networks on the order of 40,000 genes or more. We compare the new implementation of iRF with the previous R version and analyze its time to completion on two of the world's fastest supercomputers, Summit and Titan. We also show iRF-LOOP's ability to capture biologically significant results when creating Predictive Expression Networks. This new implementation of iRF will enable the analysis of biological data sets at scales that were previously not possible.


Subject(s)
Algorithms , Computer Simulation , Models, Genetic , Quantitative Trait Loci , Computational Biology
11.
Front Plant Sci ; 10: 1249, 2019.
Article in English | MEDLINE | ID: mdl-31649710

ABSTRACT

Understanding the regulatory network controlling cell wall biosynthesis is of great interest in Populus trichocarpa, both because of its status as a model woody perennial and its importance for lignocellulosic products. We searched for genes with putatively unknown roles in regulating cell wall biosynthesis using an extended network-based Lines of Evidence (LOE) pipeline to combine multiple omics data sets in P. trichocarpa, including gene coexpression, gene comethylation, population level pairwise SNP correlations, and two distinct SNP-metabolite Genome Wide Association Study (GWAS) layers. By incorporating validation, ranking, and filtering approaches we produced a list of nine high priority gene candidates for involvement in the regulation of cell wall biosynthesis. We subsequently performed a detailed investigation of candidate gene GROWTH-REGULATING FACTOR 9 (PtGRF9). To investigate the role of PtGRF9 in regulating cell wall biosynthesis, we assessed the genome-wide connections of PtGRF9 and a paralog across data layers with functional enrichment analyses, predictive transcription factor binding site analysis, and an independent comparison to eQTN data. Our findings indicate that PtGRF9 likely affects the cell wall by directly repressing genes involved in cell wall biosynthesis, such as PtCCoAOMT and PtMYB.41, and indirectly by regulating homeobox genes. Furthermore, evidence suggests that PtGRF9 paralogs may act as transcriptional co-regulators that direct the global energy usage of the plant. Using our extended pipeline, we show multiple lines of evidence implicating the involvement of these genes in cell wall regulatory functions and demonstrate the value of this method for prioritizing candidate genes for experimental validation.

12.
Trends Biotechnol ; 37(11): 1217-1235, 2019 11.
Article in English | MEDLINE | ID: mdl-31235329

ABSTRACT

Breeding crops for high yield and superior adaptability to new and variable climates is imperative to ensure continued food security, biomass production, and ecosystem services. Advances in genomics and phenomics are delivering insights into the complex biological mechanisms that underlie plant functions in response to environmental perturbations. However, linking genotype to phenotype remains a huge challenge and is hampering the optimal application of high-throughput genomics and phenomics to advanced breeding. Critical to success is the need to assimilate large amounts of data into biologically meaningful interpretations. Here, we present the current state of genomics and field phenomics, explore emerging approaches and challenges for multiomics big data integration by means of next-generation (Next-Gen) artificial intelligence (AI), and propose a workable path to improvement.


Subject(s)
Crops, Agricultural/genetics , Plant Breeding/methods , Artificial Intelligence , Biomass , Climate , Climate Change , Ecosystem , Genomics/methods , Genotype , Humans , Phenomics/methods , Phenotype
SELECTION OF CITATIONS
SEARCH DETAIL
...