RESUMO
Ammonia, which is toxic to the brain, is converted into non-toxic urea, through a pathway of six enzymatically catalyzed steps known as the urea cycle. In this pathway, N-acetylglutamate synthase (NAGS, EC 2.3.1.1) catalyzes the formation of N-acetylglutamate (NAG) from glutamate and acetyl coenzyme A. NAGS deficiency (NAGSD) is the rarest of the urea cycle disorders, yet is unique in that ureagenesis can be restored with the drug N-carbamylglutamate (NCG). We investigated whether the rarity of NAGSD could be due to low sequence variation in the NAGS genomic region, high NAGS tolerance for amino acid replacements, and alternative sources of NAG and NCG in the body. We also evaluated whether the small genomic footprint of the NAGS catalytic domain might play a role. The small number of patients diagnosed with NAGSD could result from the absence of specific disease biomarkers and/or short NAGS catalytic domain. We screened for sequence variants in NAGS regulatory regions in patients suspected of having NAGSD and found a novel NAGS regulatory element in the first intron of the NAGS gene. We applied the same datamining approach to identify regulatory elements in the remaining urea cycle genes. In addition to the known promoters and enhancers of each gene, we identified several novel regulatory elements in their upstream regions and first introns. The identification of cis-regulatory elements of urea cycle genes and their associated transcription factors holds promise for uncovering shared mechanisms governing urea cycle gene expression and potentially leading to new treatments for urea cycle disorders.
RESUMO
High-throughput genetic screening based on CRISPR/Cas9 or RNA-interference (RNAi) enables the exploration of genes associated with the phenotype of interest on a large scale. The rapid accumulation of public available genetic screening data provides a wealth of knowledge about genotype-to-phenotype relationships and a valuable resource for the systematic analysis of gene functions. Here we present CRISP-view, a comprehensive database of CRISPR/Cas9 and RNAi screening datasets that span multiple phenotypes, including in vitro and in vivo cell proliferation and viability, response to cancer immunotherapy, virus response, protein expression, etc. By 22 September 2020, CRISP-view has collected 10 321 human samples and 825 mouse samples from 167 papers. All the datasets have been curated, annotated, and processed by a standard MAGeCK-VISPR analysis pipeline with quality control (QC) metrics. We also developed a user-friendly webserver to visualize, explore, and search these datasets. The webserver is freely available at http://crispview.weililab.org.
Assuntos
Sistemas CRISPR-Cas/genética , Bases de Dados Genéticas , Testes Genéticos , Metadados , Anotação de Sequência Molecular , Fenótipo , Interface Usuário-ComputadorRESUMO
Introduction: Human endogenous retroviruses (HERVs) are transcriptionally-active remnants of ancient retroviral infections that may play a role in Alzheimer's disease. Methods: We combined two, publicly available RNA-Seq datasets with a third, novel dataset for a total cohort of 103 patients with Alzheimer's disease and 45 healthy controls. We use telescope to perform HERV quantification for these samples and simultaneously perform gene expression analysis. Results: We identify differentially expressed genes and differentially expressed HERVs in Alzheimer's disease patients. Differentially expressed HERVs are scattered throughout the genome; many of them are members of the HERV-K superfamily. A number of HERVs are correlated with the expression of dysregulated genes in Alzheimer's and are physically proximal to genes which drive disease pathways. Discussion: Dysregulated expression of ancient retroviral insertions in the human genome are present in Alzheimer's disease and show localization patterns that may explain how these elements drive pathogenic gene expression.
RESUMO
Transposable elements (TE) are typically silenced by DNA methylation and repressive histone modifications in differentiated healthy human tissues. However, TE expression increases in a wide range of cancers and is correlated with global hypomethylation of cancer genomes. We assessed expression and DNA methylation of TEs in fibroblast cells that were serially transduced with hTERT, SV40, and HRASR24C to immortalize and then transform them, modeling the different steps of the tumorigenesis process. RNA sequencing and whole-genome bisulfite sequencing were performed at each stage of transformation. TE expression significantly increased as cells progressed through transformation, with the largest increase in expression after the final stage of transformation, consistent with data from human tumors. The upregulated TEs were dominated by endogenous retroviruses [long terminal repeats (LTR)]. Most differentially methylated regions (DMR) in all stages were hypomethylated, with the greatest hypomethylation in the final stage of transformation. A majority of the DMRs overlapped TEs from the RepeatMasker database, indicating that TEs are preferentially demethylated. Many hypomethylated TEs displayed a concordant increase in expression. Demethylation began during immortalization and continued into transformation, while upregulation of TE transcription occurred in transformation. Numerous LTR elements upregulated in the model were also identified in The Cancer Genome Atlas datasets of breast, colon, and prostate cancer. Overall, these findings indicate that TEs, specifically endogenous retroviruses, are demethylated and transcribed during transformation. SIGNIFICANCE: Analysis of epigenetic and transcriptional changes in a transformation model reveals that transposable element expression and methylation are dysregulated during oncogenic transformation.
Assuntos
Metilação de DNA , Neoplasias , Humanos , Elementos de DNA Transponíveis/genética , Ativação Transcricional , Análise de Sequência de RNA , Neoplasias/genéticaRESUMO
Fecal microbiota transplantation (FMT) is a promising therapeutic modality for the treatment and prevention of metabolic disease. We previously conducted a double-blind, randomized, placebo-controlled pilot trial of FMT in obese metabolically healthy patients in which we found that FMT enhanced gut bacterial bile acid metabolism and delayed the development of impaired glucose tolerance relative to the placebo control group. Therefore, we conducted a secondary analysis of fecal samples collected from these patients to assess the potential gut microbial species contributing to the effect of FMT to improve metabolic health and increase gut bacterial bile acid metabolism. Fecal samples collected at baseline and after 4 weeks of FMT or placebo treatment underwent shotgun metagenomic analysis. Ultra-high-performance liquid chromatography-mass spectrometry was used to profile fecal bile acids. FMT-enriched bacteria that have been implicated in gut bile acid metabolism included Desulfovibrio fairfieldensis and Clostridium hylemonae. To identify candidate bacteria involved in gut microbial bile acid metabolism, we assessed correlations between bacterial species abundance and bile acid profile, with a focus on bile acid products of gut bacterial metabolism. Bacteroides ovatus and Phocaeicola dorei were positively correlated with unconjugated bile acids. Bifidobacterium adolescentis, Collinsella aerofaciens, and Faecalibacterium prausnitzii were positively correlated with secondary bile acids. Together, these data identify several candidate bacteria that may contribute to the metabolic benefits of FMT and gut bacterial bile acid metabolism that requires further functional validation.
Assuntos
Transplante de Microbiota Fecal , Microbioma Gastrointestinal , Humanos , Transplante de Microbiota Fecal/métodos , Fezes/microbiologia , Bactérias/genética , Ácidos e Sais Biliares/análiseRESUMO
SARS-CoV-2 (CoV) is the etiological agent of the COVID-19 pandemic and evolves to evade both host immune systems and intervention strategies. We divided the CoV genome into 29 constituent regions and applied novel analytical approaches to identify associations between CoV genomic features and epidemiological metadata. Our results show that nonstructural protein 3 (nsp3) and Spike protein (S) have the highest variation and greatest correlation with the viral whole-genome variation. S protein variation is correlated with nsp3, nsp6, and 3'-to-5' exonuclease variation. Country of origin and time since the start of the pandemic were the most influential metadata associated with genomic variation, while host sex and age were the least influential. We define a novel statistic-coherence-and show its utility in identifying geographic regions (populations) with unusually high (many new variants) or low (isolated) viral phylogenetic diversity. Interestingly, at both global and regional scales, we identify geographic locations with high coherence neighboring regions of low coherence; this emphasizes the utility of this metric to inform public health measures for disease spread. Our results provide a direction to prioritize genes associated with outcome predictors (e.g., health, therapeutic, and vaccine outcomes) and to improve DNA tests for predicting disease status.
Assuntos
Pandemias , SARS-CoV-2 , Genoma Viral , Humanos , MutaçãoRESUMO
Systemic lupus erythematosus (SLE) is a chronic autoimmune disease characterized by the production of autoantibodies predominantly to nuclear material. Many aspects of disease pathology are mediated by the deposition of nucleic acid containing immune complexes, which also induce the type 1interferon response, a characteristic feature of SLE. Notably, SLE is remarkably heterogeneous, with a variety of organs involved in different individuals, who also show variation in disease severity related to their ancestries. Here, we probed one potential contribution to disease heterogeneity as well as a possible source of immunoreactive nucleic acids by exploring the expression of human endogenous retroviruses (HERVs). We investigated the expression of HERVs in SLE and their potential relationship to SLE features and the expression of biochemical pathways, including the interferon gene signature (IGS). Towards this goal, we analyzed available and new RNA-Seq data from two independent whole blood studies using Telescope. We identified 481 locus specific HERV encoding regions that are differentially expressed between case and control individuals with only 14% overlap of differentially expressed HERVs between these two datasets. We identified significant differences between differentially expressed HERVs and non-differentially expressed HERVs between the two datasets. We also characterized the host differentially expressed genes and tested their association with the differentially expressed HERVs. We found that differentially expressed HERVs were significantly more physically proximal to host differentially expressed genes than non-differentially expressed HERVs. Finally, we capitalized on locus specific resolution of HERV mapping to identify key molecular pathways impacted by differential HERV expression in people with SLE.