Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
1.
Cell ; 186(22): 4834-4850.e23, 2023 10 26.
Artículo en Inglés | MEDLINE | ID: mdl-37794589

RESUMEN

Regulation of viral RNA biogenesis is fundamental to productive SARS-CoV-2 infection. To characterize host RNA-binding proteins (RBPs) involved in this process, we biochemically identified proteins bound to genomic and subgenomic SARS-CoV-2 RNAs. We find that the host protein SND1 binds the 5' end of negative-sense viral RNA and is required for SARS-CoV-2 RNA synthesis. SND1-depleted cells form smaller replication organelles and display diminished virus growth kinetics. We discover that NSP9, a viral RBP and direct SND1 interaction partner, is covalently linked to the 5' ends of positive- and negative-sense RNAs produced during infection. These linkages occur at replication-transcription initiation sites, consistent with NSP9 priming viral RNA synthesis. Mechanistically, SND1 remodels NSP9 occupancy and alters the covalent linkage of NSP9 to initiating nucleotides in viral RNA. Our findings implicate NSP9 in the initiation of SARS-CoV-2 RNA synthesis and unravel an unsuspected role of a cellular protein in orchestrating viral RNA production.


Asunto(s)
COVID-19 , ARN Viral , Humanos , COVID-19/metabolismo , Endonucleasas/metabolismo , ARN Viral/metabolismo , SARS-CoV-2/genética , Replicación Viral
2.
Genome Res ; 34(4): 572-589, 2024 May 15.
Artículo en Inglés | MEDLINE | ID: mdl-38719471

RESUMEN

Dormancy is a key feature of stem cell function in adult tissues as well as in embryonic cells in the context of diapause. The establishment of dormancy is an active process that involves extensive transcriptional, epigenetic, and metabolic rewiring. How these processes are coordinated to successfully transition cells to the resting dormant state remains unclear. Here we show that microRNA activity, which is otherwise dispensable for preimplantation development, is essential for the adaptation of early mouse embryos to the dormant state of diapause. In particular, the pluripotent epiblast depends on miRNA activity, the absence of which results in the loss of pluripotent cells. Through the integration of high-sensitivity small RNA expression profiling of individual embryos and protein expression of miRNA targets with public data of protein-protein interactions, we constructed the miRNA-mediated regulatory network of mouse early embryos specific to diapause. We find that individual miRNAs contribute to the combinatorial regulation by the network, and the perturbation of the network compromises embryo survival in diapause. We further identified the nutrient-sensitive transcription factor TFE3 as an upstream regulator of diapause-specific miRNAs, linking cytoplasmic MTOR activity to nuclear miRNA biogenesis. Our results place miRNAs as a critical regulatory layer for the molecular rewiring of early embryos to establish dormancy.


Asunto(s)
Proliferación Celular , MicroARNs , Células Madre Pluripotentes , Animales , MicroARNs/genética , MicroARNs/metabolismo , Ratones , Células Madre Pluripotentes/metabolismo , Células Madre Pluripotentes/citología , Regulación del Desarrollo de la Expresión Génica , Redes Reguladoras de Genes , Desarrollo Embrionario/genética , Estratos Germinativos/metabolismo , Estratos Germinativos/citología , Blastocisto/metabolismo , Blastocisto/citología , Femenino
3.
Brief Bioinform ; 24(5)2023 09 20.
Artículo en Inglés | MEDLINE | ID: mdl-37635383

RESUMEN

RNA-binding proteins (RBPs) are central actors of RNA post-transcriptional regulation. Experiments to profile-binding sites of RBPs in vivo are limited to transcripts expressed in the experimental cell type, creating the need for computational methods to infer missing binding information. While numerous machine-learning based methods have been developed for this task, their use of heterogeneous training and evaluation datasets across different sets of RBPs and CLIP-seq protocols makes a direct comparison of their performance difficult. Here, we compile a set of 37 machine learning (primarily deep learning) methods for in vivo RBP-RNA interaction prediction and systematically benchmark a subset of 11 representative methods across hundreds of CLIP-seq datasets and RBPs. Using homogenized sample pre-processing and two negative-class sample generation strategies, we evaluate methods in terms of predictive performance and assess the impact of neural network architectures and input modalities on model performance. We believe that this study will not only enable researchers to choose the optimal prediction method for their tasks at hand, but also aid method developers in developing novel, high-performing methods by introducing a standardized framework for their evaluation.


Asunto(s)
Benchmarking , Secuenciación de Inmunoprecipitación de Cromatina , Sitios de Unión , Aprendizaje Automático , ARN/genética
4.
PLoS Genet ; 18(4): e1010191, 2022 04.
Artículo en Inglés | MEDLINE | ID: mdl-35486646

RESUMEN

Whole genome sequencing is increasingly used to diagnose medical conditions of genetic origin. While both coding and non-coding DNA variants contribute to a wide range of diseases, most patients who receive a WGS-based diagnosis today harbour a protein-coding mutation. Functional interpretation and prioritization of non-coding variants represents a persistent challenge, and disease-causing non-coding variants remain largely unidentified. Depending on the disease, WGS fails to identify a candidate variant in 20-80% of patients, severely limiting the usefulness of sequencing for personalised medicine. Here we present FINSURF, a machine-learning approach to predict the functional impact of non-coding variants in regulatory regions. FINSURF outperforms state-of-the-art methods, owing in particular to optimized control variants selection during training. In addition to ranking candidate variants, FINSURF breaks down the score for each variant into contributions from individual annotations, facilitating the evaluation of their functional relevance. We applied FINSURF to a diverse set of 30 diseases with described causative non-coding mutations, and correctly identified the disease-causative non-coding variant within the ten top hits in 22 cases. FINSURF is implemented as an online server to as well as custom browser tracks, and provides a quick and efficient solution to prioritize candidate non-coding variants in realistic clinical settings.


Asunto(s)
Aprendizaje Automático , Programas Informáticos , Humanos , Mutación , Secuenciación Completa del Genoma
5.
Genome Biol ; 24(1): 180, 2023 08 04.
Artículo en Inglés | MEDLINE | ID: mdl-37542318

RESUMEN

We present RBPNet, a novel deep learning method, which predicts CLIP-seq crosslink count distribution from RNA sequence at single-nucleotide resolution. By training on up to a million regions, RBPNet achieves high generalization on eCLIP, iCLIP and miCLIP assays, outperforming state-of-the-art classifiers. RBPNet performs bias correction by modeling the raw signal as a mixture of the protein-specific and background signal. Through model interrogation via Integrated Gradients, RBPNet identifies predictive sub-sequences that correspond to known and novel binding motifs and enables variant-impact scoring via in silico mutagenesis. Together, RBPNet improves imputation of protein-RNA interactions, as well as mechanistic interpretation of predictions.


Asunto(s)
Secuencia de Bases , Simulación por Computador , Aprendizaje Profundo , Proteínas de Unión al ARN , ARN , Humanos , Alelos , Sesgo , Sitios de Unión , Secuencia de Consenso , Conjuntos de Datos como Asunto , Internet , Mutación , Motivos de Nucleótidos , Nucleótidos/metabolismo , ARN/química , ARN/genética , ARN/metabolismo , Sitios de Empalme de ARN , ARN Mensajero/química , ARN Mensajero/genética , ARN Mensajero/metabolismo , ARN Viral/química , ARN Viral/genética , ARN Viral/metabolismo , Proteínas de Unión al ARN/química , Proteínas de Unión al ARN/metabolismo
6.
NAR Genom Bioinform ; 5(1): lqad010, 2023 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-36814457

RESUMEN

RNA-binding proteins (RBPs) are critical host factors for viral infection, however, large scale experimental investigation of the binding landscape of human RBPs to viral RNAs is costly and further complicated due to sequence variation between viral strains. To fill this gap, we investigated the role of RBPs in the context of SARS-CoV-2 by constructing the first in silico map of human RBP-viral RNA interactions at nucleotide-resolution using two deep learning methods (pysster and DeepRiPe) trained on data from CLIP-seq experiments on more than 100 human RBPs. We evaluated conservation of RBP binding between six other human pathogenic coronaviruses and identified sites of conserved and differential binding in the UTRs of SARS-CoV-1, SARS-CoV-2 and MERS. We scored the impact of mutations from 11 variants of concern on protein-RNA interaction, identifying a set of gain- and loss-of-binding events, as well as predicted the regulatory impact of putative future mutations. Lastly, we linked RBPs to functional, OMICs and COVID-19 patient data from other studies, and identified MBNL1, FTO and FXR2 RBPs as potential clinical biomarkers. Our results contribute towards a deeper understanding of how viruses hijack host cellular pathways and open new avenues for therapeutic intervention.

7.
Front Genet ; 13: 909714, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35903362

RESUMEN

COVID-19 is a heterogeneous disease caused by SARS-CoV-2. Aside from infections of the lungs, the disease can spread throughout the body and damage many other tissues, leading to multiorgan failure in severe cases. The highly variable symptom severity is influenced by genetic predispositions and preexisting diseases which have not been investigated in a large-scale multimodal manner. We present a holistic analysis framework, setting previously reported COVID-19 genes in context with prepandemic data, such as gene expression patterns across multiple tissues, polygenetic predispositions, and patient diseases, which are putative comorbidities of COVID-19. First, we generate a multimodal network using the prior-based network inference method KiMONo. We then embed the network to generate a meaningful lower-dimensional representation of the data. The input data are obtained via the Genotype-Tissue Expression project (GTEx), containing expression data from a range of tissues with genomic and phenotypic information of over 900 patients and 50 tissues. The generated network consists of nodes, that is, genes and polygenic risk scores (PRS) for several diseases/phenotypes, as well as for COVID-19 severity and hospitalization, and links between them if they are statistically associated in a regularized linear model by feature selection. Applying network embedding on the generated multimodal network allows us to perform efficient network analysis by identifying nodes close by in a lower-dimensional space that correspond to entities which are statistically linked. By determining the similarity between COVID-19 genes and other nodes through embedding, we identify disease associations to tissues, like the brain and gut. We also find strong associations between COVID-19 genes and various diseases such as ischemic heart disease, cerebrovascular disease, and hypertension. Moreover, we find evidence linking PTPN6 to a range of comorbidities along with the genetic predisposition of COVID-19, suggesting that this kinase is a central player in severe cases of COVID-19. In conclusion, our holistic network inference coupled with network embedding of multimodal data enables the contextualization of COVID-19-associated genes with respect to tissues, disease states, and genetic risk factors. Such contextualization can be exploited to further elucidate the biological importance of known and novel genes for severity of the disease in patients.

8.
Virulence ; 13(1): 2042-2058, 2022 12.
Artículo en Inglés | MEDLINE | ID: mdl-36411449

RESUMEN

Legionella pneumophila (L.p.) is a bacterial pathogen which is a common causative agent of pneumonia. In humans, it infects alveolar macrophages and transfers hundreds of virulence factors that interfere with cellular signalling pathways and the transcriptomic landscape to sustain its own replication. By this interaction, it has acquired eukaryote-like protein motifs by gene transfer events that partake in the pathogenicity of Legionella. In a computational screening approach for eukaryotic motifs in the transcriptome of Legionella, we identified the L.p. strain Corby protein ABQ55614 as putative histone-deacetylase and named it "suppressing modifier of histones 1" (Smh1). During infection, Smh1 is translocated from the Legionella vacuole into the host cytosol. When expressed in human macrophage THP-1 cells, Smh1 was localized predominantly in the nucleus, leading to broad histone H3 and H4 deacetylation, blunted expression of a large number of genes (e.g. IL-1ß and IL-8), and fostered intracellular bacterial replication. L.p. with a Smh1 knockdown grew normally in media but showed a slight growth defect inside the host cell. Furthermore, Smh1 showed a very potent histone deacetylation activity in vitro, e.g. at H3K14, that could be inhibited by targeted mutation of the putative catalytic center inferred by analogy with eukaryotic HDAC8, and with the deacetylase inhibitor trichostatin A. In summary, Smh1 displays functional homology with class I/II type HDACs. We identified Smh1 as a new Legionella virulence factor with a eukaryote-like histone-deacetylase activity that moderates host gene expression and might pave the way for further histone modifications.IMPORTANCELegionella pneumophila (L.p.) is a prominent bacterial pathogen, which is a common causative agent of pneumonia. In order to survive inside the host cell, the human macrophage, it profoundly interacts with host cell processes to advance its own replication. In this study, we identify a bacterial factor, Smh1, with yet unknown function as a host histone deacetylase. The activity of this factor in the host cell leads to attenuated gene expression and increased intracellular bacterial replication.


Asunto(s)
Eucariontes , Legionella pneumophila , Humanos , Histonas/genética , Legionella pneumophila/genética , Células Eucariotas , Investigación , Factores de Virulencia/genética , Histona Desacetilasas , Proteínas Represoras
9.
PLoS One ; 16(8): e0256181, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34388204

RESUMEN

Identifying causative variants in cis-regulatory elements (CRE) in neurodevelopmental disorders has proven challenging. We have used in vivo functional analyses to categorize rigorously filtered CRE variants in a clinical cohort that is plausibly enriched for causative CRE mutations: 48 unrelated males with a family history consistent with X-linked intellectual disability (XLID) in whom no detectable cause could be identified in the coding regions of the X chromosome (chrX). Targeted sequencing of all chrX CRE identified six rare variants in five affected individuals that altered conserved bases in CRE targeting known XLID genes and segregated appropriately in families. Two of these variants, FMR1CRE and TENM1CRE, showed consistent site- and stage-specific differences of enhancer function in the developing zebrafish brain using dual-color fluorescent reporter assay. Mouse models were created for both variants. In male mice Fmr1CRE induced alterations in neurodevelopmental Fmr1 expression, olfactory behavior and neurophysiological indicators of FMRP function. The absence of another likely causative variant on whole genome sequencing further supported FMR1CRE as the likely basis of the XLID in this family. Tenm1CRE mice showed no phenotypic anomalies. Following the release of gnomAD 2.1, reanalysis showed that TENM1CRE exceeded the maximum plausible population frequency of a XLID causative allele. Assigning causative status to any ultra-rare CRE variant remains problematic and requires disease-relevant in vivo functional data from multiple sources. The sequential and bespoke nature of such analyses renders them time-consuming and challenging to scale for routine clinical use.


Asunto(s)
Proteína de la Discapacidad Intelectual del Síndrome del Cromosoma X Frágil/genética , Genes Ligados a X , Genoma Humano , Discapacidad Intelectual Ligada al Cromosoma X/genética , Proteínas del Tejido Nervioso/genética , Elementos Reguladores de la Transcripción , Tenascina/genética , Animales , Animales Modificados Genéticamente , Encéfalo/metabolismo , Encéfalo/patología , Mapeo Cromosómico , Estudios de Cohortes , Modelos Animales de Enfermedad , Embrión no Mamífero , Exoma , Proteína de la Discapacidad Intelectual del Síndrome del Cromosoma X Frágil/metabolismo , Frecuencia de los Genes , Genotipo , Humanos , Masculino , Discapacidad Intelectual Ligada al Cromosoma X/metabolismo , Discapacidad Intelectual Ligada al Cromosoma X/patología , Ratones , Proteínas del Tejido Nervioso/deficiencia , Linaje , Fenotipo , Tenascina/deficiencia , Pez Cebra
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA