Search | Secretaria de Estado da Saúde

1.

Use of Elasticsearch-based business intelligence tools for integration and visualization of biological data.

Scott-Boyer, Marie-Pier; Dufour, Pascal; Belleau, François; Ongaro-Carcy, Regis; Plessis, Clément; Périn, Olivier; Droit, Arnaud.

Brief Bioinform ; 24(6)2023 09 22.

Article in English | MEDLINE | ID: mdl-37798252

ABSTRACT

The emergence of massive datasets exploring the multiple levels of molecular biology has made their analysis and knowledge transfer more complex. Flexible tools to manage big biological datasets could be of great help for standardizing the usage of developed data visualizations and integration methods. Business intelligence (BI) tools have been used in many fields as exploratory tools. They have numerous connectors to link many data repositories with a unified graphic interface, offering an overview of data and facilitating interpretation for decision makers. BI tools could be a flexible and user-friendly way of handling molecular biological data with interactive visualizations. However, it is rather uncommon to see such tools used for the exploration of massive and complex datasets in biological fields. We believe that two main obstacles could be the reason. Firstly, we posit that the ways of importing data into BI tools are not compatible with biological databases. Secondly, BI tools may not be adapted to certain particularities of complex biological data, namely, the size, the variability of datasets and the availability of specialized visualizations. This paper highlights the use of five BI tools (Elastic Kibana, Siren Investigate, Microsoft Power BI, Salesforce Tableau and Apache Superset) with which the massive data management repository engine Elasticsearch is compatible. Four case studies are discussed in which these BI tools were applied to biological datasets with different characteristics. We conclude that the performance of the tools depends on the complexity of the biological questions and the size of the datasets.

Subjects

Datasets as Topic, Software, Data Visualization

2.

Interpretation of network-based integration from multi-omics longitudinal data.

Bodein, Antoine; Scott-Boyer, Marie-Pier; Perin, Olivier; Lê Cao, Kim-Anh; Droit, Arnaud.

Nucleic Acids Res ; 50(5): e27, 2022 03 21.

Article in English | MEDLINE | ID: mdl-34883510

ABSTRACT

Multi-omics integration is key to fully understanding complex biological processes in a holistic manner. Furthermore, multi-omics combined with new longitudinal experimental designs can reveal dynamic relationships between omics layers and identify key players or interactions in system development or complex phenotypes. However, integration methods have to address various experimental designs and do not guarantee interpretable biological results. The new challenge of multi-omics integration is to solve interpretation and unlock the hidden knowledge within the multi-omics data. In this paper, we go beyond integration and propose a generic approach to face the interpretation problem. From multi-omics longitudinal data, this approach builds and explores hybrid multi-omics networks composed of both inferred and known relationships within and between omics layers. With smart node labelling and propagation analysis, this approach predicts regulation mechanisms and multi-omics functional modules. We applied the method to three case studies with various multi-omics designs and identified new multi-layer interactions involved in key biological functions that could not be revealed with single omics analysis. Moreover, we highlighted interplay in the kinetics that could help identify novel biological mechanisms. This method is available as an R package, netOmics, to readily suit any application.

Subjects

Genomics, Systems Biology/methods, Genomics/methods, Phenotype

3.

A Machine Learning Approach to Identify Key Residues Involved in Protein-Protein Interactions Exemplified with SARS-CoV-2 Variants.

Quitté, Léopold; Leclercq, Mickael; Prunier, Julien; Scott-Boyer, Marie-Pier; Moroy, Gautier; Droit, Arnaud.

Int J Mol Sci ; 25(12)2024 Jun 13.

Article in English | MEDLINE | ID: mdl-38928241

ABSTRACT

Human infection with the coronavirus disease 2019 (COVID-19) is mediated by the binding of the spike protein of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) to the human angiotensin-converting enzyme 2 (ACE2). The frequent mutations in the receptor-binding domain (RBD) of the spike protein induced the emergence of variants with increased contagion and can hinder vaccine efficiency. Hence, it is crucial to better understand the binding mechanisms of variant RBDs to human ACE2 and develop efficient methods to characterize this interaction. In this work, we present an approach that uses machine learning to analyze the molecular dynamics simulations of RBD variant trajectories bound to ACE2. Along with the binding free energy calculation, this method was used to characterize the major differences in ACE2-binding capacity of three SARS-CoV-2 RBD variants, namely the original Wuhan strain, Omicron BA.1, and the more recent Omicron BA.5 sublineages. Our analyses assessed the differences in binding free energy and shed light on how it affects the infectious rates of different variants. Furthermore, this approach successfully characterized key binding interactions and could be deployed as an efficient tool to predict different binding inhibitors to pave the way for new preventive and therapeutic strategies.

Subjects

Angiotensin-Converting Enzyme 2, COVID-19, Machine Learning, Molecular Dynamics Simulation, Protein Binding, SARS-CoV-2, Coronavirus Spike Glycoprotein, SARS-CoV-2/metabolism, SARS-CoV-2/genetics, Angiotensin-Converting Enzyme 2/metabolism, Angiotensin-Converting Enzyme 2/chemistry, Humans, Coronavirus Spike Glycoprotein/metabolism, Coronavirus Spike Glycoprotein/chemistry, Coronavirus Spike Glycoprotein/genetics, COVID-19/virology, COVID-19/metabolism, Binding Sites, Mutation, Protein Interaction Domains and Motifs

4.

timeOmics: an R package for longitudinal multi-omics data integration.

Bodein, Antoine; Scott-Boyer, Marie-Pier; Perin, Olivier; Lê Cao, Kim-Anh; Droit, Arnaud.

Bioinformatics ; 38(2): 577-579, 2022 01 03.

Article in English | MEDLINE | ID: mdl-34554215

ABSTRACT

MOTIVATION: Multi-omics data integration enables the global analysis of biological systems and discovery of new biological insights. Multi-omics experimental designs have been further extended with a longitudinal dimension to study dynamic relationships between molecules. However, methods that integrate longitudinal multi-omics data are still in their infancy. RESULTS: We introduce the R package timeOmics, a generic analytical framework for the integration of longitudinal multi-omics data. The framework includes pre-processing, modeling and clustering to identify molecular features strongly associated with time. We illustrate this framework in a case study to detect seasonal patterns of mRNA, metabolites, gut taxa and clinical variables in patients with diabetes mellitus from the integrative Human Microbiome Project. AVAILABILITY AND IMPLEMENTATION: timeOmics is available on Bioconductor and github.com/abodein/timeOmics. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subjects

Genomics, Multiomics, Humans, Genomics/methods, Cluster Analysis

5.

KibioR & Kibio: a new architecture for next-generation data querying and sharing in big biology.

Ongaro-Carcy, Régis; Scott-Boyer, Marie-Pier; Dessemond, Adrien; Belleau, François; Leclercq, Mickael; Périn, Olivier; Droit, Arnaud.

Bioinformatics ; 37(17): 2706-2713, 2021 Sep 09.

Article in English | MEDLINE | ID: mdl-33751043

ABSTRACT

MOTIVATION: The growing production of massive heterogeneous biological data offers opportunities for new discoveries. However, performing multi-omics data analysis is challenging, and researchers are forced to handle the ever-increasing complexity of both data management and evolution of our biological understanding. Substantial efforts have been made to unify biological datasets into integrated systems. Unfortunately, they are not easily scalable, deployable and searchable, locally or globally. RESULTS: This publication presents two tools with a simple structure that can help any data provider, organization or researcher requiring a reliable data search and analysis base. The first tool is Kibio, a scalable and adaptable data storage based on the Elasticsearch search engine. The second tool is KibioR, an R package to pull, push and search Kibio datasets or any accessible Elasticsearch-based databases. These tools apply a uniform data exchange model and minimize the burden of data management by organizing data into a decentralized, versatile, searchable and shareable structure. Several case studies are presented using multiple databases, from drug characterization to miRNAs and pathways identification, emphasizing the ease of use and versatility of the Kibio/KibioR framework. AVAILABILITY AND IMPLEMENTATION: Both KibioR and Elasticsearch are open source. KibioR package source is available at https://github.com/regisoc/kibior and the library on CRAN at https://cran.r-project.org/package=kibior. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

6.

New Developments and Possibilities in Reanalysis and Reinterpretation of Whole Exome Sequencing Datasets for Unsolved Rare Diseases Using Machine Learning Approaches.

Setty, Samarth Thonta; Scott-Boyer, Marie-Pier; Cuppens, Tania; Droit, Arnaud.

Int J Mol Sci ; 23(12)2022 Jun 18.

Article in English | MEDLINE | ID: mdl-35743235

ABSTRACT

Rare diseases impact the lives of 300 million people in the world. Rapid advances in bioinformatics and genomic technologies have enabled the discovery of causes of 20-30% of rare diseases. However, most rare diseases have remained as unsolved enigmas to date. Newer tools and availability of high throughput sequencing data have enabled the reanalysis of previously undiagnosed patients. In this review, we have systematically compiled the latest developments in the discovery of the genetic causes of rare diseases using machine learning methods. Importantly, we have detailed methods available to reanalyze existing whole exome sequencing data of unsolved rare diseases. We have identified different reanalysis methodologies to solve problems associated with sequence alterations/mutations, variation re-annotation, protein stability, splice isoform malfunctions and oligogenic analysis. In addition, we give an overview of new developments in the field of rare disease research using whole genome sequencing data and other omics.

Subjects

Exome, Rare Diseases, Exome/genetics, High-Throughput Nucleotide Sequencing, Humans, Machine Learning, Rare Diseases/diagnosis, Rare Diseases/genetics, Exome Sequencing/methods

7.

GWENA: gene co-expression networks analysis and extended modules characterization in a single Bioconductor package.

Lemoine, Gwenaëlle G; Scott-Boyer, Marie-Pier; Ambroise, Bathilde; Périn, Olivier; Droit, Arnaud.

BMC Bioinformatics ; 22(1): 267, 2021 May 25.

Article in English | MEDLINE | ID: mdl-34034647

ABSTRACT

BACKGROUND: Network-based analysis of gene expression through co-expression networks can be used to investigate modular relationships occurring between genes performing different biological functions. An extended description of each of the network modules is therefore a critical step to understand the underlying processes contributing to a disease or a phenotype. Biological integration, topology study and conditions comparison (e.g. wild vs mutant) are the main methods to do so, but to date no tool combines them all into a single pipeline. RESULTS: Here we present GWENA, a new R package that integrates gene co-expression network construction and whole characterization of the detected modules through gene set enrichment, phenotypic association, hub genes detection, topological metric computation, and differential co-expression. To demonstrate its performance, we applied GWENA to two skeletal muscle datasets from young and old patients of the GTEx study. Remarkably, we prioritized a gene whose involvement in muscle development and growth was unknown. Moreover, new insights on the variations in patterns of co-expression were identified. The known phenomenon of connectivity loss associated with aging was found coupled to a global reorganization of the relationships leading to expression of known aging-related functions. CONCLUSION: GWENA is an R package available through Bioconductor (https://bioconductor.org/packages/release/bioc/html/GWENA.html) that has been developed to perform extended analysis of gene co-expression networks. Thanks to biological and topological information as well as differential co-expression, the package helps to dissect the role of gene relationships in disease conditions or targeted phenotypes. GWENA goes beyond existing packages that perform co-expression analysis by including new tools to fully characterize modules, such as differential co-expression, additional enrichment databases, and network visualization.

Subjects

Gene Regulatory Networks, Software, Gene Expression, Gene Expression Profiling, Humans

8.

Multi-omics integration: a comparison of unsupervised clustering methodologies.

Tini, Giulia; Marchetti, Luca; Priami, Corrado; Scott-Boyer, Marie-Pier.

Brief Bioinform ; 20(4): 1269-1279, 2019 07 19.

Article in English | MEDLINE | ID: mdl-29272335

ABSTRACT

With the recent developments in the field of multi-omics integration, the interest in factors such as data preprocessing, choice of the integration method and the number of different omics considered has increased. In this work, the impact of these factors is explored when solving the problem of sample classification, by comparing the performances of five unsupervised algorithms: Multiple Canonical Correlation Analysis, Multiple Co-Inertia Analysis, Multiple Factor Analysis, Joint and Individual Variation Explained and Similarity Network Fusion. These methods were applied to three real data sets taken from literature and several ad hoc simulated scenarios to discuss classification performance in different conditions of noise and signal strength across the data types. The impact of experimental design, feature selection and parameter training has also been evaluated to unravel important conditions that can affect the accuracy of the result.

Subjects

Computational Biology/methods, Systems Integration, Unsupervised Machine Learning, Algorithms, Animals, Cluster Analysis, Computer Simulation, Factual Databases, Statistical Factor Analysis, Genomics/statistics & numerical data, Humans, Metabolomics/statistics & numerical data, Mice, Biological Models, Multivariate Analysis, Proteomics/statistics & numerical data, Systems Biology, Unsupervised Machine Learning/statistics & numerical data

9.

Inferring and modeling inheritance of differentially methylated changes across multiple generations.

Belleau, Pascal; Deschênes, Astrid; Scott-Boyer, Marie-Pier; Lambrot, Romain; Dalvai, Mathieu; Kimmins, Sarah; Bailey, Janice; Droit, Arnaud.

Nucleic Acids Res ; 46(14): e85, 2018 08 21.

Article in English | MEDLINE | ID: mdl-29750268

ABSTRACT

High-throughput methylation sequencing enables genome-wide detection of differentially methylated sites (DMS) or regions (DMR). Increasing evidence suggests that treatment-induced DMS can be transmitted across generations, but the analysis of induced methylation changes across multiple generations is complicated by the lack of sound statistical methods to evaluate significance levels. Due to software design, DMS detection was usually made on each generation separately, thus disregarding stochastic effects expected when a large number of DMS is detected in each generation. Here, we present a novel method based on Monte Carlo sampling, methylInheritance, to evaluate whether the number of conserved DMS between several generations is associated with an effect inherited from a treatment rather than with randomness. Moreover, we developed an inheritance simulation package, methInheritSim, to demonstrate the performance of the methylInheritance method and to evaluate the power of different experimental designs. Finally, we applied methylInheritance to a DNA methylation dataset obtained from early-life persistent organic pollutants (POPs) exposed Sprague-Dawley female rats and their descendants through a paternal transmission. The results show that methylInheritance can efficiently identify treatment-induced inherited methylation changes. Specifically, we identified two intergenerationally conserved DMS at transcription start sites (TSS); one of those persisted transgenerationally. Three transgenerationally conserved DMR were found at intra- or intergenic regions.

Subjects

DNA Methylation, Inheritance Patterns, Animals, Computer Simulation, Environmental Pollutants, Genetic Epigenesis, Female, Male, Genetic Models, Monte Carlo Method, Sprague-Dawley Rats

10.

LY75 Ablation Mediates Mesenchymal-Epithelial Transition (MET) in Epithelial Ovarian Cancer (EOC) Cells Associated with DNA Methylation Alterations and Suppression of the Wnt/β-Catenin Pathway.

Mehdi, Sadia; Bachvarova, Magdalena; Scott-Boyer, Marie-Pier; Droit, Arnaud; Bachvarov, Dimcho.

Int J Mol Sci ; 21(5)2020 Mar 07.

Article in English | MEDLINE | ID: mdl-32156068

ABSTRACT

Growing evidence demonstrates that epithelial-mesenchymal transition (EMT) plays an important role in epithelial ovarian cancer (EOC) progression and spreading; however, its molecular mechanisms remain poorly defined. We have previously shown that the antigen receptor LY75 can modulate EOC cell phenotype and metastatic potential, as LY75 depletion directed mesenchymal-epithelial transition (MET) in EOC cell lines with mesenchymal phenotype. We used the LY75-mediated modulation of EMT as a model to investigate DNA methylation changes during EMT in EOC cells, by applying the reduced representation bisulfite sequencing (RRBS) methodology. Numerous genes displayed EMT-related alterations of DNA methylation patterns in their promoter/exon regions. Ten selected genes, whose DNA methylation alterations were further confirmed by alternative methods, were identified, some of which could represent new EOC biomarkers/therapeutic targets. Moreover, our methylation data were strongly indicative of the predominant implication of the Wnt/β-catenin pathway in the EMT-induced DNA methylation variations in EOC cells. Subsequent experiments, including alterations in the Wnt/β-catenin pathway activity in EOC cells with a specific inhibitor and the identification of LY75-interacting partners by a proteomic approach, were strongly indicative of the direct implication of the LY75 receptor in modulating the Wnt/β-catenin signaling in EOC cells.

Subjects

CD Antigens/genetics, Ovarian Epithelial Carcinoma/pathology, DNA Methylation/genetics, Epithelial-Mesenchymal Transition/genetics, C-Type Lectins/genetics, Minor Histocompatibility Antigens/genetics, Ovarian Neoplasms/pathology, Cell Surface Receptors/genetics, Wnt Signaling Pathway/genetics, beta Catenin/antagonists & inhibitors, Tumor Cell Line, Female, Neoplastic Gene Expression Regulation/genetics, Humans, RNA Interference, Small Interfering RNA/genetics

11.

Sexual Dimorphism, Age, and Fat Mass Are Key Phenotypic Drivers of Proteomic Signatures.

Curran, Aoife M; Fogarty Draper, Colleen; Scott-Boyer, Marie-Pier; Valsesia, Armand; Roche, Helen M; Ryan, Miriam F; Gibney, Michael J; Kutmon, Martina; Evelo, Chris T; Coort, Susan L; Astrup, Arne; Saris, Wim H; Brennan, Lorraine; Kaput, Jim.

J Proteome Res ; 16(11): 4122-4133, 2017 11 03.

Article in English | MEDLINE | ID: mdl-28950061

ABSTRACT

Validated protein biomarkers are needed for assessing health trajectories, predicting and subclassifying disease, and optimizing diagnostic and therapeutic clinical decision-making. The sensitivity, specificity, accuracy, and precision of single or combinations of protein biomarkers may be altered by differences in physiological states, limiting the ability to translate research results to clinically useful diagnostic tests. Aptamer-based affinity assays were used to test whether low-abundance serum proteins differed based on age, sex, and fat mass in a healthy population of 94 males and 102 females from the MECHE cohort. The findings were replicated in 217 healthy male and 377 healthy female participants in the DiOGenes consortium. Of the 1129 proteins in the panel, 141, 51, and 112 proteins (adjusted p < 0.1) were identified in the MECHE cohort and significantly replicated in DiOGenes for sexual dimorphism, age, and fat mass, respectively. Pathway analysis classified a subset of proteins from the 3 phenotypes to the complement and coagulation cascades pathways and to immune and coagulation processes. These results demonstrated that specific proteins were statistically associated with dichotomous (male vs female) and continuous phenotypes (age, fat mass), which may influence the identification and use of biomarkers of clinical utility for health diagnosis and therapeutic strategies.

Subjects

Phenotype, Proteomics/methods, Adipose Tissue, Age Factors, Female, Humans, Male, Sex Characteristics

12.

Inferring and modeling inheritance of differentially methylated changes across multiple generations.

Belleau, Pascal; Deschênes, Astrid; Scott-Boyer, Marie-Pier; Lambrot, Romain; Dalvai, Mathieu; Kimmins, Sarah; Bailey, Janice; Droit, Arnaud.

Nucleic Acids Res ; 46(14): 7466, 2018 08 21.

Article in English | MEDLINE | ID: mdl-29788315

13.

iBMQ: a R/Bioconductor package for integrated Bayesian modeling of eQTL data.

Imholte, Greg C; Scott-Boyer, Marie-Pier; Labbe, Aurélie; Deschepper, Christian F; Gottardo, Raphael.

Bioinformatics ; 29(21): 2797-8, 2013 Nov 01.

Article in English | MEDLINE | ID: mdl-23958729

ABSTRACT

MOTIVATION: Recently, mapping studies of expression quantitative trait loci (eQTL) (where gene expression levels are viewed as quantitative traits) have provided insight into the biology of gene regulation. Bayesian methods provide natural modeling frameworks for analyzing eQTL studies, where information shared across markers and/or genes can increase the power to detect eQTLs. Bayesian approaches tend to be computationally demanding and require specialized software. As a result, most eQTL studies use univariate methods treating each gene independently, leading to suboptimal results. RESULTS: We present a powerful, computationally optimized and free open-source R package, iBMQ. Our package implements a joint hierarchical Bayesian model where all genes and SNPs are modeled concurrently. Model parameters are estimated using a Markov chain Monte Carlo algorithm. The free and widely used OpenMP parallel library speeds up computation. Using a mouse cardiac dataset, we show that iBMQ improves the detection of large trans-eQTL hotspots compared with other state-of-the-art packages for eQTL analysis. AVAILABILITY: The R package iBMQ is available from the Bioconductor Web site at http://bioconductor.org and runs on Linux, Windows and Mac OS X. It is distributed under the Artistic Licence-2.0 terms. CONTACT: christian.deschepper@ircm.qc.ca or rgottard@fhcrc.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subjects

Gene Expression, Quantitative Trait Loci, Software, Algorithms, Animals, Bayes Theorem, Markov Chains, Mice, Monte Carlo Method, Single Nucleotide Polymorphism

14.

Target repositioning using multi-layer networks and machine learning: The case of prostate cancer.

Picard, Milan; Scott-Boyer, Marie-Pier; Bodein, Antoine; Leclercq, Mickaël; Prunier, Julien; Périn, Olivier; Droit, Arnaud.

Comput Struct Biotechnol J ; 24: 464-475, 2024 Dec.

Article in English | MEDLINE | ID: mdl-38983753

ABSTRACT

The discovery of novel therapeutic targets, defined as proteins which drugs can interact with to induce therapeutic benefits, typically represents the first and most important step of drug discovery. One solution for target discovery is target repositioning, a strategy which relies on the repurposing of known targets for new diseases, leading to new treatments, fewer side effects and potential drug synergies. Biological networks have emerged as powerful tools for integrating heterogeneous data and facilitating the prediction of biological or therapeutic properties. Consequently, they are widely employed to predict new therapeutic targets by characterizing potential candidates, often based on their interactions within a Protein-Protein Interaction (PPI) network, and their proximity to genes associated with the disease. However, over-reliance on PPI networks and the assumption that potential targets are necessarily near known genes can introduce biases that may limit the effectiveness of these methods. This study addresses these limitations in two ways. First, by exploiting a multi-layer network which incorporates additional information such as gene regulation, metabolite interactions, metabolic pathways, and several disease signatures such as Differentially Expressed Genes, mutated genes, Copy Number Alteration, and structural variants. Second, by extracting relevant features from the network using several approaches including proximity to disease-associated genes, but also unbiased approaches such as propagation-based methods, topological metrics, and module detection algorithms. Using prostate cancer as a case study, the best features were identified and utilized to train machine learning algorithms to predict 5 novel promising therapeutic targets for prostate cancer: IGF2R, C5AR, RAB7, SETD2 and NPBWR1.

15.

SOS genes are rapidly induced while translesion synthesis polymerase activity is temporally regulated.

Bergum, Olaug Elisabeth Torheim; Singleton, Amanda Holstad; Røst, Lisa Marie; Bodein, Antoine; Scott-Boyer, Marie-Pier; Rye, Morten Beck; Droit, Arnaud; Bruheim, Per; Otterlei, Marit.

Front Microbiol ; 15: 1373344, 2024.

Article in English | MEDLINE | ID: mdl-38596376

ABSTRACT

The DNA damage-inducible SOS response in bacteria serves to increase survival of the species at the cost of mutagenesis. The SOS response first initiates error-free repair followed by error-prone repair. Here, we have employed a multi-omics approach to elucidate the temporal coordination of the SOS response. Escherichia coli was grown in batch cultivation in bioreactors to ensure highly controlled conditions, and a low dose of the antibiotic ciprofloxacin was used to activate the SOS response while avoiding extensive cell death. Our results show that genes involved in both error-free and error-prone repair were induced shortly after DNA damage, thus challenging the established perception that the expression of error-prone repair genes is delayed. By combining transcriptomics and a sub-proteomics approach termed signalomics, we found that the temporal segregation of error-free and error-prone repair is primarily regulated after transcription, supporting the current literature. Furthermore, the heterology index (i.e., the binding affinity of LexA to the SOS box) was correlated to the maximum increase in gene expression and not to the time of induction of SOS genes. Finally, quantification of metabolites revealed increasing pyrimidine pools as a late feature of the SOS response. Our results elucidate how the SOS response is coordinated, showing a rapid transcriptional response and temporal regulation of mutagenesis on the protein and metabolite levels.

16.

Unveiling the crucial neuronal role of the proteasomal ATPase subunit gene PSMC5 in neurodevelopmental proteasomopathies.

Küry, Sébastien; Stanton, Janelle E; van Woerden, Geeske; Hsieh, Tzung-Chien; Rosenfelt, Cory; Scott-Boyer, Marie Pier; Most, Victoria; Wang, Tianyun; Papendorf, Jonas Johannes; de Konink, Charlotte; Deb, Wallid; Vignard, Virginie; Studencka-Turski, Maja; Besnard, Thomas; Hajdukowicz, Anna Marta; Thiel, Franziska; Möller, Sophie; Florenceau, Laëtitia; Cuinat, Silvestre; Marsac, Sylvain; Wentzensen, Ingrid; Tuttle, Annabelle; Forster, Cara; Striesow, Johanna; Golnik, Richard; Ortiz, Damara; Jenkins, Laura; Rosenfeld, Jill A; Ziegler, Alban; Houdayer, Clara; Bonneau, Dominique; Torti, Erin; Begtrup, Amber; Monaghan, Kristin G; Mullegama, Sureni V; Volker-Touw, C M L Nienke; van Gassen, Koen L I; Oegema, Renske; de Pagter, Mirjam; Steindl, Katharina; Rauch, Anita; Ivanovski, Ivan; McDonald, Kimberly; Boothe, Emily; Dauber, Andrew; Baker, Janice; Fabie, Noelle Andrea V; Bernier, Raphael A; Turner, Tychele N; Srivastava, Siddharth.

medRxiv ; 2024 Jan 26.

Article in English | MEDLINE | ID: mdl-38293138

ABSTRACT

Neurodevelopmental proteasomopathies represent a distinctive category of neurodevelopmental disorders (NDD) characterized by genetic variations within the 26S proteasome, a protein complex governing eukaryotic cellular protein homeostasis. In our comprehensive study, we identified 23 unique variants in PSMC5, which encodes the AAA-ATPase proteasome subunit PSMC5/Rpt6, causing syndromic NDD in 38 unrelated individuals. Overexpression of PSMC5 variants altered human hippocampal neuron morphology, while PSMC5 knockdown led to impaired reversal learning in flies and loss of excitatory synapses in rat hippocampal neurons. PSMC5 loss-of-function resulted in abnormal protein aggregation, profoundly impacting innate immune signaling, mitophagy rates, and lipid metabolism in affected individuals. Importantly, targeting key components of the integrated stress response, such as PKR and GCN2 kinases, ameliorated immune dysregulations in cells from affected individuals. These findings significantly advance our understanding of the molecular mechanisms underlying neurodevelopmental proteasomopathies, provide links to research in neurodegenerative diseases, and open up potential therapeutic avenues.

17.

An integrated hierarchical Bayesian model for multivariate eQTL mapping.

Scott-Boyer, Marie Pier; Imholte, Gregory C; Tayeb, Arafat; Labbe, Aurelie; Deschepper, Christian F; Gottardo, Raphael.

Stat Appl Genet Mol Biol ; 11(4)2012 Jul 12.

Article in English | MEDLINE | ID: mdl-22850063

ABSTRACT

Recently, expression quantitative trait loci (eQTL) mapping studies, where expression levels of thousands of genes are viewed as quantitative traits, have been used to provide greater insight into the biology of gene regulation. Originally, eQTLs were detected by applying standard QTL detection tools (using a "one gene at a time" approach), but this method ignores many possible interactions between genes. Several other methods have been proposed to overcome these limitations, but each of them has some specific disadvantages. In this paper, we present an integrated hierarchical Bayesian model that jointly models all genes and SNPs to detect eQTLs. We propose a model (named iBMQ) that is specifically designed to handle a large number G of gene expressions, a large number S of regressors (genetic markers) and a small number n of individuals in what we call a "large G, large S, small n" paradigm. This method incorporates genotypic and gene expression data into a single model while 1) specifically coping with the high dimensionality of eQTL data (large number of genes), 2) borrowing strength from all gene expression data for the mapping procedures, and 3) controlling the number of false positives to a desirable level. To validate our model, we performed simulation studies and showed that it outperforms other popular methods for eQTL detection, including QTLBIM, R-QTL, remMap and M-SPLS. Finally, we used our model to analyze a real expression dataset obtained in a panel of BXD Recombinant Inbred (RI) mouse strains. Analysis of these data with iBMQ revealed the presence of multiple hotspots showing significant enrichment in genes belonging to one or more annotation categories.

Subjects

Chromosome Mapping/statistics & numerical data , Gene Expression Regulation/genetics , Quantitative Trait Loci , Algorithms , Animals , Bayes Theorem , Chromosome Mapping/methods , Computer Simulation , Mice , Mice, Inbred Strains , Models, Genetic , Models, Theoretical , Polymorphism, Single Nucleotide/physiology , Regression Analysis

18.

Transgenerational impact of grand-paternal lifetime exposures to both folic acid deficiency and supplementation on genome-wide DNA methylation in male germ cells.

Chan, Donovan; Ly, Lundi; Rebolledo, Edgar Martínez Duncker; Martel, Josée; Landry, Mylène; Scott-Boyer, Marie-Pier; Droit, Arnaud; Trasler, Jacquetta M.

Andrology ; 11(5): 927-942, 2023 07.

Article in English | MEDLINE | ID: mdl-36697378

ABSTRACT

BACKGROUND: DNA methylation (DNAme) erasure and reacquisition occur during prenatal male germ cell development; some further remodeling takes place after birth during spermatogenesis. Environmental insults during germline epigenetic reprogramming may affect DNAme, presenting a potential mechanism for transmission of environmental exposures across multiple generations. OBJECTIVES: We investigated how germ cell DNAme is impacted by lifetime exposures to diets containing either low or high, clinically relevant, levels of the methyl donor folic acid and whether resulting DNAme alterations were inherited in germ cells of male offspring of subsequent generations. MATERIALS AND METHODS: Female mice were placed on a control (FCD), 7-fold folic acid deficient (7FD) or 10- to 20-fold supplemented (10FS and 20FS) diet before and during pregnancy. Resulting F1 litters were weaned on the respective diets. F2 and F3 males received control diets. Genome-wide DNAme at cytosines (within CpG sites) was assessed in F1 spermatogonia, and in F1, F2 and F3 sperm. RESULTS: In F1 germ cells, a greater number of differentially methylated cytosines (DMCs) were observed in spermatogonia as compared with F1 sperm for all folic acid diets. DMCs were lower in number in F2 versus F1 sperm, while an unexpected increase was found in F3 sperm. DMCs were predominantly hypomethylated, with genes in neurodevelopmental pathways commonly affected in F1, F2 and F3 male germ cells. While no DMCs were found to be significantly inherited inter- or transgenerationally, we observed over-representation of repetitive elements, particularly young long interspersed nuclear elements (LINEs). DISCUSSION AND CONCLUSION: These results suggest that the prenatal window is the time most susceptible to folate-induced alterations in sperm DNAme in male germ cells. Altered methylation of specific sites in F1 germ cells was not present in later generations. However, the presence of DNAme perturbations in the sperm of males of the F2 and F3 generations suggests that epigenetic inheritance mechanisms other than DNAme may have been impacted by the folate diet exposure of F1 germ cells.

Subjects

DNA Methylation , Folic Acid Deficiency , Pregnancy , Male , Female , Mice , Animals , Folic Acid Deficiency/genetics , Folic Acid Deficiency/metabolism , Semen/metabolism , Epigenesis, Genetic , Spermatozoa/metabolism , Folic Acid/metabolism , Dietary Supplements , Spermatogonia/metabolism , DNA/metabolism

19.

PSMC3 proteasome subunit variants are associated with neurodevelopmental delay and type I interferon production.

Ebstein, Frédéric; Küry, Sébastien; Most, Victoria; Rosenfelt, Cory; Scott-Boyer, Marie-Pier; van Woerden, Geeske M; Besnard, Thomas; Papendorf, Jonas Johannes; Studencka-Turski, Maja; Wang, Tianyun; Hsieh, Tzung-Chien; Golnik, Richard; Baldridge, Dustin; Forster, Cara; de Konink, Charlotte; Teurlings, Selina M W; Vignard, Virginie; van Jaarsveld, Richard H; Ades, Lesley; Cogné, Benjamin; Mignot, Cyril; Deb, Wallid; Jongmans, Marjolijn C J; Cole, F Sessions; van den Boogaard, Marie-José H; Wambach, Jennifer A; Wegner, Daniel J; Yang, Sandra; Hannig, Vickie; Brault, Jennifer Ann; Zadeh, Neda; Bennetts, Bruce; Keren, Boris; Gélineau, Anne-Claire; Powis, Zöe; Towne, Meghan; Bachman, Kristine; Seeley, Andrea; Beck, Anita E; Morrison, Jennifer; Westman, Rachel; Averill, Kelly; Brunet, Theresa; Haasters, Judith; Carter, Melissa T; Osmond, Matthew; Wheeler, Patricia G; Forzano, Francesca; Mohammed, Shehla; Trakadis, Yannis.

Sci Transl Med ; 15(698): eabo3189, 2023 05 31.

Article in English | MEDLINE | ID: mdl-37256937

ABSTRACT

A critical step in preserving protein homeostasis is the recognition, binding, unfolding, and translocation of protein substrates by six AAA-ATPase proteasome subunits (ATPase-associated with various cellular activities) termed PSMC1-6, which are required for degradation of proteins by 26S proteasomes. Here, we identified 15 de novo missense variants in the PSMC3 gene encoding the AAA-ATPase proteasome subunit PSMC3/Rpt5 in 23 unrelated heterozygous patients with an autosomal dominant form of neurodevelopmental delay and intellectual disability. Expression of PSMC3 variants in mouse neuronal cultures led to altered dendrite development, and deletion of the PSMC3 fly ortholog Rpt5 impaired reversal learning capabilities in fruit flies. Structural modeling as well as proteomic and transcriptomic analyses of T cells derived from patients with PSMC3 variants implicated the PSMC3 variants in proteasome dysfunction through disruption of substrate translocation, induction of proteotoxic stress, and alterations in proteins controlling developmental and innate immune programs. The proteostatic perturbations in T cells from patients with PSMC3 variants correlated with a dysregulation in type I interferon (IFN) signaling in these T cells, which could be blocked by inhibition of the intracellular stress sensor protein kinase R (PKR). These results suggest that proteotoxic stress activated PKR in patient-derived T cells, resulting in a type I IFN response. The potential relationship among proteasome dysfunction, type I IFN production, and neurodevelopment suggests new directions in our understanding of pathogenesis in some neurodevelopmental disorders.

Subjects

Interferon Type I , Proteasome Endopeptidase Complex , Animals , Humans , Mice , Adenosine Triphosphatases/genetics , Drosophila melanogaster , Gene Expression , Proteasome Endopeptidase Complex/metabolism , Proteomics

20.

Sperm Heterogeneity Accounts for Sperm DNA Methylation Variations Observed in the Caput Epididymis, Independently From DNMT/TET Activities.

Chen, Hong; Scott-Boyer, Marie-Pier; Droit, Arnaud; Robert, Claude; Belleannée, Clémence.

Front Cell Dev Biol ; 10: 834519, 2022.

Article in English | MEDLINE | ID: mdl-35392175

ABSTRACT

Following their production in the testis, spermatozoa enter the epididymis where they gain their motility and fertilizing abilities. This post-testicular maturation coincides with sperm epigenetic profile changes that influence progeny outcome. While recent studies highlighted the dynamics of small non-coding RNAs in maturing spermatozoa, little is known regarding sperm methylation changes and their impact at the post-fertilization level. Fluorescence-activated cell sorting (FACS) was used to purify spermatozoa from the testis and different epididymal segments (i.e., caput, corpus and cauda) of CAG/su9-DsRed2; Acr3-EGFP transgenic mice in order to map out sperm methylome dynamics. Reduced representation bisulfite sequencing (RRBS-Seq) performed on DNA from these respective sperm populations revealed extensive methylation changes between spermatozoa from the caput vs. testis, with 5,546 entries meeting our threshold values (q value <0.01, methylation difference above 25%). Most of these changes were transitory during epididymal sperm maturation, as indicated by the low number of entries identified between spermatozoa from cauda vs. testis. According to enzymatic and sperm/epididymal fluid co-incubation assays, (de)methylases were not found to be responsible for these sperm methylation changes. Instead, we identified that a subpopulation of caput spermatozoa displayed distinct methylation marks that were susceptible to sperm DNase treatment and accounted for the DNA methylation profile changes observed in the proximal epididymis. Our results support the paradigm that a fraction of caput spermatozoa has a higher propensity to bind extracellular DNA, a phenomenon responsible for the sperm methylome variations observed at the post-testicular level. Further investigating the degree of conservation of this sperm heterogeneity in humans will eventually provide new considerations regarding sperm selection procedures used in fertility clinics.
