ABSTRACT
Dysfunction of ENaC, the epithelial sodium channel that regulates salt and water reabsorption in epithelia, causes several human diseases, including cystic fibrosis (CF). To develop a global understanding of molecular regulators of ENaC traffic/function and to identify of candidate CF drug targets, we performed a large-scale screen combining high-content live-cell microscopy and siRNAs in human airway epithelial cells. Screening over 6,000 genes identified over 1,500 candidates, evenly divided between channel inhibitors and activators. Genes in the phosphatidylinositol pathway were enriched on the primary candidate list, and these, along with other ENaC activators, were examined further with secondary siRNA validation. Subsequent detailed investigation revealed ciliary neurotrophic factor receptor (CNTFR) as an ENaC modulator and showed that inhibition of (diacylglycerol kinase, iota) DGKι, a protein involved in PiP2 metabolism, downgrades ENaC activity, leading to normalization of both Na+ and fluid absorption in CF airways to non-CF levels in primary human lung cells from CF patients.
Subject(s)
Cystic Fibrosis/drug therapy , Molecular Targeted Therapy , Cell Line , Cells, Cultured , Epithelial Sodium Channels/metabolism , Humans , Lung/cytology , Lung/metabolism , RNA, Small InterferingABSTRACT
The main goals and challenges for the life science communities in the Open Science framework are to increase reuse and sustainability of data resources, software tools, and workflows, especially in large-scale data-driven research and computational analyses. Here, we present key findings, procedures, effective measures and recommendations for generating and establishing sustainable life science resources based on the collaborative, cross-disciplinary work done within the EOSC-Life (European Open Science Cloud for Life Sciences) consortium. Bringing together 13 European life science research infrastructures, it has laid the foundation for an open, digital space to support biological and medical research. Using lessons learned from 27 selected projects, we describe the organisational, technical, financial and legal/ethical challenges that represent the main barriers to sustainability in the life sciences. We show how EOSC-Life provides a model for sustainable data management according to FAIR (findability, accessibility, interoperability, and reusability) principles, including solutions for sensitive- and industry-related resources, by means of cross-disciplinary training and best practices sharing. Finally, we illustrate how data harmonisation and collaborative work facilitate interoperability of tools, data, solutions and lead to a better understanding of concepts, semantics and functionalities in the life sciences.
Subject(s)
Biological Science Disciplines , Biomedical Research , Software , WorkflowABSTRACT
Histone ubiquitylation is a prominent response to DNA double-strand breaks (DSBs), but how these modifications are confined to DNA lesions is not understood. Here, we show that TRIP12 and UBR5, two HECT domain ubiquitin E3 ligases, control accumulation of RNF168, a rate-limiting component of a pathway that ubiquitylates histones after DNA breakage. We find that RNF168 can be saturated by increasing amounts of DSBs. Depletion of TRIP12 and UBR5 allows accumulation of RNF168 to supraphysiological levels, followed by massive spreading of ubiquitin conjugates and hyperaccumulation of ubiquitin-regulated genome caretakers such as 53BP1 and BRCA1. Thus, regulatory and proteolytic ubiquitylations are wired in a self-limiting circuit that promotes histone ubiquitylation near the DNA lesions but at the same time counteracts its excessive spreading to undamaged chromosomes. We provide evidence that this mechanism is vital for the homeostasis of ubiquitin-controlled events after DNA breakage and can be subverted during tumorigenesis.
Subject(s)
Carrier Proteins/metabolism , Chromatin/metabolism , DNA Breaks, Double-Stranded , DNA Repair , Ubiquitin-Protein Ligases/metabolism , Alphapapillomavirus , Cell Line , Cell Line, Tumor , Gene Silencing , Humans , Intracellular Signaling Peptides and Proteins/metabolism , Neoplasms/metabolism , Neoplasms/pathology , Neoplasms/virology , Papillomavirus Infections/metabolism , Papillomavirus Infections/pathology , Transcription, Genetic , Tumor Suppressor p53-Binding Protein 1 , UbiquitinationABSTRACT
MOTIVATION: The nuclear pore complex (NPC) is the only passageway for macromolecules between nucleus and cytoplasm, and an important reference standard in microscopy: it is massive and stereotypically arranged. The average architecture of NPC proteins has been resolved with pseudoatomic precision, however observed NPC heterogeneities evidence a high degree of divergence from this average. Single-molecule localization microscopy (SMLM) images NPCs at protein-level resolution, whereupon image analysis software studies NPC variability. However, the true picture of this variability is unknown. In quantitative image analysis experiments, it is thus difficult to distinguish intrinsically high SMLM noise from variability of the underlying structure. RESULTS: We introduce CIR4MICS ('ceramics', Configurable, Irregular Rings FOR MICroscopy Simulations), a pipeline that synthesizes ground truth datasets of structurally variable NPCs based on architectural models of the true NPC. Users can select one or more N- or C-terminally tagged NPC proteins, and simulate a wide range of geometric variations. We also represent the NPC as a spring-model such that arbitrary deforming forces, of user-defined magnitudes, simulate irregularly shaped variations. Further, we provide annotated reference datasets of simulated human NPCs, which facilitate a side-by-side comparison with real data. To demonstrate, we synthetically replicate a geometric analysis of real NPC radii and reveal that a range of simulated variability parameters can lead to observed results. Our simulator is therefore valuable to test the capabilities of image analysis methods, as well as to inform experimentalists about the requirements of hypothesis-driven imaging studies. AVAILABILITY AND IMPLEMENTATION: Code: https://github.com/uhlmanngroup/cir4mics. Simulated data: BioStudies S-BSST1058.
Subject(s)
Microscopy , Nuclear Pore , Humans , Nuclear Pore/chemistry , Nuclear Pore/metabolism , Nuclear Pore Complex Proteins/analysis , Nuclear Pore Complex Proteins/metabolism , Single Molecule Imaging/methods , SoftwareABSTRACT
Essential biological functions, such as mitosis, require tight coordination of hundreds of proteins in space and time. Localization, the timing of interactions and changes in cellular structure are all crucial to ensure the correct assembly, function and regulation of protein complexes1-4. Imaging of live cells can reveal protein distributions and dynamics but experimental and theoretical challenges have prevented the collection of quantitative data, which are necessary for the formulation of a model of mitosis that comprehensively integrates information and enables the analysis of the dynamic interactions between the molecular parts of the mitotic machinery within changing cellular boundaries. Here we generate a canonical model of the morphological changes during the mitotic progression of human cells on the basis of four-dimensional image data. We use this model to integrate dynamic three-dimensional concentration data of many fluorescently knocked-in mitotic proteins, imaged by fluorescence correlation spectroscopy-calibrated microscopy5. The approach taken here to generate a dynamic protein atlas of human cell division is generic; it can be applied to systematically map and mine dynamic protein localization networks that drive cell division in different cell types, and can be conceptually transferred to other cellular functions.
Subject(s)
Cell Cycle Proteins/analysis , Cell Cycle Proteins/metabolism , Mitosis , Gene Editing , Green Fluorescent Proteins/analysis , Green Fluorescent Proteins/metabolism , HeLa Cells , Humans , Imaging, Three-Dimensional , Microscopy, Fluorescence , Molecular Imaging , Time FactorsABSTRACT
Chromosome segregation depends on sister chromatid cohesion which is established by cohesin during DNA replication. Cohesive cohesin complexes become acetylated to prevent their precocious release by WAPL before cells have reached mitosis. To obtain insight into how DNA replication, cohesion establishment and cohesin acetylation are coordinated, we analysed the interaction partners of 55 human proteins implicated in these processes by mass spectrometry. This proteomic screen revealed that on chromatin the cohesin acetyltransferase ESCO2 associates with the MCM2-7 subcomplex of the replicative Cdc45-MCM-GINS helicase. The analysis of ESCO2 mutants defective in MCM binding indicates that these interactions are required for proper recruitment of ESCO2 to chromatin, cohesin acetylation during DNA replication, and centromeric cohesion. We propose that MCM binding enables ESCO2 to travel with replisomes to acetylate cohesive cohesin complexes in the vicinity of replication forks so that these complexes can be protected from precocious release by WAPL Our results also indicate that ESCO1 and ESCO2 have distinct functions in maintaining cohesion between chromosome arms and centromeres, respectively.
Subject(s)
Acetyltransferases/metabolism , Chromatids/metabolism , Chromosomal Proteins, Non-Histone/metabolism , Chromosome Segregation/genetics , Minichromosome Maintenance Proteins/metabolism , Acetylation , Cell Cycle Proteins/metabolism , Humans , Mitosis/genetics , CohesinsABSTRACT
In many disciplines, data are highly decentralized across thousands of online databases (repositories, registries, and knowledgebases). Wringing value from such databases depends on the discipline of data science and on the humble bricks and mortar that make integration possible; identifiers are a core component of this integration infrastructure. Drawing on our experience and on work by other groups, we outline 10 lessons we have learned about the identifier qualities and best practices that facilitate large-scale data integration. Specifically, we propose actions that identifier practitioners (database providers) should take in the design, provision and reuse of identifiers. We also outline the important considerations for those referencing identifiers in various circumstances, including by authors and data generators. While the importance and relevance of each lesson will vary by context, there is a need for increased awareness about how to avoid and manage common identifier problems, especially those related to persistence and web-accessibility/resolvability. We focus strongly on web-based identifiers in the life sciences; however, the principles are broadly relevant to other disciplines.
Subject(s)
Biological Science Disciplines/methods , Computational Biology/methods , Data Mining/methods , Software Design , Software , Biological Science Disciplines/statistics & numerical data , Biological Science Disciplines/trends , Computational Biology/trends , Data Mining/statistics & numerical data , Data Mining/trends , Databases, Factual/statistics & numerical data , Databases, Factual/trends , Forecasting , Humans , InternetABSTRACT
BACKGROUND: Loss-of-function phenotypes are widely used to infer gene function using the principle that similar phenotypes are indicative of similar functions. However, converting phenotypic to functional annotations requires careful interpretation of phenotypic descriptions and assessment of phenotypic similarity. Understanding how functions and phenotypes are linked will be crucial for the development of methods for the automatic conversion of gene loss-of-function phenotypes to gene functional annotations. RESULTS: We explored the relation between cellular phenotypes from RNAi-based screens in human cells and gene annotations of cellular functions as provided by the Gene Ontology (GO). Comparing different similarity measures, we found that information content-based measures of phenotypic similarity were the best at capturing gene functional similarity. However, phenotypic similarities did not map to the Gene Ontology organization of gene function but to functions defined as groups of GO terms with shared gene annotations. CONCLUSIONS: Our observations have implications for the use and interpretation of phenotypic similarities as a proxy for gene functions both in RNAi screen data analysis and curation and in the prediction of disease genes.
Subject(s)
Computational Biology/methods , Area Under Curve , Cluster Analysis , Humans , Phenotype , RNA Interference , ROC CurveABSTRACT
MOTIVATION: Most biological processes remain only partially characterized with many components still to be identified. Given that a whole genome can usually not be tested in a functional assay, identifying the genes most likely to be of interest is of critical importance to avoid wasting resources. RESULTS: Given a set of known functionally related genes and using a state-of-the-art approach to data integration and mining, our Functional Lists (FUN-L) method provides a ranked list of candidate genes for testing. Validation of predictions from FUN-L with independent RNAi screens confirms that FUN-L-produced lists are enriched in genes with the expected phenotypes. In this article, we describe a website front end to FUN-L. AVAILABILITY AND IMPLEMENTATION: The website is freely available to use at http://funl.org
Subject(s)
Algorithms , Computational Biology/methods , Data Mining/methods , Gene Regulatory Networks , RNA Interference , Software , Humans , PhenotypeABSTRACT
Despite our rapidly growing knowledge about the human genome, we do not know all of the genes required for some of the most basic functions of life. To start to fill this gap we developed a high-throughput phenotypic screening platform combining potent gene silencing by RNA interference, time-lapse microscopy and computational image processing. We carried out a genome-wide phenotypic profiling of each of the approximately 21,000 human protein-coding genes by two-day live imaging of fluorescently labelled chromosomes. Phenotypes were scored quantitatively by computational image processing, which allowed us to identify hundreds of human genes involved in diverse biological functions including cell division, migration and survival. As part of the Mitocheck consortium, this study provides an in-depth analysis of cell division phenotypes and makes the entire high-content data set available as a resource to the community.
Subject(s)
Cell Division/genetics , Genome, Human/genetics , Microscopy, Fluorescence/methods , Phenotype , Animals , Cell Movement/genetics , Cell Survival/genetics , Color , Gene Knockdown Techniques , Genes/genetics , HeLa Cells , Humans , Kinetics , Mice , Mitosis/genetics , RNA Interference , Reproducibility of Results , Spindle Apparatus/genetics , Spindle Apparatus/metabolism , Time FactorsABSTRACT
Whereas bacterial artificial chromosomes (BACs) offer many advantages in studies of gene and protein function, generation of seamless, precisely mutated BACs has been difficult. Here we describe a counterselection-based recombineering method and its accompanying reagents. After identifying intramolecular recombination as the major problem in counterselection, we built a strategy to reduce these unwanted events by expressing Redß alone at the crucial step. We enhanced this method by using phosphothioated oligonucleotides, using a sequence-altered rpsL counterselection gene and developing online software for oligonucleotide design. We illustrated this method by generating transgenic mammalian cell lines carrying small interfering RNA-resistant and point-mutated BAC transgenes. Using this approach, we generated mutated TACC3 transgenes to identify phosphorylation-specific spindle defects after knockdown of endogenous TACC3 expression. Our results highlight the complementary use of precisely mutated BAC transgenes and RNA interference in the study of cell biology at physiological expression levels and regulation.
Subject(s)
Chromosomes, Artificial, Bacterial/genetics , Mutagenesis, Site-Directed/methods , Oligonucleotides/genetics , Recombination, Genetic/genetics , Cell Line, Tumor , Drug Resistance, Bacterial , Escherichia coli Proteins/genetics , Humans , Microtubule-Associated Proteins/genetics , Protein Engineering/methods , RNA Interference , Ribosomal Protein S9 , Ribosomal Proteins/genetics , Software , TransgenesABSTRACT
Insect biomass is declining globally, likely driven by climate change and pesticide use, yet systematic studies on the effects of various chemicals remain limited. In this work, we used a chemical library of 1024 molecules-covering insecticides, herbicides, fungicides, and plant growth inhibitors-to assess the impact of sublethal pesticide doses on insects. In Drosophila melanogaster, 57% of chemicals affected larval behavior, and a higher proportion compromised long-term survivability. Exposure to sublethal doses also induced widespread changes in the phosphoproteome and changes in development and reproduction. The negative effects of agrochemicals were amplified when the temperature was increased. We observed similar behavioral changes across multiple insect species, including mosquitoes and butterflies. These findings suggest that widespread sublethal pesticide exposure can alter insect behavior and physiology, threatening long-term population survival.
Subject(s)
Agrochemicals , Insecta , Animals , Agrochemicals/toxicity , Behavior, Animal/drug effects , Butterflies/drug effects , Butterflies/growth & development , Drosophila melanogaster/drug effects , Herbicides/toxicity , Insecticides/toxicity , Larva/drug effects , Larva/growth & development , Reproduction/drug effects , Small Molecule Libraries/toxicity , Temperature , Insecta/drug effects , Proteome/drug effects , Hot Temperature , Extinction, Biological , Dose-Response Relationship, DrugABSTRACT
BACKGROUND: The combination of time-lapse imaging of live cells with high-throughput perturbation assays is a powerful tool for genetics and cell biology. The Mitocheck project employed this technique to associate thousands of genes with transient biological phenotypes in cell division, cell death and migration. The original analysis of these data proceeded by assigning nuclear morphologies to cells at each time-point using automated image classification, followed by description of population frequencies and temporal distribution of cellular states through event-order maps. One of the choices made by that analysis was not to rely on temporal tracking of the individual cells, due to the relatively low image sampling frequency, and to focus on effects that could be discerned from population-level behaviour. RESULTS: Here, we present a variation of this approach that employs explicit modelling by dynamic differential equations of the cellular state populations. Model fitting to the time course data allowed reliable estimation of the penetrance and time of appearance of four types of disruption of the cell cycle: quiescence, mitotic arrest, polynucleation and cell death. Model parameters yielded estimates of the duration of the interphase and mitosis phases. We identified 2190 siRNAs that induced a disruption of the cell cycle at reproducible times, or increased the durations of the interphase or mitosis phases. CONCLUSIONS: We quantified the dynamic effects of the siRNAs and compiled them as a resource that can be used to characterize the role of their target genes in cell death, mitosis and cell cycle regulation. The described population-based modelling method might be applicable to other large-scale cell-based assays with temporal readout when only population-level measures are available.
Subject(s)
Cell Cycle/genetics , Computational Biology/methods , Image Processing, Computer-Assisted/methods , Models, Biological , RNA Interference , HeLa Cells , Humans , Mitosis/genetics , Phenotype , RNA, Small Interfering/genetics , RNA, Small Interfering/metabolism , SoftwareABSTRACT
Methods and tools for visualizing biological data have improved considerably over the last decades, but they are still inadequate for some high-throughput data sets. For most users, a key challenge is to benefit from the deluge of data without being overwhelmed by it. This challenge is still largely unfulfilled and will require the development of truly integrated and highly useable tools.
Subject(s)
Image Processing, Computer-Assisted , Systems Integration , User-Computer InterfaceABSTRACT
Advances in imaging techniques and high-throughput technologies are providing scientists with unprecedented possibilities to visualize internal structures of cells, organs and organisms and to collect systematic image data characterizing genes and proteins on a large scale. To make the best use of these increasingly complex and large image data resources, the scientific community must be provided with methods to query, analyze and crosslink these resources to give an intuitive visual representation of the data. This review gives an overview of existing methods and tools for this purpose and highlights some of their limitations and challenges.
Subject(s)
Image Processing, Computer-Assisted , Magnetic Resonance Imaging , Microscopy/methodsABSTRACT
Microglia are very sensitive to changes in the environment and respond through morphological, functional and metabolic adaptations. To depict the modifications microglia undergo under healthy and pathological conditions, we developed free access image analysis scripts to quantify microglia morphologies and phagocytosis. Neuron-glia cultures, in which microglia express the reporter tdTomato, were exposed to excitotoxicity or excitotoxicity + inflammation and analysed 8 h later. Neuronal death was assessed by SYTOX staining of nucleus debris and phagocytosis was measured through the engulfment of SYTOX+ particles in microglia. We identified seven morphologies: round, hypertrophic, fried egg, bipolar and three 'inflamed' morphologies. We generated a classifier able to separate them and assign one of the seven classes to each microglia in sample images. In control cultures, round and hypertrophic morphologies were predominant. Excitotoxicity had a limited effect on the composition of the populations. By contrast, excitotoxicity + inflammation promoted an enrichment in inflamed morphologies and increased the percentage of phagocytosing microglia. Our data suggest that inflammation is critical to promote phenotypical changes in microglia. We also validated our tools for the segmentation of microglia in brain slices and performed morphometry with the obtained mask. Our method is versatile and useful to correlate microglia sub-populations and behaviour with environmental changes.
Subject(s)
Microglia , Phagocytosis , Humans , Microglia/metabolism , Inflammation/metabolism , Cell Death , Neurons/metabolismABSTRACT
Many bioimage analysis projects produce quantitative descriptors of regions of interest in images. Associating these descriptors with visual characteristics of the objects they describe is a key step in understanding the data at hand. However, as many bioimage data and their analysis workflows are moving to the cloud, addressing interactive data exploration in remote environments has become a pressing issue. To address it, we developed the Image Data Explorer (IDE) as a web application that integrates interactive linked visualization of images and derived data points with exploratory data analysis methods, annotation, classification and feature selection functionalities. The IDE is written in R using the shiny framework. It can be easily deployed on a remote server or on a local computer. The IDE is available at https://git.embl.de/heriche/image-data-explorer and a cloud deployment is accessible at https://shiny-portal.embl.de/shinyapps/app/01_image-data-explorer.
Subject(s)
SoftwareABSTRACT
The interpretation of genome sequences requires reliable and standardized methods to assess protein function at high throughput. Here we describe a fast and reliable pipeline to study protein function in mammalian cells based on protein tagging in bacterial artificial chromosomes (BACs). The large size of the BAC transgenes ensures the presence of most, if not all, regulatory elements and results in expression that closely matches that of the endogenous gene. We show that BAC transgenes can be rapidly and reliably generated using 96-well-format recombineering. After stable transfection of these transgenes into human tissue culture cells or mouse embryonic stem cells, the localization, protein-protein and/or protein-DNA interactions of the tagged protein are studied using generic, tag-based assays. The same high-throughput approach will be generally applicable to other model systems.