Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 141
Filtrar
1.
Cell ; 163(7): 1611-27, 2015 Dec 17.
Artigo em Inglês | MEDLINE | ID: mdl-26686651

RESUMO

Spatial genome organization and its effect on transcription remains a fundamental question. We applied an advanced chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) strategy to comprehensively map higher-order chromosome folding and specific chromatin interactions mediated by CCCTC-binding factor (CTCF) and RNA polymerase II (RNAPII) with haplotype specificity and nucleotide resolution in different human cell lineages. We find that CTCF/cohesin-mediated interaction anchors serve as structural foci for spatial organization of constitutive genes concordant with CTCF-motif orientation, whereas RNAPII interacts within these structures by selectively drawing cell-type-specific genes toward CTCF foci for coordinated transcription. Furthermore, we show that haplotype variants and allelic interactions have differential effects on chromosome configuration, influencing gene expression, and may provide mechanistic insights into functions associated with disease susceptibility. 3D genome simulation suggests a model of chromatin folding around chromosomal axes, where CTCF is involved in defining the interface between condensed and open compartments for structural regulation. Our 3D genome strategy thus provides unique insights in the topological mechanism of human variations and diseases.


Assuntos
Cromatina/química , Genoma Humano , Proteínas Repressoras/metabolismo , Transcrição Gênica , Animais , Fator de Ligação a CCCTC , Proteínas de Ciclo Celular/metabolismo , Linhagem Celular , Cromatina/genética , Cromatina/metabolismo , Proteínas Cromossômicas não Histona/metabolismo , Cromossomos/metabolismo , Empacotamento do DNA , Humanos , RNA Polimerase II/metabolismo , Salamandridae , Coesinas
2.
EMBO J ; 42(5): e112443, 2023 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-36705062

RESUMO

Eukaryotic genomes are pervasively transcribed by RNA polymerase II. Yet, the molecular and biological implications of such a phenomenon are still largely puzzling. Here, we describe noncoding RNA transcription upstream of the Arabidopsis thaliana DOG1 gene, which governs salt stress responses and is a key regulator of seed dormancy. We find that expression of the DOG1 gene is induced by salt stress, thereby causing a delay in seed germination. We uncover extensive transcriptional activity on the promoter of the DOG1 gene, which produces a variety of lncRNAs. These lncRNAs, named PUPPIES, are co-directionally transcribed and extend into the DOG1 coding region. We show that PUPPIES RNAs respond to salt stress and boost DOG1 expression, resulting in delayed germination. This positive role of pervasive PUPPIES transcription on DOG1 gene expression is associated with augmented pausing of RNA polymerase II, slower transcription and higher transcriptional burst size. These findings highlight the positive role of upstream co-directional transcription in controlling transcriptional dynamics of downstream genes.


Assuntos
Proteínas de Arabidopsis , Arabidopsis , RNA Longo não Codificante , Animais , Arabidopsis/metabolismo , Proteínas de Arabidopsis/metabolismo , Regulação da Expressão Gênica de Plantas , Germinação/genética , Mutação , RNA Polimerase II/genética , RNA Polimerase II/metabolismo , RNA Longo não Codificante/metabolismo
3.
EMBO J ; 42(23): e113527, 2023 Dec 01.
Artigo em Inglês | MEDLINE | ID: mdl-37846891

RESUMO

Emergency granulopoiesis is the enhanced and accelerated production of granulocytes that occurs during acute infection. The contribution of hematopoietic stem cells (HSCs) to this process was reported; however, how HSCs participate in emergency granulopoiesis remains elusive. Here, using a mouse model of emergency granulopoiesis we observe transcriptional changes in HSCs as early as 4 h after lipopolysaccharide (LPS) administration. We observe that the HSC identity is changed towards a myeloid-biased HSC and show that CD201 is enriched in lymphoid-biased HSCs. While CD201 expression under steady-state conditions reveals a lymphoid bias, under emergency granulopoiesis loss of CD201 marks the lymphoid-to-myeloid transcriptional switch. Mechanistically, we determine that lymphoid-biased CD201+ HSCs act as a first response during emergency granulopoiesis due to direct sensing of LPS by TLR4 and downstream activation of NF-κΒ signaling. The myeloid-biased CD201- HSC population responds indirectly during an acute infection by sensing G-CSF, increasing STAT3 phosphorylation, and upregulating LAP/LAP* C/EBPß isoforms. In conclusion, HSC subpopulations support early phases of emergency granulopoiesis due to their transcriptional rewiring from a lymphoid-biased to myeloid-biased population and thus establishing alternative paths to supply elevated numbers of granulocytes.


Assuntos
Células-Tronco Hematopoéticas , Lipopolissacarídeos , Lipopolissacarídeos/metabolismo , Hematopoese , Granulócitos/metabolismo
4.
Methods ; 223: 106-117, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38295892

RESUMO

The connection between the patterns observed in 3C-type experiments and the modeling of polymers remains unresolved. This paper presents a simulation pipeline that generates thermodynamic ensembles of 3D structures for topologically associated domain (TAD) regions by loop extrusion model (LEM). The simulations consist of two main components: a stochastic simulation phase, employing a Monte Carlo approach to simulate the binding positions of cohesins, and a dynamical simulation phase, utilizing these cohesins' positions to create 3D structures. In this approach, the system's total energy is the combined result of the Monte Carlo energy and the molecular simulation energy, which are iteratively updated. The structural maintenance of chromosomes (SMC) protein complexes are represented as loop extruders, while the CCCTC-binding factor (CTCF) locations on DNA sequence are modeled as energy minima on the Monte Carlo energy landscape. Finally, the spatial distances between DNA segments from ChIA-PET experiments are compared with the computer simulations, and we observe significant Pearson correlations between predictions and the real data. LoopSage model offers a fresh perspective on chromatin loop dynamics, allowing us to observe phase transition between sparse and condensed states in chromatin.


Assuntos
Cromatina , Proteínas Cromossômicas não Histona , Cromatina/genética , Proteínas Cromossômicas não Histona/genética , Proteínas Cromossômicas não Histona/metabolismo , Proteínas de Ciclo Celular/genética , Proteínas de Ciclo Celular/metabolismo , Cromossomos/metabolismo , Coesinas
5.
Methods ; 226: 54-60, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38636797

RESUMO

The challenge of modelling the spatial conformation of chromatin remains an open problem. While multiple data-driven approaches have been proposed, each has limitations. This work introduces two image-driven modelling methods based on the Molecular Dynamics Flexible Fitting (MDFF) approach: the force method and the correlational method. Both methods have already been used successfully in protein modelling. We propose a novel way to employ them for building chromatin models directly from 3D images. This approach is termed image-driven modelling. Additionally, we introduce the initial structure generator, a tool designed to generate optimal starting structures for the proposed algorithms. The methods are versatile and can be applied to various data types, with minor modifications to accommodate new generation imaging techniques.


Assuntos
Algoritmos , Cromatina , Simulação de Dinâmica Molecular , Cromatina/química , Cromatina/metabolismo , Imageamento Tridimensional/métodos , Humanos
6.
Bioessays ; 45(10): e2200240, 2023 10.
Artigo em Inglês | MEDLINE | ID: mdl-37603403

RESUMO

Recent advances in genomic and imaging techniques have revealed the complex manner of organizing billions of base pairs of DNA necessary for maintaining their functionality and ensuring the proper expression of genetic information. The SMC proteins and cohesin complex primarily contribute to forming higher-order chromatin structures, such as chromosomal territories, compartments, topologically associating domains (TADs) and chromatin loops anchored by CCCTC-binding factor (CTCF) protein or other genome organizers. Cohesin plays a fundamental role in chromatin organization, gene expression and regulation. This review aims to describe the current understanding of the dynamic nature of the cohesin-DNA complex and its dependence on cohesin for genome maintenance. We discuss the current 3C technique and numerous bioinformatics pipelines used to comprehend structural genomics and epigenetics focusing on the analysis of Cohesin-centred interactions. We also incorporate our present comprehension of Loop Extrusion (LE) and insights from stochastic modelling.


Assuntos
Proteínas Cromossômicas não Histona , Genoma Humano , Humanos , Proteínas Cromossômicas não Histona/genética , Proteínas de Ciclo Celular/genética , Cromatina/genética , Coesinas
7.
Nucleic Acids Res ; 51(W1): W5-W10, 2023 07 05.
Artigo em Inglês | MEDLINE | ID: mdl-37158257

RESUMO

In the current update, we added a feature for analysing changes in spatial distances between promoters and enhancers in chromatin 3D model ensembles. We updated our datasets by the novel in situ CTCF and RNAPII ChIA-PET chromatin loops obtained from the GM12878 cell line mapped to the GRCh38 genome assembly and extended the 1000 Genomes SVs dataset. To handle the new datasets, we applied GPU acceleration for the modelling engine, which gives a speed-up of 30× versus the previous versions. To improve visualisation and data analysis, we embedded the IGV tool for viewing ChIA-PET arcs with additional genes and SVs annotations. For 3D model visualisation, we added a new viewer: NGL, where we provided colouring by gene and enhancer location. The models are downloadable in mmcif and xyz format. The web server is hosted and performs calculations on DGX A100 GPU servers that provide optimal performance with multitasking. 3D-GNOME 3.0 web server provides unique insights into the topological mechanism of human variations at the population scale with high speed-up and is freely available at https://3dgnome.mini.pw.edu.pl/.


Assuntos
Cromatina , Visualização de Dados , Genoma Humano , Genômica , Humanos , Cromatina/química , Elementos Facilitadores Genéticos , Genoma Humano/genética , Regiões Promotoras Genéticas , Genômica/instrumentação , Genômica/métodos , Conformação Molecular , Simulação por Computador , Internet
8.
Semin Cell Dev Biol ; 121: 171-185, 2022 01.
Artigo em Inglês | MEDLINE | ID: mdl-34429265

RESUMO

The three-dimensional structure of the human genome has been proven to have a significant functional impact on gene expression. The high-order spatial chromatin is organised first by looping mediated by multiple protein factors, and then it is further formed into larger structures of topologically associated domains (TADs) or chromatin contact domains (CCDs), followed by A/B compartments and finally the chromosomal territories (CTs). The genetic variation observed in human population influences the multi-scale structures, posing a question regarding the functional impact of structural variants reflected by the variability of the genes expression patterns. The current methods of evaluating the functional effect include eQTLs analysis which uses statistical testing of influence of variants on spatially close genes. Rarely, non-coding DNA sequence changes are evaluated by their impact on the biomolecular interaction network (BIN) reflecting the cellular interactome that can be analysed by the classical graph-theoretic algorithms. Therefore, in the second part of the review, we introduce the concept of BIN, i.e. a meta-network model of the complete molecular interactome developed by integrating various biological networks. The BIN meta-network model includes DNA-protein binding by the plethora of protein factors as well as chromatin interactions, therefore allowing connection of genomics with the downstream biomolecular processes present in a cell. As an illustration, we scrutinise the chromatin interactions mediated by the CTCF protein detected in a ChIA-PET experiment in the human lymphoblastoid cell line GM12878. In the corresponding BIN meta-network the DNA spatial proximity is represented as a graph model, combined with the Proteins-Interaction Network (PIN) of human proteome using the Gene Association Network (GAN). Furthermore, we enriched the BIN with the signalling and metabolic pathways and Gene Ontology (GO) terms to assert its functional context. Finally, we mapped the Single Nucleotide Polymorphisms (SNPs) from the GWAS studies and identified the chromatin mutational hot-spots associated with a significant enrichment of SNPs related to autoimmune diseases. Afterwards, we mapped Structural Variants (SVs) from healthy individuals of 1000 Genomes Project and identified an interesting example of the missing protein complex associated with protein Q6GYQ0 due to a deletion on chromosome 14. Such an analysis using the meta-network BIN model is therefore helpful in evaluating the influence of genetic variation on spatial organisation of the genome and its functional effect in a cell.


Assuntos
Cromatina/metabolismo , Genoma Humano/genética , Genômica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Mapas de Interação de Proteínas/genética , Humanos
9.
Curr Issues Mol Biol ; 46(3): 2713-2740, 2024 Mar 21.
Artigo em Inglês | MEDLINE | ID: mdl-38534787

RESUMO

HER2-positive breast cancer is one of the most prevalent forms of cancer among women worldwide. Generally, the molecular characteristics of this breast cancer include activation of human epidermal growth factor receptor-2 (HER2) and hormone receptor activation. HER2-positive is associated with a higher death rate, which led to the development of a monoclonal antibody called trastuzumab, specifically targeting HER2. The success rate of HER2-positive breast cancer treatment has been increased; however, drug resistance remains a challenge. This fact motivated us to explore the underlying molecular mechanisms of trastuzumab resistance. For this purpose, a two-fold approach was taken by considering well-known breast cancer cell lines SKBR3 and BT474. In the first fold, trastuzumab treatment doses were optimized separately for both cell lines. This was done based on the proliferation rate of cells in response to a wide variety of medication dosages. Thereafter, each cell line was cultivated with a steady dosage of herceptin for several months. During this period, six time points were selected for further in vitro analysis, ranging from the untreated cell line at the beginning to a fully resistant cell line at the end of the experiment. In the second fold, nucleic acids were extracted for further high throughput-based microarray experiments of gene and microRNA expression. Such expression data were further analyzed in order to infer the molecular mechanisms involved in the underlying development of trastuzumab resistance. In the list of differentially expressed genes and miRNAs, multiple genes (e.g., BIRC5, E2F1, TFRC, and USP1) and miRNAs (e.g., hsa miR 574 3p, hsa miR 4530, and hsa miR 197 3p) responsible for trastuzumab resistance were found. Downstream analysis showed that TFRC, E2F1, and USP1 were also targeted by hsa-miR-8485. Moreover, it indicated that miR-4701-5p was highly expressed as compared to TFRC in the SKBR3 cell line. These results unveil key genes and miRNAs as molecular regulators for trastuzumab resistance.

10.
Brief Bioinform ; 23(6)2022 11 19.
Artigo em Inglês | MEDLINE | ID: mdl-36094071

RESUMO

The emerging ligation-free three-dimensional (3D) genome mapping technologies can identify multiplex chromatin interactions with single-molecule precision. These technologies not only offer new insight into high-dimensional chromatin organization and gene regulation, but also introduce new challenges in data visualization and analysis. To overcome these challenges, we developed MCIBox, a toolkit for multi-way chromatin interaction (MCI) analysis, including a visualization tool and a platform for identifying micro-domains with clustered single-molecule chromatin complexes. MCIBox is based on various clustering algorithms integrated with dimensionality reduction methods that can display multiplex chromatin interactions at single-molecule level, allowing users to explore chromatin extrusion patterns and super-enhancers regulation modes in transcription, and to identify single-molecule chromatin complexes that are clustered into micro-domains. Furthermore, MCIBox incorporates a two-dimensional kernel density estimation algorithm to identify micro-domains boundaries automatically. These micro-domains were stratified with distinctive signatures of transcription activity and contained different cell-cycle-associated genes. Taken together, MCIBox represents an invaluable tool for the study of multiple chromatin interactions and inaugurates a previously unappreciated view of 3D genome structure.


Assuntos
Cromatina , Sequências Reguladoras de Ácido Nucleico , Cromatina/genética , Genoma , Regulação da Expressão Gênica
11.
Bioinformatics ; 39(8)2023 08 01.
Artigo em Inglês | MEDLINE | ID: mdl-37535685

RESUMO

MOTIVATION: The advent of T-cell receptor (TCR) sequencing experiments allowed for a significant increase in the amount of peptide:TCR binding data available and a number of machine-learning models appeared in recent years. High-quality prediction models for a fixed epitope sequence are feasible, provided enough known binding TCR sequences are available. However, their performance drops significantly for previously unseen peptides. RESULTS: We prepare the dataset of known peptide:TCR binders and augment it with negative decoys created using healthy donors' T-cell repertoires. We employ deep learning methods commonly applied in Natural Language Processing to train part a peptide:TCR binding model with a degree of cross-peptide generalization (0.69 AUROC). We demonstrate that BERTrand outperforms the published methods when evaluated on peptide sequences not used during model training. AVAILABILITY AND IMPLEMENTATION: The datasets and the code for model training are available at https://github.com/SFGLab/bertrand.


Assuntos
Peptídeos , Receptores de Antígenos de Linfócitos T , Peptídeos/metabolismo , Receptores de Antígenos de Linfócitos T/metabolismo , Ligação Proteica , Epitopos , Aprendizado de Máquina
12.
Bioinformatics ; 39(10)2023 10 03.
Artigo em Inglês | MEDLINE | ID: mdl-37774005

RESUMO

MOTIVATION: Investigating the 3D structure of chromatin provides new insights into transcriptional regulation. With the evolution of 3C next-generation sequencing methods like ChiA-PET and Hi-C, the surge in data volume has highlighted the need for more efficient chromatin spatial modelling algorithms. This study introduces the cudaMMC method, based on the Simulated Annealing Monte Carlo approach and enhanced by GPU-accelerated computing, to efficiently generate ensembles of chromatin 3D structures. RESULTS: The cudaMMC calculations demonstrate significantly faster performance with better stability compared to our previous method on the same workstation. cudaMMC also substantially reduces the computation time required for generating ensembles of large chromatin models, making it an invaluable tool for studying chromatin spatial conformation. AVAILABILITY AND IMPLEMENTATION: Open-source software and manual and sample data are freely available on https://github.com/SFGLab/cudaMMC.


Assuntos
Cromatina , Software , Cromossomos , Algoritmos , Conformação Molecular , Método de Monte Carlo
13.
Int J Mol Sci ; 25(11)2024 May 25.
Artigo em Inglês | MEDLINE | ID: mdl-38891944

RESUMO

Gilles de la Tourette syndrome (GTS) is a neurodevelopmental psychiatric disorder with complex and elusive etiology with a significant role of genetic factors. The aim of this study was to identify structural variants that could be associated with familial GTS. The study group comprised 17 multiplex families with 80 patients. Structural variants were identified from whole-genome sequencing data and followed by co-segregation and bioinformatic analyses. The localization of these variants was used to select candidate genes and create gene sets, which were subsequently processed in gene ontology and pathway enrichment analysis. Seventy putative pathogenic variants shared among affected individuals within one family but not present in the control group were identified. Only four private or rare deletions were exonic in LDLRAD4, B2M, USH2A, and ZNF765 genes. Notably, the USH2A gene is involved in cochlear development and sensory perception of sound, a process that was associated previously with familial GTS. In addition, two rare variants and three not present in the control group were co-segregating with the disease in two families, and uncommon insertions in GOLM1 and DISC1 were co-segregating in three families each. Enrichment analysis showed that identified structural variants affected synaptic vesicle endocytosis, cell leading-edge organization, and signaling for neurite outgrowth. The results further support the involvement of the regulation of neurotransmission, neuronal migration, and sound-sensing in GTS.


Assuntos
Linhagem , Síndrome de Tourette , Humanos , Síndrome de Tourette/genética , Masculino , Feminino , Predisposição Genética para Doença , Proteínas da Matriz Extracelular/genética , Proteínas da Matriz Extracelular/metabolismo , Adulto , Sequenciamento Completo do Genoma
14.
Bioinformatics ; 38(24): 5440-5442, 2022 12 13.
Artigo em Inglês | MEDLINE | ID: mdl-36315072

RESUMO

SUMMARY: The detection of the structural variants (SVs) using Illumina sequencing of human DNA is not an easy task. Multiple approaches have been proposed; however, all the methods have their limitations. In this article, we present ConsensuSV pipeline that aids the research in complex variant detection. By using consensus meta-approach, eight independent SV callers are being used to identify a uniform set of high-quality SVs. The pipeline works using raw sequencing data and performs all the necessary steps automatically, significantly reducing the researchers' time required for processing the data. The output files contain SVs, single nucleotide polymorphisms and Indels. The pipeline uses luigi framework, allowing the software to be run efficiently and parallelly using the high-performance computing infrastructure. We strongly believe that the software is useful to the scientific community interested in the germline variant detection. AVAILABILITY AND IMPLEMENTATION: https://github.com/SFGLab/ConsensuSV-pipeline. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Software , Humanos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento Completo do Genoma , Mutação INDEL , Polimorfismo de Nucleotídeo Único
15.
Methods ; 203: 498-510, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35167916

RESUMO

For the last two years, the COVID-19 pandemic has continued to bring consternation on most of the world. According to recent WHO estimates, there have been more than 5.6 million deaths worldwide. The virus continues to evolve all over the world, thus requiring both vigilance and the necessity to find and develop a variety of therapeutic treatments, including the identification of specific antiviral drugs. Multiple studies have confirmed that SARS-CoV-2 utilizes its membrane-bound spike protein to recognize human angiotensin-converting enzyme 2 (ACE2). Thus, preventing spike-ACE2 interactions is a potentially viable strategy for COVID-19 treatment as it would block the virus from binding and entering into a host cell. This work aims to identify potential drugs using an in silico approach. Molecular docking was carried out on both approved drugs and substances previously tested in vivo. This step was followed by a more detailed analysis of selected ligands by molecular dynamics simulations to identify the best molecules that thwart the ability of the virus to interact with the ACE2 receptor. Because the SARS-CoV-2 virus evolves rapidly due to a plethora of immunocompromised hosts, the compounds were tested against five different known lineages. As a result, we could identify substances that work well on individual lineages and those showing broader efficacy. The most promising candidates among the currently used drugs were zafirlukast and simeprevir with an average binding affinity of -22 kcal/mol for spike proteins originating from various lineages. The first compound is a leukotriene receptor antagonist that is used to treat asthma, while the latter is a protease inhibitor used for hepatitis C treatment. From among the in vivo tested substances that concurrently exhibit promising free energy of binding and ADME parameters (indicating a possible oral administration) we selected the compound BDBM50136234. In conclusion, these molecules are worth exploring further by in vitro and in vivo studies against SARS-CoV-2.


Assuntos
Tratamento Farmacológico da COVID-19 , SARS-CoV-2 , Enzima de Conversão de Angiotensina 2 , Antivirais/farmacologia , Antivirais/uso terapêutico , Reposicionamento de Medicamentos , Humanos , Simulação de Acoplamento Molecular , Simulação de Dinâmica Molecular , Pandemias
16.
Brief Bioinform ; 21(2): 458-472, 2020 03 23.
Artigo em Inglês | MEDLINE | ID: mdl-30698641

RESUMO

There are multiple definitions for low complexity regions (LCRs) in protein sequences, with all of them broadly considering LCRs as regions with fewer amino acid types compared to an average composition. Following this view, LCRs can also be defined as regions showing composition bias. In this critical review, we focus on the definition of sequence complexity of LCRs and their connection with structure. We present statistics and methodological approaches that measure low complexity (LC) and related sequence properties. Composition bias is often associated with LC and disorder, but repeats, while compositionally biased, might also induce ordered structures. We illustrate this dichotomy, and more generally the overlaps between different properties related to LCRs, using examples. We argue that statistical measures alone cannot capture all structural aspects of LCRs and recommend the combined usage of a variety of predictive tools and measurements. While the methodologies available to study LCRs are already very advanced, we foresee that a more comprehensive annotation of sequences in the databases will enable the improvement of predictions and a better understanding of the evolution and the connection between structure and function of LCRs. This will require the use of standards for the generation and exchange of data describing all aspects of LCRs. SHORT ABSTRACT: There are multiple definitions for low complexity regions (LCRs) in protein sequences. In this critical review, we focus on the definition of sequence complexity of LCRs and their connection with structure. We present statistics and methodological approaches that measure low complexity (LC) and related sequence properties. Composition bias is often associated with LC and disorder, but repeats, while compositionally biased, might also induce ordered structures. We illustrate this dichotomy, plus overlaps between different properties related to LCRs, using examples.


Assuntos
Proteínas/química , Algoritmos , Sequência de Aminoácidos , Bases de Dados de Proteínas , Evolução Molecular , Conformação Proteica , Domínios Proteicos
17.
Nucleic Acids Res ; 48(W1): W170-W176, 2020 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-32442297

RESUMO

Structural variants (SVs) that alter DNA sequence emerge as a driving force involved in the reorganisation of DNA spatial folding, thus affecting gene transcription. In this work, we describe an improved version of our integrated web service for structural modeling of three-dimensional genome (3D-GNOME), which now incorporates all types of SVs to model changes to the reference 3D conformation of chromatin. In 3D-GNOME 2.0, the default reference 3D genome structure is generated using ChIA-PET data from the GM12878 cell line and SVs data are sourced from the population-scale catalogue of SVs identified by the 1000 Genomes Consortium. However, users may also submit their own structural data to set a customized reference genome structure, and/or a custom input list of SVs. 3D-GNOME 2.0 provides novel tools to inspect, visualize and compare 3D models for regions that differ in terms of their linear genomic sequence. Contact diagrams are displayed to compare the reference 3D structure with the one altered by SVs. In our opinion, 3D-GNOME 2.0 is a unique online tool for modeling and analyzing conformational changes to the human genome induced by SVs across populations. It can be freely accessed at https://3dgnome.cent.uw.edu.pl/.


Assuntos
Cromatina/química , Variação Estrutural do Genoma , Modelos Moleculares , Software , Inversão Cromossômica , Genoma Humano , Humanos , Modelos Genéticos , Conformação Molecular , Deleção de Sequência
18.
BMC Bioinformatics ; 22(1): 72, 2021 Feb 17.
Artigo em Inglês | MEDLINE | ID: mdl-33596823

RESUMO

BACKGROUND: Bioimaging techniques offer a robust tool for studying molecular pathways and morphological phenotypes of cell populations subjected to various conditions. As modern high-resolution 3D microscopy provides access to an ever-increasing amount of high-quality images, there arises a need for their analysis in an automated, unbiased, and simple way. Segmentation of structures within the cell nucleus, which is the focus of this paper, presents a new layer of complexity in the form of dense packing and significant signal overlap. At the same time, the available segmentation tools provide a steep learning curve for new users with a limited technical background. This is especially apparent in the bulk processing of image sets, which requires the use of some form of programming notation. RESULTS: In this paper, we present PartSeg, a tool for segmentation and reconstruction of 3D microscopy images, optimised for the study of the cell nucleus. PartSeg integrates refined versions of several state-of-the-art algorithms, including a new multi-scale approach for segmentation and quantitative analysis of 3D microscopy images. The features and user-friendly interface of PartSeg were carefully planned with biologists in mind, based on analysis of multiple use cases and difficulties encountered with other tools, to offer an ergonomic interface with a minimal entry barrier. Bulk processing in an ad-hoc manner is possible without the need for programmer support. As the size of datasets of interest grows, such bulk processing solutions become essential for proper statistical analysis of results. Advanced users can use PartSeg components as a library within Python data processing and visualisation pipelines, for example within Jupyter notebooks. The tool is extensible so that new functionality and algorithms can be added by the use of plugins. For biologists, the utility of PartSeg is presented in several scenarios, showing the quantitative analysis of nuclear structures. CONCLUSIONS: In this paper, we have presented PartSeg which is a tool for precise and verifiable segmentation and reconstruction of 3D microscopy images. PartSeg is optimised for cell nucleus analysis and offers multi-scale segmentation algorithms best-suited for this task. PartSeg can also be used for the bulk processing of multiple images and its components can be reused in other systems or computational experiments.


Assuntos
Imageamento Tridimensional , Microscopia , Algoritmos , Núcleo Celular , Processamento de Imagem Assistida por Computador
19.
Semin Cell Dev Biol ; 90: 114-127, 2019 06.
Artigo em Inglês | MEDLINE | ID: mdl-30096365

RESUMO

The eukaryotic genome, constituting several billion base pairs, must be contracted to fit within the volume of a nucleus where the diameter is on the scale of µm. The 3D structure and packing of such a long sequence cannot be left to pure chance, as DNA must be efficiently used for its primary roles as a matrix for transcription and replication. In recent years, methods like chromatin conformation capture (including 3C, 4C, Hi-C, ChIA-PET and Multi-ChIA) and optical microscopy have advanced substantially and have shed new light on how eukaryotic genomes are hierarchically organized; first into 10-nm fiber, next into DNA loops, topologically associated domains and finally into interphase or mitotic chromosomes. This knowledge has allowed us to revise our understanding regarding the mechanisms governing the process of DNA organization. Mounting experimental evidence suggests that the key element in the formation of loops is the binding of the CCCTC-binding factor (CTCF) to DNA; a protein that can be referred to as the chief organizer of the genome. However, CTCF does not work alone but in cooperation with other proteins, such as cohesin or Yin Yang 1 (YY1). In this short review, we briefly describe our current understanding of the structure of eukaryotic genomes, how they are established and how the formation of DNA loops can influence gene expression. We discuss the recent discoveries describing the 3D structure of the CTCF-DNA complex and the role of CTCF in establishing genome structure. Finally, we briefly explain how various genetic disorders might arise as a consequence of mutations in the CTCF target sequence or alteration of genomic imprinting.


Assuntos
Fator de Ligação a CCCTC/genética , Genoma Humano/genética , Fator de Ligação a CCCTC/química , DNA/química , DNA/genética , Humanos
20.
Methods ; 181-182: 35-51, 2020 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-32645447

RESUMO

In recent years, high-throughput techniques have revealed considerable structural organization of the human genome with diverse regions of the chromatin interacting with each other in the form of loops. Some of these loops are quite complex and may encompass regions comprised of many interacting chain segments around a central locus. Popular techniques for extracting this information are chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) and high-throughput chromosome conformation capture (Hi-C). Here, we introduce a physics-based method to predict the three-dimensional structure of chromatin from population-averaged ChIA-PET data. The approach uses experimentally-validated data from human B-lymphoblastoid cells to generate 2D meta-structures of chromatin using a dynamic programming algorithm that explores the chromatin free energy landscape. By generating both optimal and suboptimal meta-structures we can calculate both the free energy and additionally the relative thermodynamic probability. A 3D structure prediction program with applied restraints then can be used to generate the tertiary structures. The main advantage of this approach for population-averaged experimental data is that it provides a way to distinguish between the principal and the spurious contacts. This study also finds that euchromatin appear to have rather precisely regulated 2D meta-structures compared to heterochromatin. The program source-code is available at https://github.com/plewczynski/looper.


Assuntos
Genoma Humano , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Conformação Molecular , Análise de Sequência de DNA/métodos , Algoritmos , Linhagem Celular Tumoral , Montagem e Desmontagem da Cromatina/genética , Entropia , Eucromatina/química , Eucromatina/genética , Eucromatina/metabolismo , Heterocromatina/química , Heterocromatina/genética , Heterocromatina/metabolismo , Humanos , Software
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA