RESUMEN
Structural variants (SVs) pose a challenge to detect and interpret, but their study provides novel biological insights and molecular diagnosis underlying rare diseases. The aim of this study was to resolve a 9p24 rearrangement segregating in a family through five generations with a congenital heart defect (congenital pulmonary and aortic valvular stenosis and pulmonary artery stenosis), by applying a combined genomic analysis. The analysis involved multiple techniques, including karyotype, chromosomal microarray analysis (CMA), FISH, genome sequencing (GS), RNA-seq, and optical genome mapping (OGM). A complex 9p24 SV was hinted at by CMA results, showing three interspersed duplicated segments. Combined GS and OGM analyses revealed that the 9p24 duplications constitute a complex SV, on which a set of breakpoints matches the boundaries of the CMA duplicated sequences. The proposed structure for this complex rearrangement implies three duplications associated with an inversion of ~ 2 Mb region on chromosome 9 and a SINE element insertion at the more distal breakpoint. Interestingly, this genomic structure of rearrangement forms a chimeric transcript of the KANK1/DMRT1 loci, which was confirmed by both RNA-seq and Sanger sequencing on blood samples from 9p24 rearrangement carriers. Altogether with breakpoint amplification and FISH analysis, this combined approach allowed a deep characterization of this complex rearrangement. Although the genotype-phenotype correlation remains elusive from the molecular mechanism point of view, this study identified a large genomic rearrangement at 9p24 segregating with a familial congenital heart defect, revealing a genetic biomarker that was successfully applied for embryo selection, changing the reproductive perspective of affected individuals.
Asunto(s)
Cromosomas , Variaciones en el Número de Copia de ADN , Humanos , Inversión Cromosómica , Secuencia de Bases , Células Germinativas , Proteínas del Citoesqueleto/genética , Proteínas Adaptadoras Transductoras de Señales/genéticaRESUMEN
Extraordinary extended lampbrush chromosomes with thousands of transcription loops are favorable objects in chromosome biology. Chromosomes become lampbrushy due to unusually high rate of transcription during oogenesis. However, until recently, the information on the spectrum of transcribed sequences as well as genomic context of individual chromomeres was mainly limited to tandemly repetitive elements. Here we briefly outline novel findings and future directions in lampbrush chromosome studies in the post-genomic era. We emphasize the fruitfulness of combining genome-wide approaches with microscopy imaging techniques using lampbrush chromosomes as a remarkable model object. We believe that new data on the spectrum of sequences transcribed on the lateral loops of lampbrush chromosomes and their structural organization push the boundaries in the discussion of their biological role. Also see the video abstract here: https://youtu.be/zexoHfzX9rM.
Asunto(s)
Cromosomas , Transcripción Genética , Cromosomas/genética , GenómicaRESUMEN
An increasing number of studies emphasize the role of non-coding variants in the development of hereditary diseases. However, the interpretation of such variants in clinical genetic testing still remains a critical challenge due to poor knowledge of their pathogenicity mechanisms. It was previously shown that variants in 5'-untranslated regions (5'UTRs) can lead to hereditary diseases due to disruption of upstream open reading frames (uORFs). Here, we performed a manual annotation of upstream translation initiation sites (TISs) in human disease-associated genes from the OMIM database and revealed â¼4.7 thousand of TISs related to uORFs. We compared our TISs with the previous studies and provided a list of 'high confidence' uORFs. Using a luciferase assay, we experimentally validated the translation of uORFs in the ETFDH, PAX9, MAST1, HTT, TTN,GLI2 and COL2A1 genes, as well as existence of N-terminal CDS extension in the ZIC2 gene. Besides, we created a tool to annotate the effects of genetic variants located in uORFs. We revealed the variants from the HGMD and ClinVar databases that disrupt uORFs and thereby could lead to Mendelian disorders. We also showed that the distribution of uORFs-affecting variants differs between pathogenic and population variants. Finally, drawing on manually curated data, we developed a machine-learning algorithm that allows us to predict the TISs in other human genes.
Asunto(s)
Regiones no Traducidas 5' , Bases de Datos Genéticas , Enfermedad , Sistemas de Lectura Abierta , Humanos , Biosíntesis de Proteínas , Enfermedad/genéticaRESUMEN
Alterations of human karyotype caused by chromosomal rearrangements are often associated with considerable phenotypic effects. Studying molecular mechanisms underlying these effects requires an efficient and scalable experimental model. Here, we propose a Cre-LoxP-based approach for the generation of combinatorial diversity of chromosomal rearrangements. We demonstrate that using the developed system, both intra- and inter-chromosomal rearrangements can be induced in the human haploid HAP1 cells, although the latter is significantly less effective. The obtained genetically modified HAP1 cell line can be used to dissect genomic effects associated with intra-chromosomal structural variations.
Asunto(s)
Cromosomas , Reordenamiento Génico , Recombinación Genética , Humanos , Cromosomas/genética , Cromosomas/metabolismo , Reordenamiento Génico/genética , Reordenamiento Génico/fisiología , Integrasas/genética , Integrasas/metabolismo , Recombinación Genética/genética , Recombinación Genética/fisiología , Línea CelularRESUMEN
Chromosome conformation capture techniques have revolutionized our understanding of chromatin architecture and dynamics at the genome-wide scale. In recent years, these methods have been applied to a diverse array of species, revealing fundamental principles of chromosomal organization. However, structural organization of the extrachromosomal entities, like viral genomes or plasmids, and their interactions with the host genome, remain relatively underexplored. In this work, we introduce an enhanced 4C-protocol tailored for probing plasmid DNA interactions. We design specific plasmid vector and optimize protocol to allow high detection rate of contacts between the plasmid and host DNA.
Asunto(s)
Plásmidos , Plásmidos/metabolismo , Plásmidos/genética , ADN/química , ADN/genética , Cromatina/genética , Cromatina/metabolismo , Cromatina/química , GenomaRESUMEN
Molecular genetic analysis of tumor tissues is the most important step towards understanding the mechanisms of cancer development; it is also necessary for the choice of targeted therapy. The Hi-C (high-throughput chromatin conformation capture) technology can be used to detect various types of genomic variants, including balanced chromosomal rearrangements, such as inversions and translocations. We propose a modification of the Hi-C method for the analysis of chromatin contacts in formalin-fixed paraffin-embedded (FFPE) sections of tumor tissues. The developed protocol allows to generate high-quality Hi-C data and detect all types of chromosomal rearrangements. We have analyzed various databases to compile a comprehensive list of translocations that hold clinical importance for the targeted therapy selection. The practical value of molecular genetic testing is its ability to influence the treatment strategies and to provide prognostic insights. Detecting specific chromosomal rearrangements can guide the choice of the targeted therapies, which is a critical aspect of personalized medicine in oncology.
Asunto(s)
Formaldehído , Neoplasias , Adhesión en Parafina , Humanos , Neoplasias/genética , Neoplasias/patología , Formaldehído/química , Translocación Genética , Fijación del Tejido , Cromatina/genética , Cromatina/metabolismo , Cromatina/químicaRESUMEN
Lymphedema is a disorder that leads to excessive swelling due to lymphatic insufficiency, resulting in the accumulation of protein-rich interstitial fluid. Primary lymphedema predominantly impacts the lower extremities and is frequently linked to hereditary factors. This condition is known to be associated with variants in several genes, such as FOXC2, FLT4, and SOX18. However, many cases remain unexplained, suggesting undiscovered gene associations. This study describes a novel mutation in the hepatocyte growth factor (HGF) gene, a previously hypothesized candidate for lymphedema pathogenesis. This mutation was identified in affected members of a multigenerational family presenting with primary leg lymphedema, consistent with an autosomal dominant inheritance pattern.
Asunto(s)
Factor de Crecimiento de Hepatocito , Linfedema , Mutación , Adulto , Femenino , Humanos , Masculino , Persona de Mediana Edad , Factor de Crecimiento de Hepatocito/genética , Factor de Crecimiento de Hepatocito/metabolismo , Linfedema/genética , LinajeRESUMEN
Recent experimental and computational efforts have provided large data sets describing three-dimensional organization of mouse and human genomes and showed the interconnection between the expression profile, epigenetic state, and spatial interactions of loci. These interconnections were utilized to infer the spatial organization of chromatin, including enhancer-promoter contacts, from one-dimensional epigenetic marks. Here, we show that the predictive power of some of these algorithms is overestimated due to peculiar properties of the biological data. We propose an alternative approach, which provides high-quality predictions of chromatin interactions using information on gene expression and CTCF-binding alone. Using multiple metrics, we confirmed that our algorithm could efficiently predict the three-dimensional architecture of both normal and rearranged genomes.
Asunto(s)
Biología Computacional/métodos , Elementos de Facilitación Genéticos , Regulación de la Expresión Génica , Genómica/métodos , Regiones Promotoras Genéticas , Animales , Mapeo Cromosómico , Bases de Datos Genéticas , Epigenómica , Humanos , Aprendizaje Automático , Ratones , Especificidad de Órganos , Programas InformáticosRESUMEN
Structural maintenance of chromosomes (SMC) complexes are essential proteins found in genomes of all cellular organisms. Essential functions of these proteins, such as mitotic chromosome formation and sister chromatid cohesion, were discovered a long time ago. Recent advances in chromatin biology showed that SMC proteins are involved in many other genomic processes, acting as active motors extruding DNA, which leads to the formation of chromatin loops. Some loops formed by SMC proteins are highly cell type and developmental stage specific, such as SMC-mediated DNA loops required for VDJ recombination in B-cell progenitors, or dosage compensation in Caenorhabditis elegans and X-chromosome inactivation in mice. In this review, we focus on the extrusion-based mechanisms that are common for multiple cell types and species. We will first describe an anatomy of SMC complexes and their accessory proteins. Next, we provide biochemical details of the extrusion process. We follow this by the sections describing the role of SMC complexes in gene regulation, DNA repair, and chromatin topology.
Asunto(s)
Proteínas de Ciclo Celular , Proteínas Cromosómicas no Histona , Animales , Ratones , Proteínas Cromosómicas no Histona/metabolismo , Proteínas de Ciclo Celular/metabolismo , Cromatina , ADN/química , Caenorhabditis elegans/metabolismoRESUMEN
Mechanisms that ensure repair of double-strand DNA breaks (DSBs) are instrumental in the integration of foreign DNA into the genome of transgenic organisms. After pronuclear microinjection, exogenous DNA is usually found as a concatemer comprising multiple co-integrated transgene copies. Here, we investigated the contribution of various DSB repair pathways to the concatemer formation. We injected mouse zygotes with a pool of linear DNA molecules carrying unique barcodes at both ends and obtained 10 transgenic embryos with 1-300 transgene copies. Sequencing the barcodes allowed us to assign relative positions to the copies in concatemers and detect recombination events that occurred during integration. Cumulative analysis of approximately 1,000 integrated copies reveals that over 80% of them underwent recombination when their linear ends were processed by synthesis-dependent strand annealing (SDSA) or double-strand break repair (DSBR). We also observed evidence of double Holliday junction (dHJ) formation and crossing over during the concatemer formations. Sequencing indels at the junctions between copies shows that at least 10% of DNA molecules introduced into the zygotes are ligated by non-homologous end joining (NHEJ). Our barcoding approach, verified with Pacific Biosciences Single Molecule Real-Time (SMRT) long-range sequencing, documents high activity of homologous recombination after DNA microinjection.
Asunto(s)
Roturas del ADN de Doble Cadena , ADN/química , Recombinación Homóloga/genética , Transgenes/genética , Animales , Animales Modificados Genéticamente , ADN/genética , Código de Barras del ADN Taxonómico , Reparación del ADN por Unión de Extremidades/genética , Reparación del ADN/genética , ADN Cruciforme/química , ADN Cruciforme/genética , Ratones , Cigoto/crecimiento & desarrolloRESUMEN
DNA methylation is an essential epigenetic regulation mechanism implicated in transcription and replication control, developmental reprogramming, retroelements silencing and other genomic processes. During mammalian development, a specific DNA methylation pattern should be established in germ cells to allow embryonic development. Less is known about germ cell DNA methylation in other species. To close this gap, we performed a single-cell methylome analysis of chicken diplotene oocytes. We comprehensively characterized methylation patterns in these cells, obtained methylation-based chicken genome segmentation and identified oocyte-specific methylated gene promoters. Our data show that despite the formation of specific transcriptionally hyperactive genome architecture in chicken diplotene oocytes, methylation patterns in these cells closely resemble genomic distribution observed in somatic tissues.
Asunto(s)
Metilación de ADN , Epigénesis Genética , Animales , Pollos/genética , Retroelementos/genética , Oocitos/metabolismo , Cromosomas/genética , MamíferosRESUMEN
Most copy number variations (CNVs) in the human genome display incomplete penetrance with unknown underlying mechanisms. One such mechanism may be epigenetic modification, particularly DNA methylation. The IMMP2L gene is located in a critical region for autism susceptibility on chromosome 7q (AUTS1). The level of DNA methylation was assessed by bisulfite sequencing of 87 CpG sites in the IMMP2L gene in 3 families with maternally inherited 7q31.1 microdeletions affecting the IMMP2L gene alone. Bisulfite sequencing revealed comparable levels of DNA methylation in the probands, healthy siblings without microdeletions, and their fathers. In contrast, a reduced DNA methylation index and increased IMMP2L expression were observed in lymphocytes from the healthy mothers compared with the probands. A number of genes were upregulated in the healthy mothers compared to controls and downregulated in probands compared to mothers. These genes were enriched in components of the ribosome and electron transport chain, as well as oxidative phosphorylation and various degenerative conditions. Differential expression in probands and mothers with IMMP2L deletions relative to controls may be due to compensatory processes in healthy mothers with IMMP2L deletions and disturbances of these processes in probands with intellectual disability. The results suggest a possible partial compensation for IMMP2L gene haploinsufficiency in healthy mothers with the 7q31.1 microdeletion by reducing the DNA methylation level. Differential DNA methylation of intragenic CpG sites may affect the phenotypic manifestation of CNVs and explain the incomplete penetrance of chromosomal microdeletions.
Asunto(s)
Deleción Cromosómica , Cromosomas Humanos Par 7/genética , Metilación de ADN , Discapacidades del Desarrollo/genética , Endopeptidasas/genética , Discapacidad Intelectual/genética , Adolescente , Adulto , Niño , Preescolar , Islas de CpG/genética , Salud de la Familia , Femenino , Perfilación de la Expresión Génica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Masculino , Herencia Materna/genéticaRESUMEN
How chromosomes are folded, spatially organized and regulated in three dimensions inside the cell nucleus are among the longest standing questions in cell biology. Genome-wide chromosome conformation capture (Hi-C) technique allowed identifying and characterizing spatial chromatin compartments in several mammalian species. Here, we present the first genome-wide analysis of chromatin interactions in chicken embryonic fibroblasts (CEF) and adult erythrocytes. We showed that genome of CEF is partitioned into topologically associated domains (TADs), distributed in accordance with gene density, transcriptional activity and CTCF-binding sites. In contrast to mammals, where all examined somatic cell types display relatively similar spatial organization of genome, chicken erythrocytes strongly differ from fibroblasts, showing pronounced A- and B- compartments, absence of typical TADs and formation of long-range chromatin interactions previously observed on mitotic chromosomes. Comparing mammalian and chicken genome architectures, we provide evidence highlighting evolutionary role of chicken TADs and their significance in genome activity and regulation.
Asunto(s)
Pollos/genética , Cromatina/ultraestructura , Eritrocitos/ultraestructura , Evolución Molecular , Animales , Núcleo Celular/genética , Fibroblastos/ultraestructura , GenomaRESUMEN
We report a universal straightforward strategy for the chemical synthesis of modified oligoribonucleotides containing functional groups of different structures at the 2' position of ribose. The on-column synthetic concept is based on the incorporation of two types of commercial nucleotide phosphoramidites containing orthogonal 2'-O-protecting groups, namely 2'-O-thiomorpholine-carbothioate (TC, as "permanent") and 2'-O-tert-butyl(dimethyl)silyl (tBDMS, as "temporary"), to RNA during solid-phase synthesis. Subsequently, the support-bound RNA undergoes selective deprotection and follows postsynthetic 2' functionalization of the naked hydroxyl group. This convenient method to tailor RNA, utilizing the advantages of solid phase approaches, gives an opportunity to introduce site-specifically a wide range of linkers and functional groups. By this strategy, a series of RNAs containing diverse 2' functionalities were synthesized and studied with respect to their physicochemical properties.
Asunto(s)
Oligorribonucleótidos/síntesis química , ARN/síntesis química , Técnicas de Síntesis en Fase Sólida/métodos , Secuencia de Bases , Desnaturalización de Ácido Nucleico , Oligorribonucleótidos/química , Compuestos Organofosforados/síntesis química , Compuestos Organofosforados/química , ARN/químicaRESUMEN
Neuronal tracing is a modern technology that is based on the expression of fluorescent proteins under the control of cell type-specific promoters. However, random genomic integration of the reporter construct often leads to incorrect spatial and temporal expression of the marker protein. Targeted integration (or knock-in) of the reporter coding sequence is supposed to provide better expression control by exploiting endogenous regulatory elements. Here we describe the generation of two fluorescent reporter systems: enhanced green fluorescent protein (EGFP) under pan-neural marker class III ß-tubulin (Tubb3) promoter and mEos2 under serotonergic neuron-specific tryptophan hydroxylase 2 (Tph2) promoter. Differentiation of Tubb3-EGFP embryonic stem (ES) cells into neurons revealed that though Tubb3-positive cells express EGFP, its expression level is not sufficient for the neuronal tracing by routine fluorescent microscopy. Similarly, the expression levels of mEos2-TPH2 in differentiated ES cells was very low and could be detected only on messenger RNA level using polymerase chain reaction-based methods. Our data shows that the use of endogenous regulatory elements to control transgene expression is not always beneficial compared with the random genomic integration.
Asunto(s)
Proteínas Fluorescentes Verdes/metabolismo , Proteínas Luminiscentes/metabolismo , Células Madre Embrionarias de Ratones/metabolismo , Neuronas/metabolismo , Regiones Promotoras Genéticas , Triptófano Hidroxilasa/genética , Tubulina (Proteína)/genética , Animales , Diferenciación Celular , Células Cultivadas , Vectores Genéticos , Proteínas Fluorescentes Verdes/genética , Proteínas Luminiscentes/genética , Ratones , Células Madre Embrionarias de Ratones/citología , Neuronas/citología , Recombinación Genética , TransgenesRESUMEN
The application of array-based comparative genomic hybridization and next-generation sequencing has identified many chromosomal microdeletions and microduplications in patients with different pathological phenotypes. Different copy number variations are described within the short arm of chromosome 18 in patients with skin diseases. In particular, full or partial monosomy 18p has also been associated with keratosis pilaris. Here, for the first time, we report a young male patient with intellectual disability, diabetes mellitus (type I), and keratosis pilaris, who exhibited a de novo 45-kb microduplication of exons 4-22 of LAMA1, located at 18p11.31, and a 432-kb 18p11.32 microduplication of paternal origin containing the genes METTL4, NDC80, and CBX3P2 and exons 1-15 of the SMCHD1 gene. The microduplication of LAMA1 was identified in skin fibroblasts but not in lymphocytes, whereas the larger microduplication was present in both tissues. We propose LAMA1 as a novel candidate gene for keratosis pilaris. Although inherited from a healthy father, the 18p11.32 microduplication, which included relevant genes, could also contribute to phenotype manifestation.
Asunto(s)
Anomalías Múltiples/genética , Duplicación Cromosómica/genética , Enfermedad de Darier/complicaciones , Enfermedad de Darier/genética , Cejas/anomalías , Discapacidad Intelectual/complicaciones , Discapacidad Intelectual/genética , Laminina/genética , Mosaicismo , Adolescente , Niño , Preescolar , Hibridación Genómica Comparativa , Fibroblastos/metabolismo , Regulación de la Expresión Génica , Humanos , Lactante , Recién Nacido , Masculino , Piel/patologíaRESUMEN
Structural variations are a pervasive feature of human genomes, and there is growing recognition of their role in disease development through their impact on spatial chromatin architecture. This understanding has led us to investigate the clinical significance of CNVs in noncoding regions that influence TAD structures. In this study, we focused on the Epb41l4a locus, which contains a highly conserved TAD boundary present in both human chromosome 5 and mouse chromosome 18, and its association with neurodevelopmental phenotypes. Analysis of human data from the DECIPHER database indicates that CNVs within this locus, including both deletions and duplications, are often observed alongside neurological abnormalities, such as dyslexia and intellectual disability, although there is not enough evidence of a direct correlation or causative relationship. To investigate these possible associations, we generated mouse models with deletion and inversion mutations at this locus and carried out RNA-seq analysis to elucidate gene expression changes. We found that modifications in the Epb41l4a TAD boundary led to dysregulation of the Nrep gene, which plays a crucial role in nervous system development. These findings underscore the potential pathogenicity of these CNVs and highlight the crucial role of spatial genome architecture in gene expression regulation.
Asunto(s)
Cromatina , Cromosomas Humanos Par 18 , Humanos , Animales , Ratones , Cromosomas Humanos Par 5 , Bases de Datos Factuales , Modelos Animales de EnfermedadRESUMEN
Interpretation of noncoding genomic variants is one of the most important challenges in human genetics. Machine learning methods have emerged recently as a powerful tool to solve this problem. State-of-the-art approaches allow prediction of transcriptional and epigenetic effects caused by noncoding mutations. However, these approaches require specific experimental data for training and cannot generalize across cell types where required features were not experimentally measured. We show here that available epigenetic characteristics of human cell types are extremely sparse, limiting those approaches that rely on specific epigenetic input. We propose a new neural network architecture, DeepCT, which can learn complex interconnections of epigenetic features and infer unmeasured data from any available input. Furthermore, we show that DeepCT can learn cell type-specific properties, build biologically meaningful vector representations of cell types, and utilize these representations to generate cell type-specific predictions of the effects of noncoding variations in the human genome.
Asunto(s)
Aprendizaje Profundo , Humanos , Redes Neurales de la Computación , Aprendizaje Automático , Genoma HumanoRESUMEN
Structural variants (SVs) pose a challenge to detect and interpret, but their study provides novel biological insights and molecular diagnosis underlying rare diseases. The aim of this study was to resolve a 9p24 rearrangement segregating in a family through five generations with a congenital heart defect (congenital pulmonary and aortic valvular stenosis, and pulmonary artery stenosis), by applying a combined genomic analysis. The analysis involved multiple techniques, including karyotype, chromosomal microarray analysis (CMA), FISH, whole-genome sequencing (WGS), RNA-seq and optical genome mapping (OGM). A complex 9p24 SV was hinted at by CMA results, showing three interspersed duplicated segments. Combined WGS and OGM analyses revealed that the 9p24 duplications constitute a complex SV, on which a set of breakpoints match the boundaries of the CMA duplicated sequences. The proposed structure for this complex rearrangement implies three duplications associated with an inversion of ~ 2Mb region on chromosome 9 with a SINE element insertion at the more distal breakpoint. Interestingly, this hypothesized genomic structure of rearrangement forms a chimeric transcript of the KANK1/DMRT1 loci, which was confirmed by RNA-seq on blood from 9p24 rearrangement carriers. Altogether with breakpoint amplification and FISH analysis, this combined approach allowed a deep characterization of this complex rearrangement. Although the genotype-phenotype correlation remains elusive from the molecular mechanism point of view, this study identified a large genomic rearrangement at 9p segregating with a familial congenital clinical trait, revealing a genetic biomarker that was successfully applied for embryo selection, changing the reproductive perspective of affected individuals.
RESUMEN
Chromosomes inside the nucleus are not located in the form of linear molecules. Instead, there is a complex multilevel genome folding that includes nucleosomes packaging, formation of chromatin loops, domains, compartments, and finally, chromosomal territories. Proper spatial organization play an essential role for the correct functioning of the genome, and is therefore dynamically changed during development or disease. Here we discuss how the organization of the cancer cell genome differs from the healthy genome at various levels. A better understanding of how malignization affects genome organization and long-range gene regulation will help to reveal the molecular mechanisms underlying cancer development and evolution.