RESUMO
Complex functioning of the genome in the cell nucleus is controlled at different levels: (a) the DNA base sequence containing all relevant inherited information; (b) epigenetic pathways consisting of protein interactions and feedback loops; (c) the genome architecture and organization activating or suppressing genetic interactions between different parts of the genome. Most research so far has shed light on the puzzle pieces at these levels. This article, however, attempts an integrative approach to genome expression regulation incorporating these different layers. Under environmental stress or during cell development, differentiation towards specialized cell types, or to dysfunctional tumor, the cell nucleus seems to react as a whole through coordinated changes at all levels of control. This implies the need for a framework in which biological, chemical, and physical manifestations can serve as a basis for a coherent theory of gene self-organization. An international symposium held at the Biomedical Research and Study Center in Riga, Latvia, on 25 July 2022 addressed novel aspects of the abovementioned topic. The present article reviews the most recent results and conclusions of the state-of-the-art research in this multidisciplinary field of science, which were delivered and discussed by scholars at the Riga symposium.
Assuntos
Núcleo Celular , Genoma , Núcleo Celular/metabolismo , Diferenciação Celular/genéticaRESUMO
PURPOSE: We characterize the clinical and molecular phenotypes of six unrelated individuals with intellectual disability and autism spectrum disorder who carry heterozygous missense variants of the PRKAR1B gene, which encodes the R1ß subunit of the cyclic AMP-dependent protein kinase A (PKA). METHODS: Variants of PRKAR1B were identified by single- or trio-exome analysis. We contacted the families and physicians of the six individuals to collect phenotypic information, performed in vitro analyses of the identified PRKAR1B-variants, and investigated PRKAR1B expression during embryonic development. RESULTS: Recent studies of large patient cohorts with neurodevelopmental disorders found significant enrichment of de novo missense variants in PRKAR1B. In our cohort, de novo origin of the PRKAR1B variants could be confirmed in five of six individuals, and four carried the same heterozygous de novo variant c.1003C>T (p.Arg335Trp; NM_001164760). Global developmental delay, autism spectrum disorder, and apraxia/dyspraxia have been reported in all six, and reduced pain sensitivity was found in three individuals carrying the c.1003C>T variant. PRKAR1B expression in the brain was demonstrated during human embryonal development. Additionally, in vitro analyses revealed altered basal PKA activity in cells transfected with variant-harboring PRKAR1B expression constructs. CONCLUSION: Our study provides strong evidence for a PRKAR1B-related neurodevelopmental disorder.
Assuntos
Apraxias , Transtorno do Espectro Autista , Deficiência Intelectual , Transtornos do Neurodesenvolvimento , Transtorno do Espectro Autista/genética , Subunidade RIbeta da Proteína Quinase Dependente de AMP Cíclico , Feminino , Humanos , Deficiência Intelectual/genética , Transtornos do Neurodesenvolvimento/genética , Dor , GravidezRESUMO
Immunostaining and fluorescence in situ hybridization (FISH) are well established methods for specific labelling of chromatin in the cell nucleus. COMBO-FISH (combinatorial oligonucleotide fluorescence in situ hybridization) is a FISH method using computer designed oligonucleotide probes specifically co-localizing at given target sites. In combination with super resolution microscopy which achieves spatial resolution far beyond the Abbe Limit, it allows new insights into the nano-scaled structure and organization of the chromatin of the nucleus. To avoid nano-structural changes of the chromatin, the COMBO-FISH labelling protocol was optimized omitting heat treatment for denaturation of the target. As an example, this protocol was applied to ALU elements-dispersed short stretches of DNA which appear in different kinds in large numbers in primate genomes. These ALU elements seem to be involved in gene regulation, genomic diversity, disease induction, DNA repair, etc. By computer search, we developed a unique COMBO-FISH probe which specifically binds to ALU consensus elements and combined this DNA-DNA labelling procedure with heterochromatin immunostainings in formaldehyde-fixed cell specimens. By localization microscopy, the chromatin network-like arrangements of ALU oligonucleotide repeats and heterochromatin antibody labelling sites were simultaneously visualized and quantified. This novel approach which simultaneously combines COMBO-FISH and immunostaining was applied to chromatin analysis on the nanoscale after low-linear-energy-transfer (LET) radiation exposure at different doses. Dose-correlated curves were obtained from the amount of ALU representing signals, and the chromatin re-arrangements during DNA repair after irradiation were quantitatively studied on the nano-scale. Beyond applications in radiation research, the labelling strategy of immunostaining and COMBO-FISH with localization microscopy will also offer new potentials for analyses of subcellular elements in combination with other specific chromatin targets.
Assuntos
Elementos Alu , Núcleo Celular/ultraestrutura , Cromatina/química , Hibridização in Situ Fluorescente/métodos , Linhagem Celular Tumoral , Cromatina/ultraestrutura , Humanos , Hibridização in Situ Fluorescente/normas , Microscopia de Fluorescência/métodos , Microscopia de Fluorescência/normasRESUMO
BACKGROUND: It is widely accepted that the 3D chromatin organization in human cell nuclei is not random and recent investigations point towards an interactive relation of epigenetic functioning and chromatin (re-)organization. Although chromatin organization seems to be the result of self-organization of the entirety of all molecules available in the cell nucleus, a general question remains open as to what extent chromatin organization might additionally be predetermined by the DNA sequence and, if so, if there are characteristic differences that distinguish typical regions involved in dysfunction-related aberrations from normal ones, since typical DNA breakpoint regions involved in disease-related chromosome aberrations are not randomly distributed along the DNA sequence. METHODS: Highly conserved k-mer patterns in intronic and intergenic regions have been reported in eukaryotic genomes. In this article, we search and analyze regions deviating from average spectra (ReDFAS) of k-mer word frequencies in the human genome. This includes all assembled regions, e.g., telomeric, centromeric, genic as well as intergenic regions. RESULTS: A positive correlation between k-mer spectra and 3D contact frequencies, obtained exemplarily from given Hi-C datasets, has been found indicating a relation of ReDFAS to chromatin organization and interactions. We also searched and found correlations of known functional annotations, e.g., genes correlating with ReDFAS. Selected regions known to contain typical breakpoints on chromosomes 9 and 5 that are involved in cancer-related chromosomal aberrations appear to be enriched in ReDFAS. Since transposable elements like ALUs are often assigned as major players in 3D genome organization, we also studied their impact on our examples but could not find a correlation between ALU regions and breakpoints comparable to ReDFAS. CONCLUSIONS: Our findings might show that ReDFAS are associated with instable regions of the genome and regions with many chromatin contacts which is in line with current research indicating that chromatin loop anchor points lead to genomic instability.
Assuntos
Cromatina , Genoma Humano , Humanos , Cromatina/genética , Pontos de Quebra do Cromossomo , DNA Intergênico/genéticaRESUMO
The specific characteristics of k-mer words (2 ≤ k ≤ 11) regarding genomic distribution and evolutionary conservation were recently found. Among them are, in high abundance, words with a tandem repeat structure (repeat unit length of 1 bp to 3 bp). Furthermore, there seems to be a class of extremely short tandem repeats (≤12 bp), so far overlooked, that are non-random-distributed and, therefore, may play a crucial role in the functioning of the genome. In the following article, the positional distributions of these motifs we call super-short tandem repeats (SSTRs) were compared to other functional elements, like genes and retrotransposons. We found length- and sequence-dependent correlations between the local SSTR density and G+C content, and also between the density of SSTRs and genes, as well as correlations with retrotransposon density. In addition to many general interesting relations, we found that SINE Alu has a strong influence on the local SSTR density. Moreover, the observed connection of SSTR patterns to pseudogenes and -exons might imply a special role of SSTRs in gene expression. In summary, our findings support the idea of a special role and the functional relevance of SSTRs in the genome.
Assuntos
Repetições de Microssatélites , Retroelementos , Humanos , Retroelementos/genética , Composição de Bases , DNA Intergênico , Repetições de Microssatélites/genética , Cromossomos Humanos , Receptores de SomatostatinaRESUMO
Dinucleotides are known as determinants for various structural and physiochemical properties of DNA and for binding affinities of proteins to DNA. These properties (e.g., stiffness) and bound proteins (e.g., transcription factors) are known to influence important biological functions, such as transcription regulation and 3D chromatin organization. Accordingly, the question arises of how the considerable variations in dinucleotide contents of eukaryotic chromosomes could still provide consistent DNA properties resulting in similar functions and 3D conformations. In this work, we investigate the hypothesis that coupled dinucleotide contents influence DNA properties in opposite directions to moderate each other's influences. Analyzing all 2478 chromosomes of 155 eukaryotic species, considering bias from coding sequences and enhancers, we found sets of correlated and anti-correlated dinucleotide contents. Using computational models, we estimated changes of DNA properties resulting from this coupling. We found that especially pure A/T dinucleotides (AA, TT, AT, TA), known to influence histone positioning and AC/GT contents, are relevant moderators and that, e.g., the Roll property, which is known to influence histone affinity of DNA, is preferably moderated. We conclude that dinucleotide contents might indirectly influence transcription and chromatin 3D conformation, via regulation of histone occupancy and/or other mechanisms.
Assuntos
Eucariotos , Histonas , Histonas/genética , Eucariotos/genética , Eucariotos/metabolismo , DNA/genética , DNA/química , Cromatina/genética , Células Eucarióticas/metabolismoRESUMO
Several strongly conserved DNA sequence patterns in and between introns and intergenic regions (IIRs) consisting of short tandem repeats (STRs) with repeat lengths <3 bp have already been described in the kingdom of Animalia. In this work, we expanded the search and analysis of conserved DNA sequence patterns to a wider range of eukaryotic genomes. Our aims were to confirm the conservation of these patterns, to support the hypothesis on their functional constraints and/or the identification of unknown patterns. We pairwise compared genomic DNA sequences of genes, exons, CDS, introns and intergenic regions of 34 Embryophyta (land plants), 30 Protista and 29 Fungi using established k-mer-based (alignment-free) comparison methods. Additionally, the results were compared with values derived for Animalia in former studies. We confirmed strong correlations between the sequence structures of IIRs spanning over the entire domain of Eukaryotes. We found that the high correlations within introns, intergenic regions and between the two are a result of conserved abundancies of STRs with repeat units ≤2 bp (e.g., (AT)n). For some sequence patterns and their inverse complementary sequences, we found a violation of equal distribution on complementary DNA strands in a subset of genomes. Looking at mismatches within the identified STR patterns, we found specific preferences for certain nucleotides stable over all four phylogenetic kingdoms. We conclude that all of these conserved patterns between IIRs indicate a shared function of these sequence structures related to STRs.
Assuntos
DNA Intergênico/genética , Evolução Molecular , Genoma/genética , Íntrons/genética , Eucariotos/genéticaRESUMO
During the last decade, genome sequence databases of many species have been more and more completed so that it has become possible to further develop a recently established technique of FISH (Fluorescence In Situ Hybridization) called COMBO-FISH (COMBinatorial Oligo FISH). In contrast to standard FISH techniques, COMBO-FISH makes use of a bioinformatic search in sequence databases for probe design, so that it can be done for any species so far sequenced. In the original approach, oligonucleotide stretches of typical lengths of 15-30 nucleotides were selected in such a way that they only co-localize at the given genome target. Typical probe sets of about 20-40 stretches were used to label about 50-250 kb specifically. The probes of different lengths can be composed of purines and pyrimidines, but were often restricted to homo-purine or homo-pyrimidine probe sets because of the experimental advantage of using a protocol omitting denaturation of the target strand and triple strand binding of the probes. This allows for a better conservation of the 3D folding and arrangement of the genome. With an improved, rigorous genome sequence database analysis and sequence search according to statistical frequency and uniqueness, a novel family of probes repetitively binding to characteristic genome features like SINEs (Short Interspersed Nuclear Elements, e.g., ALU elements), LINEs (Long Interspersed Nuclear Elements, e.g., L1), or centromeres has been developed. These probes can be synthesized commercially as DNA or PNA probes with high purity and labeled by fluorescent dye molecules. Here, new protocols are described for purine-pyrimidine probes omitting heat treatment for denaturation of the target so that oligonucleotide labeling can also be combined with immune-staining by specific antibodies. If the dyes linked to the oligonucleotide stretches undergo reversible photo-bleaching (laser-induced slow blinking), the labeled cell nuclei can be further subjected to super-resolution localization microscopy for complex chromatin architecture research.
Assuntos
Elementos Alu/genética , Técnicas de Química Combinatória/métodos , Hibridização in Situ Fluorescente/métodos , Sondas de Oligonucleotídeos/genética , Ácidos Nucleicos Peptídicos/genética , Linhagem Celular Tumoral , Cromatina/genética , Bases de Dados de Ácidos Nucleicos , Genoma , Humanos , Processamento de Imagem Assistida por Computador , Nanoestruturas/química , Sequências Repetitivas de Ácido Nucleico , Análise de Sequência de DNA/métodosRESUMO
In this study, we pairwise-compared multiple genome regions, including genes, exons, coding DNA sequences (CDS), introns, and intergenic regions of 39 Animalia genomes, including Deuterostomia (27 species) and Protostomia (12 species), by applying established k-mer-based (alignment-free) comparison methods. We found strong correlations between the sequence structure of introns and intergenic regions, individual organisms, and within wider phylogenetical ranges, indicating the conservation of certain structures over the full range of analyzed organisms. We analyzed these sequence structures by quantifying the contribution of different sets of DNA words to the average correlation value by decomposing the correlation coefficients with respect to these word sets. We found that the conserved structures within introns, intergenic regions, and between the two were mainly a result of conserved tandem repeats with repeat units ≤ 2 bp (e.g., (AT)n), while other conserved sequence structures, such as those found between exons and CDS, were dominated by tandem repeats with repeat unit sizes of 3 bp in length and more complex DNA word patterns. We conclude that the conservation between intron and intergenic regions indicates a shared function of these sequence structures. Also, the similar differences in conserved structures with known origin, especially to the conservation between exons and CDS resulting from DNA codons, indicate that k-mer composition-based functional properties of introns and intergenic regions may differ from those of exons and CDS.
RESUMO
In genome analysis, k-mer-based comparison methods have become standard tools. However, even though they are able to deliver reliable results, other algorithms seem to work better in some cases. To improve k-mer-based DNA sequence analysis and comparison, we successfully checked whether adding positional resolution is beneficial for finding and/or comparing interesting organizational structures. A simple but efficient algorithm for extracting and saving local k-mer spectra (frequency distribution of k-mers) was developed and used. The results were analyzed by including positional information based on visualizations as genomic maps and by applying basic vector correlation methods. This analysis was concentrated on small word lengths (1 ≤ k ≤ 4) on relatively small viral genomes of Papillomaviridae and Herpesviridae, while also checking its usability for larger sequences, namely human chromosome 2 and the homologous chromosomes (2A, 2B) of a chimpanzee. Using this alignment-free analysis, several regions with specific characteristics in Papillomaviridae and Herpesviridae formerly identified by independent, mostly alignment-based methods, were confirmed. Correlations between the k-mer content and several genes in these genomes have been found, showing similarities between classified and unclassified viruses, which may be potentially useful for further taxonomic research. Furthermore, unknown k-mer correlations in the genomes of Human Herpesviruses (HHVs), which are probably of major biological function, are found and described. Using the chromosomes of a chimpanzee and human that are currently known, identities between the species on every analyzed chromosome were reproduced. This demonstrates the feasibility of our approach for large data sets of complex genomes. Based on these results, we suggest k-mer analysis with positional resolution as a method for closing a gap between the effectiveness of alignment-based methods (like NCBI BLAST) and the high pace of standard k-mer analysis.