Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 24
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Croat Med J ; 65(3): 209-219, 2024 Jun 13.
Artículo en Inglés | MEDLINE | ID: mdl-38868967

RESUMEN

AIM: To precisely identify and analyze alpha-satellite higher-order repeats (HORs) in T2T-CHM13 assembly of human chromosome 3. METHODS: From the recently sequenced complete T2T-CHM13 assembly of human chromosome 3, the precise alpha satellite HOR structure was computed by using the novel high-precision GRM2023 algorithm with global repeat map (GRM) and monomer distance (MD) diagrams. RESULTS: The major alpha satellite HOR array in chromosome 3 revealed a novel cascading HOR, housing 17mer HOR copies with subfragments of periods 15 and 2. Within each row in the cascading HOR, the monomers were of different types, but different rows within the same cascading 17mer HOR contained more than one monomer of the same type. Each canonical 17mer HOR copy comprised 17 monomers belonging to 16 different monomer types. Another pronounced 10mer HOR array was of the regular Willard's type. CONCLUSION: Our findings emphasize the complexity within the chromosome 3 centromere as well as deviations from expected highly regular patterns.


Asunto(s)
Cromosomas Humanos Par 3 , ADN Satélite , Humanos , ADN Satélite/genética , Cromosomas Humanos Par 3/genética , Centrómero/genética , Algoritmos , Secuencias Repetitivas de Ácidos Nucleicos/genética
2.
Int J Mol Sci ; 25(8)2024 Apr 16.
Artículo en Inglés | MEDLINE | ID: mdl-38673983

RESUMEN

Unraveling the intricate centromere structure of human chromosomes holds profound implications, illuminating fundamental genetic mechanisms and potentially advancing our comprehension of genetic disorders and therapeutic interventions. This study rigorously identified and structurally analyzed alpha satellite higher-order repeats (HORs) within the centromere of human chromosome 15 in the complete T2T-CHM13 assembly using the high-precision GRM2023 algorithm. The most extensive alpha satellite HOR array in chromosome 15 reveals a novel cascading HOR, housing 429 15mer HOR copies, containing 4-, 7- and 11-monomer subfragments. Within each row of cascading HORs, all alpha satellite monomers are of distinct types, as in regular Willard's HORs. However, different HOR copies within the same cascading 15mer HOR contain more than one monomer of the same type. Each canonical 15mer HOR copy comprises 15 monomers belonging to only 9 different monomer types. Notably, 65% of the 429 15mer cascading HOR copies exhibit canonical structures, while 35% display variant configurations. Identified as the second most extensive alpha satellite HOR, another novel cascading HOR within human chromosome 15 encompasses 164 20mer HOR copies, each featuring two subfragments. Moreover, a distinct pattern emerges as interspersed 25mer/26mer structures differing from regular Willard's HORs and giving rise to a 34-monomer subfragment. Only a minor 18mer HOR array of 12 HOR copies is of the regular Willard's type. These revelations highlight the complexity within the chromosome 15 centromeric region, accentuating deviations from anticipated highly regular patterns and hinting at profound information encoding and functional potential within the human centromere.


Asunto(s)
Centrómero , Cromosomas Humanos Par 15 , ADN Satélite , Humanos , ADN Satélite/genética , Centrómero/genética , Cromosomas Humanos Par 15/genética , Secuencias Repetitivas de Ácidos Nucleicos
3.
Genes (Basel) ; 14(12)2023 12 12.
Artículo en Inglés | MEDLINE | ID: mdl-38137022

RESUMEN

The Supersymmetry Genetic code (SSyGC) table is based on five physicochemical symmetries: (1) double mirror symmetry on the principle of the horizontal and vertical mirror symmetry axis between all bases (purines [A, G) and pyrimidines (U, C)] and (2) of bases in the form of codons; (3) direct-complement like codon/anticodon symmetry in the sixteen alternating boxes of the genetic code columns; (4) A + T-rich and C + G-rich alternate codons in the same row between both columns of the genetic code; (5) the same position between divided and undivided codon boxes in relation to horizontal mirror symmetry axis. The SSyGC table has a unique physicochemical purine-pyrimidine symmetry net which is as the core symmetry common for all, with more than thirty different nuclear and mitochondrial genetic codes. This net is present in the SSyGC table of all RNA and DNA living species. None of these symmetries are present in the Standard Genetic Code (SGC) table which is constructed on the alphabetic horizontal and vertical U-C-A-G order of bases. Here, we show that the free energy value of each codon incorporated as fundamentally mapping the "energy code" in the SSyGC table is compatible with mirror symmetry. On the other hand, in the SGC table, the same free energy values of codons are dispersed and a mirror symmetry between them is not recognizable. At the same time, the mirror symmetry of the SSyGC table and the DNA quadruplets together with our classification of codons/trinucleotides are perfectly imbedded in the mirror symmetry energy mapping of codons/trinucleotides and point out in favor of maintaining the integrity of the genetic code and DNA genome. We also argue that physicochemical symmetries of the SSyGC table in the manner of the purine-pyrimidine symmetry net, the quadruplet symmetry of DNA molecule, and the free energy of codons have remined unchanged during all of evolution. The unchangeable and universal symmetry properties of the genetic code, DNA molecules, and the energy code are decreasing disorder between codons/trinucleotides and shed a new light on evolution. Diversity in all living species on Earth is broad, but the symmetries of the Supersymmetry Genetic Code as the code of life and the DNA quadruplets related to the "energy code" are unique, unchangeable, and have the power of natural laws.


Asunto(s)
Evolución Molecular , Código Genético , Codón/genética , ADN/genética
4.
Sci Rep ; 13(1): 14420, 2023 09 02.
Artículo en Inglés | MEDLINE | ID: mdl-37660151

RESUMEN

It is known that the ~ 1.6 kb Neuroblastoma BreakPoint Family (NBPF) repeats are human specific and contributing to cognitive capabilities, with increasing frequency in higher order repeat 3mer HORs (Olduvai triplets). From chimpanzee to modern human there is a discontinuous jump from 0 to ~ 50 tandemly organized 3mer HORs. Here we investigate the structure of NBPF 3mer HORs in the Neanderthal genome assembly of Pääbo et al., comparing it to the results obtained for human hg38.p14 chromosome 1. Our findings reveal corresponding NBPF 3mer HOR arrays in Neanderthals with slightly different monomer structures and numbers of HOR copies compared to humans. Additionally, we compute the NBPF 3mer HOR pattern for the complete telomere-to-telomere human genome assembly (T2T-CHM13) by Miga et al., identifying two novel tandem arrays of NBPF 3mer HOR repeats with 5 and 9 NBPF 3mer HOR copies. We hypothesize that these arrays correspond to novel NBPF genes (here referred to as NBPFA1 and NBPFA2). Further improving the quality of the Neanderthal genome using T2T-CHM13 as a reference would be of great interest in determining the presence of such distant novel NBPF genes in the Neanderthal genome and enhancing our understanding of human evolution.


Asunto(s)
Hombre de Neandertal , Neuroblastoma , Humanos , Animales , Hombre de Neandertal/genética , Cromosomas Humanos , Cromosomas Humanos Par 1 , Familia , Pan troglodytes
5.
Int J Mol Sci ; 24(15)2023 Jul 27.
Artículo en Inglés | MEDLINE | ID: mdl-37569405

RESUMEN

Symmetries have not been completely determined and explained from the discovery of the DNA structure in 1953 and the genetic code in 1961. We show, during 10 years of investigation and research, our discovery of the Supersymmetry Genetic Code table in the form of 2 × 8 codon boxes, quadruplet DNA symmetries, and the classification of trinucleotides/codons, all built with the same physiochemical double mirror symmetry and Watson-Crick pairing. We also show that single-stranded RNA had the complete code of life in the form of the Supersymmetry Genetic Code table simultaneously with instructions of codons' relationship as to how to develop the DNA molecule on the principle of Watson-Crick pairing. We show that the same symmetries between the genetic code and DNA quadruplet are highly conserved during the whole evolution even between phylogenetically distant organisms. In this way, decreasing disorder and entropy enabled the evolution of living beings up to sophisticated species with cognitive features. Our hypothesis that all twenty amino acids are necessary for the origin of life on the Earth, which entirely changes our view on evolution, confirms the evidence of organic natural amino acids from the extra-terrestrial asteroid Ryugu, which is nearly as old as our solar system.


Asunto(s)
ADN , Código Genético , ADN/genética , Codón/genética , Aminoácidos/genética , ARN , Evolución Molecular
6.
Life Sci Alliance ; 6(1)2023 01.
Artículo en Inglés | MEDLINE | ID: mdl-36261226

RESUMEN

Previously it was found that the neuroblastoma breakpoint family (NBPF) gene repeat units of ∼1.6 kb have an important role in human brain evolution and function. The higher order organization of these repeat units has been discovered by both methods, the higher order repeat (HOR)-searching method and the HLS searching method. Using the HOR searching method with global repeat map algorithm, here we identified the tandemly organized NBPF HORs in the human and nonhuman primate NCBI reference genomes. We identified 50 tandemly organized canonical 3mer NBPF HOR copies (Olduvai triplets), but none in nonhuman primates chimpanzee, gorilla, orangutan, and Rhesus macaque. This discontinuous jump in tandemly organized HOR copy number is in sharp contrast to the known gradual increase in the number of Olduvai domains (NBPF monomers) from nonhuman primates to human, especially from ∼138 in chimpanzee to ∼300 in human genome. Using the same global repeat map algorithm method we have also determined the 3mer tandems of canonical 3mer HOR copies in 20 randomly chosen human genomes (10 male and 10 female). In all cases, we found the same 3mer HOR copy numbers as in the case of the reference human genome, with no mutation. On the other hand, some point mutations with respect to reference genome are found for some NBPF monomers which are not tandemly organized in canonical HORs.


Asunto(s)
Neuroblastoma , Pan troglodytes , Animales , Humanos , Femenino , Masculino , Pan troglodytes/genética , Macaca mulatta/genética , Evolución Molecular , Genoma Humano/genética , Encéfalo
7.
Genes (Basel) ; 13(11)2022 10 23.
Artículo en Inglés | MEDLINE | ID: mdl-36360166

RESUMEN

In this article, we show that mono/oligonucleotide quadruplets, as basic structures of DNA, along with our classification of trinucleotides, disclose an organization of genomes based on purine-pyrimidine symmetry. Moreover, the structure and stability of DNA are influenced by the Watson-Crick pairing and the natural law of DNA creation and conservation, according to which the same mono- or oligonucleotide insertion must be inserted simultaneously into both strands of DNA. Taken together, they lead to quadruplets with central mirror symmetry and bidirectional DNA strand orientation and are incorporated into Chargaff's second parity rule (CSPR). Performing our quadruplet frequency analysis of all human chromosomes and of Neuroblastoma BreakPoint Family (NBPF) genes, which code Olduvai protein domains in the human genome, we show that the coding part of DNA violates CSPR. This may shed new light and give rise to a novel hypothesis on DNA creation and its evolution. In this framework, the logarithmic relationship between oligonucleotide order and minimal DNA sequence length, to establish the validity of CSPR, automatically follows from the quadruplet structure of the genomic sequence. The problem of the violation of CSPR in rare symbionts is discussed.


Asunto(s)
ADN , Genoma Humano , Composición de Base , ADN/genética , ADN/química , Oligonucleótidos
8.
Biosystems ; 218: 104695, 2022 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-35580818

RESUMEN

The fundamental role of symmetry in the genetic code is to decrease disorder between codons and to preserve the integrity of system. The Standard Genetic Code (SGC) table is structured alphabetically in a horizontal and vertical array of U-C-A-G bases only with aesthetic symmetry. We postulate "the symmetry theory of genetic code" which is based on the unique physicochemical purine - pyrimidine symmetry net between codons of our Supersymmetry genetic code (SSyGC) table. The common purine - pyrimidine symmetry net as "the golden rule" and a core of the SSyGC table is universal, remaining unchanged during all of evolution. It is identical for more than 30 known genetic codes including those that will be discovered in the future, as well as for all RNA and DNA species. The unique SSyGC table has five physicochemical symmetries between bases, codons, and amino acids: 1) purine - pyrimidine symmetry on the principle of the Watson - Crick pairing (A↔U, C↔G), 2) direct - complement symmetry between codons, 3) mirror symmetry between bases and codons, 4) A + T rich and C + G rich symmetry between codons, and 5) symmetry between position of amino acids. Opposite to the SGC table where the third base is inactive, in the SSyGC table the role of the third base in codons is dominant in creation of symmetries. There are also present for the first time the symmetric positions of all boxes with amino acids. Opposite of the SGC table, the SSyGC code table contains three sextets for Serine, Arginine, and Leucine, each with six codons, positioned in continuity. Multi - facet symmetries of the SSyGC table as a natural law exclude the individual random creation of amino acids even in primitive life form. Accordingly, we hypothesize that the contemporary life arose due to common activity of all natural amino acids. With discovery of the unique physicochemical Supersymmetry genetic code table, the new light is shed on the symmetry of the genetic code.


Asunto(s)
Evolución Molecular , Código Genético , Aminoácidos/genética , Codón/genética , Código Genético/genética , Modelos Genéticos , Purinas , Pirimidinas
9.
Prog Mol Subcell Biol ; 60: 203-234, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34386877

RESUMEN

Here we present three interesting novel human Higher-Order Repeats (HORs) discovered using the HOR-searching method with GRM algorithm: (a) The novel Neuroblastoma Breakpoint Family gene (NBPF) 3mer HOR, discovered applying GRM algorithm to human chromosome 1 (Paar et al., Mol Biol Evol 28:1877-1892, 2011). NBPF 3mer HOR is based on previously known ~1.6 kb NBPF primary repeat monomers (known as DUF1220 domain) in human chromosome 1, but the NBPF HOR was not known before its discovery by using GRM. It should be stressed that the NBPF HOR presents a unique human-specific pattern, distinguishing human from nonhuman primates. (b) The novel quartic HOR (2mer⊃2mer⊃9mer) discovered using the GRM algorithm for analysis of hornerin genes in human chromosome 1 (Paar et al., Mol Biol Evol 28:1877-1892, 2011). This quartic HOR is based on 39 bp hornerin primary repeat monomer in human chromosome 1. To our knowledge, this is the first known case of quartic HOR, with four levels of hierarchy of HOR organization. (c) The novel 33mer alpha satellite HOR in human chromosome 21, discovered using the GRM algorithm (Gluncic et al., Sci Rep 9:12629, 2019). This 33mer HOR in the smallest human chromosome is the largest alpha satellite HOR copy among all 22 somatic human chromosomes. Moreover, the same 33mer HOR is present in the hg38 human genome assembly of four human chromosomes: 21, 22, 13, and 14. We point out that the DUF1220 encoding genomic structures in NBPF genes in human chromosome 1, recently studied and related to the brain evolution and pathologies and cognitive aptitude, can be considered in the framework of the general concept of HORs, already extensively studied in genomics, especially in the centromeric region.


Asunto(s)
Cromosomas Humanos Par 21 , Neuroblastoma , Animales , Centrómero , ADN Satélite , Exones , Genoma Humano/genética , Humanos , Neuroblastoma/genética
10.
J Theor Biol ; 524: 110748, 2021 09 07.
Artículo en Inglés | MEDLINE | ID: mdl-33933479

RESUMEN

Ever since Nirenberg's discovery in 1961 in which codons code individual amino acids, numerous scientists searched for symmetries within the genetic code. The standard genetic code (SGC) table is an alphabetic artificial construct based on the U-C-A-G ordering of nucleotides without natural symmetries. Up to the present, complete symmetry in the genetic code has not been found, leaving doubt as to whether the symmetrical nature as the protector of order even exists. Our novel Ideal Symmetry Genetic Code (ISyGC) table reflects a unique fundamental physicochemical purine-pyrimidine symmetry net for all more than thirty known variations of nuclear and mitochondrial genetic codes. The nuclear genetic code for RNA and DNA viruses also contains the same purine-pyrimidine symmetry net. We show that the ISyGC table leads to automatic transformation into a DNA sequence akin to the 5'3 codon and 3'5 anticodon patterns. As a result of purine-pyrimidine symmetries between codons in the ISyGC table, algorithms of the first two bases as well of the third base of codons show how tRNA cognate anticodons can recognize synonymous codons during mRNA decoding. We show that the ISyGC purine-pyrimidine net with its physicochemical properties represents an evolutionary common "frozen accident" at the onset of each genetic code creation and RNA to DNA evolution. As such, during all of evolution the unique fundamental purine-pyrimidine symmetry net of all genetic codes remains unchangeable. In this way, evolution is a road paved with symmetries.


Asunto(s)
Evolución Molecular , ARN , Anticodón , ADN , Código Genético , Purinas , Pirimidinas , ARN/genética , ARN de Transferencia
11.
J Theor Biol ; 483: 109985, 2019 12 21.
Artículo en Inglés | MEDLINE | ID: mdl-31469987

RESUMEN

After explanation of the Chargaff´s first parity rule in terms of the Watson-Crick base-pairing between the two DNA strands, the Chargaff´s second parity rule for each strand of DNA (also named strand symmetry), which cannot be explained by Watson-Crick base-pairing only, is still a challenging issue already fifty years. We show that during evolution DNA preserves its identity in the form of quadruplet A+T and C+G rich matrices based on purine-pyrimidine mirror symmetries of trinucleotides. Identical symmetries are present in our classification of trinucleotides and the genetic code table. All eukaryotes and almost all prokaryotes (bacteria and archaea) have quadruplet mirror symmetries in structural form and frequencies following the principle of Chargaff's second parity rule and Natural symmetry law of DNA creation and conservation. Some rare symbionts have mirror symmetry only in their structural form within each DNA strand. Based on our matrix analysis of closely related species, humans and Neanderthals, we find that the circular cycle of inverse proportionality between trinucleotides preserves identical relative frequencies of trinucleotides in each quadruplet and in the whole genome. According to our calculations, a change in frequencies in quadruplet matrices could lead to the creation of new species. Violation of quadruplet symmetries is practically inconsistent with life. DNA symmetries provide a key for understanding the restriction of disorder (entropy) due to mutations in the evolution of DNA.


Asunto(s)
ADN/genética , Evolución Molecular , Secuencia Rica en At , Cromosomas Humanos/genética , Secuencia Conservada , Eucariontes/metabolismo , Humanos , Nucleótidos/genética , Células Procariotas/metabolismo , Simbiosis/genética
12.
Genome Biol Evol ; 9(10): 2668-2680, 2017 10 01.
Artículo en Inglés | MEDLINE | ID: mdl-27492235

RESUMEN

Higher order repeats (HORs) containing tandems of primary and secondary repeat units (head-to-tail "tandem within tandem pattern"), referred to as regular HORs, are typical for primate alpha satellite DNAs and most pronounced in human genome. Regular HORs are known to be a result of recent evolutionary processes. In non-primate genomes mostly so called complex HORs have been found, without head to tail tandem of primary repeat units. In beetle Tribolium castaneum, considered as a model case for genome studies, large tandem repeats have been identified, but no HORs have been reported. Here, using our novel robust repeat finding algorithm Global Repeat Map, we discover two regular and six complex HORs in T. castaneum. In organizational pattern, the integrity and homogeneity of regular HORs in T. castaneum resemble human regular HORs (with T. castaneum monomers different from human alpha satellite monomers), involving a wider range of monomer lengths than in human HORs. Similar regular higher order repeat structures have previously not been found in insects. Some of these novel HORs in T. castaneum appear as most regular among known HORs in non-primate genomes, although with substantial riddling. This is intriguing, in particular from the point of view of role of non-coding repeats in modulation of gene expression.


Asunto(s)
Genoma de los Insectos , Secuencias Repetitivas de Ácidos Nucleicos , Tribolium/genética , Animales , Secuencia Conservada , Evolución Molecular
13.
J Biomol Struct Dyn ; 34(7): 1383-94, 2016 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-26524490

RESUMEN

For almost 50 years the conclusive explanation of Chargaff's second parity rule (CSPR), the equality of frequencies of nucleotides A=T and C=G or the equality of direct and reverse complement trinucleotides in the same DNA strand, has not been determined yet. Here, we relate CSPR to the interstrand mirror symmetry in 20 symbolic quadruplets of trinucleotides (direct, reverse complement, complement, and reverse) mapped to double-stranded genome. The symmetries of Q-box corresponding to quadruplets can be obtained as a consequence of Watson-Crick base pairing and CSPR together. Alternatively, assuming Natural symmetry law for DNA creation that each trinucleotide in one strand of DNA must simultaneously appear also in the opposite strand automatically leads to Q-box direct-reverse mirror symmetry which in conjunction with Watson-Crick base pairing generates CSPR. We demonstrate quadruplet's symmetries in chromosomes of wide range of organisms, from Escherichia coli to Neanderthal and human genomes, introducing novel quadruplet-frequency histograms and 3D-diagrams with combined interstrand frequencies. These "landscapes" are mutually similar in all mammals, including extinct Neanderthals, and somewhat different in most of older species. In human chromosomes 1-12, and X, Y the "landscapes" are almost identical and slightly different in the remaining smaller and telocentric chromosomes. Quadruplet frequencies could provide a new robust tool for characterization and classification of genomes and their evolutionary trajectories.


Asunto(s)
ADN/química , ADN/genética , G-Cuádruplex , Modelos Genéticos , Repeticiones de Trinucleótidos , Animales , Composición de Base , Mapeo Cromosómico , Codón , Evolución Molecular , Genoma , Genómica , Humanos , Nucleótidos
14.
Gene ; 543(1): 45-52, 2014 Jun 10.
Artículo en Inglés | MEDLINE | ID: mdl-24709107

RESUMEN

The standard classification scheme of the genetic code is organized for alphabetic ordering of nucleotides. Here we introduce the new, "ideal" classification scheme in compact form, for the first time generated by codon sextets encoding Ser, Arg and Leu amino acids. The new scheme creates the known purine/pyrimidine, codon-anticodon, and amino/keto type symmetries and a novel A+U rich/C+G rich symmetry. This scheme is built from "leading" and "nonleading" groups of 32 codons each. In the ensuing 4 × 16 scheme, based on trinucleotide quadruplets, Ser has a central role as initial generator. Six codons encoding Ser and six encoding Arg extend continuously along a linear array in the "leading" group, and together with four of six Leu codons uniquely define construction of the "leading" group. The remaining two Leu codons enable construction of the "nonleading" group. The "ideal" genetic code suggests the evolution of genetic code with serine as an initiator.


Asunto(s)
Aminoácidos/genética , Codón/química , Codón/clasificación , Código Genético/genética , Conformación de Ácido Nucleico , Secuencia de Aminoácidos/fisiología , Secuencia de Bases/fisiología , Codón/genética , Evolución Molecular , Modelos Genéticos , Serina/genética
15.
Gene ; 531(2): 184-90, 2013 Dec 01.
Artículo en Inglés | MEDLINE | ID: mdl-24042127

RESUMEN

The origin and logic of genetic code are two of greatest mysteries of life sciences. Analyzing DNA sequences we showed that the start/stop trinucleotides have broader importance than just marking start and stop of exons in coding DNA. On this basis, here we introduced new classification of trinucleotides and showed that all A+T rich trinucleotides consisting of three different nucleotides arise from start-ATG, stop-TGA and stop-TAG using their complement, reverse complement and reverse transformations. Due to the same transformations during generations of crossing-over they can switch from one form to the other. By direct process the start-ATG and stop-TAG can irreversibly transform into stop-TAA. By transformation into A+T rich trinucleotides and 16/32 C+G rich they can lose the start/stop function and take the role of a sense codon in reversible way. The remaining 16 C+G trinucleotides cannot directly transform into start/stop trinucleotides and thus remain a firm skeleton for structuring the C+G rich DNA. We showed that start/stops strongly enrich the A+T rich noncoding DNA through frequently extended forms. From the evolutionary viewpoint the start/stops are chief creators of prevailing A+T rich noncoding DNA, and of more stable coding DNA. We propose that start/stops have basic role as "seeds" in trinucleotide evolution of noncoding and coding sequences and lead to asymmetry between A+T and C+G rich DNA. By dynamical transformations during evolution they enabled pronounced phylogenetic broadness, keeping the regulator function.


Asunto(s)
Codón Iniciador/genética , Codón de Terminación/genética , Codón/clasificación , Código Genético/fisiología , Secuencias Reguladoras de Ácidos Nucleicos/fisiología , Secuencia de Bases/fisiología , Codón/genética , ADN/genética , Evolución Molecular , Fósiles , Humanos , Datos de Secuencia Molecular , Saccharomyces cerevisiae/genética
16.
J Theor Biol ; 317: 301-9, 2013 Jan 21.
Artículo en Inglés | MEDLINE | ID: mdl-23026763

RESUMEN

The centromeres remain "the final frontier" in unexplored segments of genome landscape in primate genomes, characterized by 2-5 Mb arrays of evolutionary rapidly evolving alpha satellite (AS) higher order repeats (HORs). Alpha satellites as specific noncoding sequences may be also significant in light of regulatory role of noncoding sequences. Using the Global Repeat Map (GRM) algorithm we identify in NCBI assemblies of chromosome 5 the species-specific alpha satellite HORs: 13mer in human, 5mer in chimpanzee, 14mer in orangutan and 3mers in macaque. The suprachromosomal family (SF) classification of alpha satellite HORs and surrounding monomeric alpha satellites is performed and specific segmental structure was found for major alpha satellite arrays in chromosome 5 of primates. In the framework of our novel concept of start/stop Codon Like Trinucleotides (CLTs) as a "new DNA language in noncoding sequences", we find characteristics and differences of these species in CLT extensions, in particular the extensions of stop-TGA CLT. We hypothesize that these are regulators in noncoding sequences, acting at a distance, and that they can amplify or weaken the activity of start/stop codons in coding sequences in protein genesis, increasing the richness of regulatory phenomena.


Asunto(s)
Codón Iniciador/genética , Codón de Terminación/genética , ADN Satélite/genética , Primates/genética , Expansión de Repetición de Trinucleótido/genética , Animales , Secuencia de Bases , Cromosomas de los Mamíferos/genética , Secuencia de Consenso/genética , Genoma/genética , Humanos , Datos de Secuencia Molecular , Nucleosomas/genética , Nucleótidos/genética
17.
Mol Biol Evol ; 28(6): 1877-92, 2011 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-21273634

RESUMEN

Much attention has been devoted to identifying genomic patterns underlying the evolution of the human brain and its emergent advanced cognitive capabilities, which lie at the heart of differences distinguishing humans from chimpanzees, our closest living relatives. Here, we identify two particular intragene repeat structures of noncoding human DNA, spanning as much as a hundred kilobases, that are present in human genome but are absent from the chimpanzee genome and other nonhuman primates. Using our novel computational method Global Repeat Map, we examine tandem repeat structure in human and chimpanzee chromosome 1. In human chromosome 1, we find three higher order repeats (HORs), two of them novel, not reported previously, whereas in chimpanzee chromosome 1, we find only one HOR, a 2mer alphoid HOR instead of human alphoid 11mer HOR. In human chromosome 1, we identify an HOR based on 39-bp primary repeat unit, with secondary, tertiary, and quartic repeat units, fully embedded in human hornerin gene, related to regenerating and psoriatric skin. Such an HOR is not found in chimpanzee chromosome 1. We find a remarkable human 3mer HOR organization based on the ~1.6-kb primary repeat unit, fully embedded within the neuroblastoma breakpoint family genes, which is related to the function of the human brain. Such HORs are not present in chimpanzees. In general, we find that human-chimpanzee differences are much larger for tandem repeats, in particularly for HORs, than for gene sequences. This may be of great significance in light of recent studies that are beginning to reveal the large-scale regulatory architecture of the human genome, in particular the role of noncoding sequences. We hypothesize about the possible importance of human accelerated HOR patterns as components in the gene expression multilayered regulatory network.


Asunto(s)
Proteínas de Neoplasias/genética , Pan troglodytes/genética , Secuencias Repetidas en Tándem/genética , Elementos Alu/genética , Animales , Secuencia de Bases , Proteínas de Unión al Calcio/genética , Proteínas Portadoras/genética , Mapeo Cromosómico , Cromosomas Humanos Par 1/genética , Evolución Molecular , Orden Génico , Redes Reguladoras de Genes/genética , Genoma Humano/genética , Humanos , Proteínas de Filamentos Intermediarios/genética , Repeticiones de Microsatélite/genética , Datos de Secuencia Molecular , Alineación de Secuencia , Especificidad de la Especie
18.
J Mol Evol ; 72(1): 34-55, 2011 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-21103868

RESUMEN

Comparison of human and chimpanzee genomes has received much attention, because of paramount role for understanding evolutionary step distinguishing us from our closest living relative. In order to contribute to insight into Y chromosome evolutionary history, we study and compare tandems, higher order repeats (HORs), and regularly dispersed repeats in human and chimpanzee Y chromosome contigs, using robust Global Repeat Map algorithm. We find a new type of long-range acceleration, human-accelerated HOR regions. In peripheral domains of 35mer human alphoid HORs, we find riddled features with ten additional repeat monomers. In chimpanzee, we identify 30mer alphoid HOR. We construct alphoid HOR schemes showing significant human-chimpanzee difference, revealing rapid evolution after human-chimpanzee separation. We identify and analyze over 20 large repeat units, most of them reported here for the first time as: chimpanzee and human ~1.6 kb 3mer secondary repeat unit (SRU) and ~23.5 kb tertiary repeat unit (~0.55 kb primary repeat unit, PRU); human 10848, 15775, 20309, 60910, and 72140 bp PRUs; human 3mer SRU (~2.4 kb PRU); 715mer and 1123mer SRUs (5mer PRU); chimpanzee 5096, 10762, 10853, 60523 bp PRUs; and chimpanzee 64624 bp SRU (10853 bp PRU). We show that substantial human-chimpanzee differences are concentrated in large repeat structures, at the level of as much as ~70% divergence, sizably exceeding previous numerical estimates for some selected noncoding sequences. Smeared over the whole sequenced assembly (25 Mb) this gives ~14% human-chimpanzee divergence. This is significantly higher estimate of divergence between human and chimpanzee than previous estimates.


Asunto(s)
Evolución Molecular , Variación Genética , Genoma Humano , Pan troglodytes/genética , Secuencias Repetitivas de Ácidos Nucleicos , Secuencias Repetidas en Tándem , Cromosoma Y/genética , Algoritmos , Animales , Secuencia de Bases , Mapeo Cromosómico , ADN Satélite , Humanos , Análisis de Secuencia de ADN , Homología de Secuencia de Ácido Nucleico
19.
BMC Bioinformatics ; 9: 466, 2008 Nov 03.
Artículo en Inglés | MEDLINE | ID: mdl-18980673

RESUMEN

BACKGROUND: Identification of approximate tandem repeats is an important task of broad significance and still remains a challenging problem of computational genomics. Often there is no single best approach to periodicity detection and a combination of different methods may improve the prediction accuracy. Discrete Fourier transform (DFT) has been extensively used to study primary periodicities in DNA sequences. Here we investigate the application of DFT method to identify and study alphoid higher order repeats. RESULTS: We used method based on DFT with mapping of symbolic into numerical sequence to identify and study alphoid higher order repeats (HOR). For HORs the power spectrum shows equidistant frequency pattern, with characteristic two-level hierarchical organization as signature of HOR. Our case study was the 16 mer HOR tandem in AC017075.8 from human chromosome 7. Very long array of equidistant peaks at multiple frequencies (more than a thousand higher harmonics) is based on fundamental frequency of 16 mer HOR. Pronounced subset of equidistant peaks is based on multiples of the fundamental HOR frequency (multiplication factor n for nmer) and higher harmonics. In general, nmer HOR-pattern contains equidistant secondary periodicity peaks, having a pronounced subset of equidistant primary periodicity peaks. This hierarchical pattern as signature for HOR detection is robust with respect to monomer insertions and deletions, random sequence insertions etc. For a monomeric alphoid sequence only primary periodicity peaks are present. The 1/fbeta- noise and periodicity three pattern are missing from power spectra in alphoid regions, in accordance with expectations. CONCLUSION: DFT provides a robust detection method for higher order periodicity. Easily recognizable HOR power spectrum is characterized by hierarchical two-level equidistant pattern: higher harmonics of the fundamental HOR-frequency (secondary periodicity) and a subset of pronounced peaks corresponding to constituent monomers (primary periodicity). The number of lower frequency peaks (secondary periodicity) below the frequency of the first primary periodicity peak reveals the size of nmer HOR, i.e., the number n of monomers contained in consensus HOR.


Asunto(s)
Biología Computacional/métodos , Secuencias Repetidas en Tándem , Algoritmos , Animales , Secuencia de Bases , ADN/genética , Bases de Datos de Ácidos Nucleicos , Análisis de Fourier , Humanos , Conformación de Ácido Nucleico , Análisis de Secuencia de ADN
20.
J Theor Biol ; 254(3): 555-60, 2008 Oct 07.
Artículo en Inglés | MEDLINE | ID: mdl-18625244

RESUMEN

Understanding the folding of centromere DNA in the maximally condensed methaphase chromosome remains a basic challenge in cell biology. We propose here a set of structural models with a graphical presentation of alphoid higher order repeat (HOR) distribution in the centromere folding, based on the assumption of encryption key for microtubule-centromere interaction which arises from chromosome-specific crystal-like structure of HORs. Specific HOR leads to a characteristic geometrical pattern which may be responsible for individual microtubule to recognize a specific structure of centromere in each chromosome.


Asunto(s)
Centrómero/genética , ADN/genética , Secuencias Repetitivas de Ácidos Nucleicos/genética , Algoritmos , Secuencia de Bases , Biología Computacional , Simulación por Computador , Humanos , Microtúbulos/genética , Modelos Moleculares , Datos de Secuencia Molecular , Alineación de Secuencia
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...