Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 57
Filter
1.
NAR Genom Bioinform ; 6(3): lqae074, 2024 Sep.
Article in English | MEDLINE | ID: mdl-38962254

ABSTRACT

Thousands of prolonged sequences of human ultra-conserved non-coding elements (UCNEs) share only one common feature: peculiarities in the unique composition of their dinucleotides. Here we investigate whether the numerous weak signals emanating from these dinucleotide arrangements can be used for computational identification of UCNEs within the human genome. For this purpose, we analyzed 4272 UCNE sequences, encompassing 1 393 448 nucleotides, alongside equally sized control samples of randomly selected human genomic sequences. Our research identified nine different features of dinucleotide arrangements that enable differentiation of UCNEs from the rest of the genome. We employed these nine features, implementing three Machine Learning techniques - Support Vector Machine, Random Forest, and Artificial Neural Networks - to classify UCNEs, achieving an accuracy rate of 82-84%, with specific conditions allowing for over 90% accuracy. Notably, the strongest feature for UCNE identification was the frequency ratio between GpC dinucleotides and the sum of GpG and CpC dinucleotides. Additionally, we investigated the entire pool of 31 046 SNPs located within UCNEs for their representation in the ClinVar database, which catalogs human SNPs with known phenotypic effects. The presence of UCNE-associated SNPs in ClinVar aligns with the expectation of a random distribution, emphasizing the enigmatic nature of UCNE phenotypic manifestation.

2.
Nat Commun ; 15(1): 2739, 2024 Mar 28.
Article in English | MEDLINE | ID: mdl-38548765

ABSTRACT

Non-volatile phase-change memory devices utilize local heating to toggle between crystalline and amorphous states with distinct electrical properties. Expanding on this kind of switching to two topologically distinct phases requires controlled non-volatile switching between two crystalline phases with distinct symmetries. Here, we report the observation of reversible and non-volatile switching between two stable and closely related crystal structures, with remarkably distinct electronic structures, in the near-room-temperature van der Waals ferromagnet Fe5-δGeTe2. We show that the switching is enabled by the ordering and disordering of Fe site vacancies that results in distinct crystalline symmetries of the two phases, which can be controlled by a thermal annealing and quenching method. The two phases are distinguished by the presence of topological nodal lines due to the preserved global inversion symmetry in the site-disordered phase, flat bands resulting from quantum destructive interference on a bipartite lattice, and broken inversion symmetry in the site-ordered phase.

3.
Biology (Basel) ; 12(8)2023 Aug 12.
Article in English | MEDLINE | ID: mdl-37627009

ABSTRACT

Long human ultra-conserved non-coding elements (UCNEs) do not have any sequence similarity to each other or other characteristics that make them unalterable during vertebrate evolution. We hypothesized that UCNEs have unique dinucleotide (DN) composition and arrangements compared to the rest of the genome. A total of 4272 human UCNE sequences were analyzed computationally and compared with the whole genomes of human, chicken, zebrafish, and fly. Statistical analysis was performed to assess the non-randomness in DN spacing arrangements within the entire human genome and within UCNEs. Significant non-randomness in DN spacing arrangements was observed in the entire human genome. Additionally, UCNEs exhibited distinct patterns in DN arrangements compared to the rest of the genome. Approximately 83% of all DN pairs within UCNEs showed significant (>10%) non-random genomic arrangements at short distances (2-6 nucleotides) relative to each other. At the extremes, non-randomness in DN spacing distances deviated up to 40% from expected values and were frequently associated with GpC, CpG, ApT, and GpG/CpC dinucleotides. The described peculiarities in DN arrangements have persisted for hundreds of millions of years in vertebrates. These distinctive patterns may suggest that UCNEs have specific DNA conformations.

4.
Nano Lett ; 23(1): 380-388, 2023 Jan 11.
Article in English | MEDLINE | ID: mdl-36382909

ABSTRACT

Glide-mirror symmetry in nonsymmorphic crystals can foster the emergence of novel hourglass nodal loop states. Here, we present spectroscopic signatures from angle-resolved photoemission of a predicted topological hourglass semimetal phase in Nb3SiTe6. Linear band crossings are observed at the zone boundary of Nb3SiTe6, which could be the origin of the nontrivial Berry phase and are consistent with a predicted glide quantum spin Hall effect; such linear band crossings connect to form a nodal loop. Furthermore, the saddle-like Fermi surface of Nb3SiTe6 observed in our results helps unveil linear band crossings that could be missed. In situ alkali-metal doping of Nb3SiTe6 also facilitated the observation of other band crossings and parabolic bands at the zone center correlated with accidental nodal loop states. Overall, our results complete the system's band structure, help explain prior Hall measurements, and suggest the existence of a nodal loop at the zone center of Nb3SiTe6.

5.
Genes (Basel) ; 13(11)2022 11 07.
Article in English | MEDLINE | ID: mdl-36360290

ABSTRACT

The public UCNEbase database, comprising 4273 human ultra-conserved noncoding elements (UCNEs), was thoroughly investigated with the aim to find any nucleotide signals or motifs that have made these DNA sequences practically unchanged over three hundred million years of evolution. Each UCNE comprises over 200 nucleotides and has at least 95% identity between humans and chickens. A total of 31,046 SNPs were found within the UCNE database. We demonstrated that every human has over 300 mutations within 4273 UCNEs. No association of UCNEs with non-coding RNAs, nor preference of a particular meiotic recombination rate within them were found. No sequence motifs associated with UCNEs nor their flanking regions have been found. However, we demonstrated that UCNEs have strong nucleotide and dinucleotide sequence abnormalities compared to genome averages. Specifically, UCNEs are depleted for CC and GG dinucleotides, while GC dinucleotides are in excess of 28%. Importantly, GC dinucleotides have extraordinarily strong stacking free-energy inside the DNA helix and unique resistance to dissociation. Based on the adjacent nucleotide stacking abnormalities within UCNEs, we conjecture that peculiarities in dinucleotide distribution within UCNEs may create unique 3D conformation and specificity to bind proteins. We also discuss the strange dynamics of multiple SNPs inside UCNEs and reasons why these sequences are extraordinarily conserved.


Subject(s)
Chickens , Nucleotides , Humans , Animals , Nucleotides/genetics , Base Sequence , Genome , DNA/genetics
6.
Genes (Basel) ; 13(8)2022 08 18.
Article in English | MEDLINE | ID: mdl-36011383

ABSTRACT

Common alleles tend to be more ancient than rare alleles. These common SNPs appeared thousands of years ago and reflect intricate human evolution including various adaptations, admixtures, and migration events. Eighty-four thousand abundant region-specific alleles (ARSAs) that are common in one continent but absent in the rest of the world have been characterized by processing 3100 genomes from 230 populations. Also computed were 17,446 polymorphic sites with regional absence of common alleles (RACAs), which are widespread globally but absent in one region. A majority of these region-specific SNPs were found in Africa. America has the second greatest number of ARSAs (3348) and is even ahead of Europe (1911). Surprisingly, East Asia has the highest number of RACAs (10,524) and the lowest number of ARSAs (362). ARSAs and RACAs have distinct compositions of ancestral versus derived alleles in different geographical regions, reflecting their unique evolution. Genes associated with ARSA and RACA SNPs were identified and their functions were analyzed. The core 100 genes shared by multiple populations and associated with region-specific natural selection were examined. The largest part of them (42%) are related to the nervous system. ARSA and RACA SNPs are important for both association and human evolution studies.


Subject(s)
Genomics , Polymorphism, Single Nucleotide , Africa , Alleles , Humans , Polymorphism, Single Nucleotide/genetics , Selection, Genetic
7.
Sci Adv ; 7(16)2021 Apr.
Article in English | MEDLINE | ID: mdl-33853778

ABSTRACT

Controlling electronic properties via band structure engineering is at the heart of modern semiconductor devices. Here, we extend this concept to semimetals where, using LuSb as a model system, we show that quantum confinement lifts carrier compensation and differentially affects the mobility of the electron and hole-like carriers resulting in a strong modification in its large, nonsaturating magnetoresistance behavior. Bonding mismatch at the heteroepitaxial interface of a semimetal (LuSb) and a semiconductor (GaSb) leads to the emergence of a two-dimensional, interfacial hole gas. This is accompanied by a charge transfer across the interface that provides another avenue to modify the electronic structure and magnetotransport properties in the ultrathin limit. Our work lays out a general strategy of using confined thin-film geometries and heteroepitaxial interfaces to engineer electronic structure in semimetallic systems, which allows control over their magnetoresistance behavior and simultaneously provides insights into its origin.

8.
Biology (Basel) ; 9(11)2020 Nov 10.
Article in English | MEDLINE | ID: mdl-33182715

ABSTRACT

We performed an exhaustive pairwise comparison of whole-genome sequences of 3120 individuals, representing 232 populations from all continents and seven prehistoric people including archaic and modern humans. In order to reveal an intricate picture of worldwide human genetic relatedness, 65 million very rare single nucleotide polymorphic (SNP) alleles have been bioinformatically processed. The number and size of shared identical-by-descent (IBD) genomic fragments for every pair of 3127 individuals have been revealed. Over 17 million shared IBD fragments have been described. Our approach allowed detection of very short IBD fragments (<20 kb) that trace common ancestors who lived up to 200,000 years ago. We detected nine distinct geographical regions within which individuals had strong genetic relatedness, but with negligible relatedness between the populations of these regions. The regions, comprising nine unique genetic components for mankind, are the following: East and West Africa, Northern Europe, Arctica, East Asia, Oceania, South Asia, Middle East, and South America. The level of admixture in every studied population has been apportioned among these nine genetic components. Genetically, long-term neighboring populations are strikingly similar to each other in spite of any political, religious, and cultural differences. The topmost admixture has been observed at the center of Eurasia. These admixed populations (including Uyghurs, Azerbaijanis, Uzbeks, and Iranians) have roughly equal genetic contributions from the Middle East, Europe, China, and India, with additional significant traces from Africa and Arctic. The entire picture of relatedness of all the studied populations unfolds and presents itself in the form of shared number/size of IBDs.

9.
PLoS One ; 15(4): e0232167, 2020.
Article in English | MEDLINE | ID: mdl-32353016

ABSTRACT

We examined seventy million well-characterized human mutations, and their impact on G+C-compositional dynamics, in order to understand the formation and maintenance of major genomic nucleotide sequence patterns. Among novel mutations, those that change a strong (S) base pair G:C/C:G to a weak (W) pair A:T/T:A occur at nearly twice the frequency of the opposite mutations. Such imbalance puts strong downward pressure on overall GC-content. However, along protracted paths to fixation, S→W mutations are much less likely to propagate than W→S mutations. The magnitude of relative propagation disadvantages for S→W mutations is inexplicable by any currently-accepted model. This fact forced us to re-examine the quantitative features of Biased Gene Conversion (BGC) theory. Revised parameters of BGC that, per average individual, convert 7-14 W base pairs into S pairs, would account for the S-content turnover differences between new and old mutations, and make BGC an instrumental force for nucleotide dynamics and evolution. BGC should thus be considered seriously in both theories and biomedical practice. In particular, BGC should be taken into account during allele imputations, where missing SNP alleles are computationally predicted based on the information about several neighboring alleles. Finally, we analyzed the effect of neighboring nucleotide context on the mutation frequencies, dynamics, and GC-composition turnover. For this purpose, we examined genomic regions having extremely biased nucleotide compositions (enriched for S-, W-, purine/pyrimidine strand asymmetry, or AC/GT-strand asymmetry). It was found that point mutations in these regions preferentially degrade the nucleotide inhomogeneities, decreasing the sequence biases. Degradation of sequence bias is highest for novel mutations, and considerably lower for older mutations (those widespread across populations). Besides BGC, there may be additional, still uncharacterized molecular mechanisms that either preserve genomic regions with biased nucleotide compositions from mutational degradation or fail to degrade such inhomogeneities in specific chromosomal regions.


Subject(s)
Base Composition/genetics , Gene Conversion/genetics , Genome, Human/genetics , Alleles , Evolution, Molecular , Humans , Mutation Rate , Point Mutation/genetics , Polymorphism, Single Nucleotide/genetics
10.
PLoS One ; 15(2): e0228778, 2020.
Article in English | MEDLINE | ID: mdl-32023328

ABSTRACT

Natural selection of beneficial genetic variants played a critical role in human adaptation to a wide range of environmental conditions. Northern Eurasia, despite its severe climate, is home to lots of ethnically diverse populations. The genetic variants associated with the survival of these populations have hardly been analyzed. We searched for the genomic signatures of positive selection in (1) the genome-wide microarray data of 432 people from eight different northern Russian populations and (2) the whole-genome sequences of 250 people from Northern Eurasia from a public repository through testing the extended haplotype homozigosity (EHH) and direct comparison of allele frequency, respectively. The 20 loci with the strongest selection signals were characterized in detail. Among the top EHH hits were the NRG3 and NBEA genes, which are involved in the development and functioning of the neural system, the PTPRM gene, which mediates cell-cell interactions and adhesion, and a region on chromosome 4 (chr4:28.7-28.9 Mb) that contained several loci affiliated with different classes of non-coding RNAs (RN7SL101P, MIR4275, MESTP3, and LINC02364). NBEA and the region on chromosome 4 were novel selection targets that were identified for the first time in Western Siberian populations. Cross-population comparisons of EHH profiles suggested a particular role for the chr4:28.7-28.9 Mb region in the local adaptation of Western Siberians. The strongest selection signal identified in Siberian sequenced genomes was formed by six SNPs on chromosome 11 (chr11:124.9-125.2 Mb). This region included well-known genes SLC37A2 and PKNOX2. SLC37A2 is most-highly expressed in the gut. Its expression is regulated by vitamin D, which is often deficient in northern regions. The PKNOX2 gene is a transcription factor of the homeobox family that is expressed in the brain and many other tissues. This gene is associated with alcohol addiction, which is widespread in many Northern Eurasian populations.


Subject(s)
Evolution, Molecular , Genetics, Population , Genomics , Selection, Genetic , Gene Frequency , Genome, Human/genetics , Humans , Polymorphism, Single Nucleotide , Russia
11.
ACS Nano ; 13(5): 5611-5615, 2019 May 28.
Article in English | MEDLINE | ID: mdl-30987421

ABSTRACT

Chemical conversion by atomic substitution offers a powerful route toward the creation of unusual structures and functionalities. Here, we demonstrate the progressive transformation of single-layer TiTe2 into TiSe2 by reaction with a Se flux in vacuum. Angle-resolved photoemission spectroscopy and scanning tunneling microscopy reveal intriguing reaction patterns involving TiSe2 island ingrowth starting from the TiTe2 island edges, while the band structure and core level signatures of TiSe2 grow in intensity at the expense of those corresponding to TiTe2. Lattice mismatch between TiTe2 and TiSe2 results in misfit holes and lattice distortions over a distance behind a seamless fingerlike reaction front. The regions of TiSe2 and TiTe2 are distinguished by a height difference and a charge density wave (CDW) at different transition temperatures. The method of in situ chemical conversion offers opportunities for atomic-scale engineering of layered transition metal dichalcogenides that host useful properties arising from CDW, Dirac, Weyl, superconducting, spin-valley, and magnetic structures.

12.
BMC Genomics ; 19(1): 256, 2018 Apr 16.
Article in English | MEDLINE | ID: mdl-29661137

ABSTRACT

BACKGROUND: GC-Biased Gene Conversion (gBGC) is one of the important theories put forward to explain profound long-range non-randomness in nucleotide compositions along mammalian chromosomes. Nucleotide changes due to gBGC are hard to distinguish from regular mutations. Here, we present an algorithm for analysis of millions of known SNPs that detects a subset of so-called "SNP flip-over" events representing recent gBGC nucleotide changes, which occurred in previous generations via non-crossover meiotic recombination. RESULTS: This algorithm has been applied in a large-scale analysis of 1092 sequenced human genomes. Altogether, 56,328 regions on all autosomes have been examined, which revealed 223,955 putative gBGC cases leading to SNP flip-overs. We detected a strong bias (11.7% ± 0.2% excess) in AT- > GC over GC- > AT base pair changes within the entire set of putative gBGC cases. CONCLUSIONS: On average, a human gamete acquires 7 SNP flip-over events, in which one allele is replaced by its complementary allele during the process of meiotic non-crossover recombination. In each meiosis event, on average, gBGC results in replacement of 7 AT base pairs by GC base pairs, while only 6 GC pairs are replaced by AT pairs. Therefore, every human gamete is enriched by one GC pair. Happening over millions of years of evolution, this bias may be a noticeable force in changing the nucleotide composition landscape along chromosomes.


Subject(s)
Gene Conversion , Genome, Human , Algorithms , Base Composition , Chromosomes, Human , DNA/chemistry , Haplotypes , Humans , Polymorphism, Single Nucleotide
13.
BMC Genomics ; 18(1): 433, 2017 06 05.
Article in English | MEDLINE | ID: mdl-28583085

ABSTRACT

BACKGROUND: Inferring history from genomic sequences is challenging and problematic because chromosomes are mosaics of thousands of small Identicalby-descent (IBD) fragments, each of them having their own unique story. However, the main events in recent evolution might be deciphered from comparative analysis of numerous loci. A paradox of why humans, whose effective population size is only 104, have nearly three million frequent SNPs is formulated and examined. RESULTS: We studied 5398 loci evenly covering all human autosomes. Common haplotypes built from frequent SNPs that are present in people from various populations have been examined. We demonstrated highly non-random arrangement of alleles in common haplotypes. Abundance of mutually exclusive pairs of common haplotypes that have different alleles at every polymorphic position (so-called Yin/Yang haplotypes) was found in 56% of loci. A novel widely spread category of common haplotypes named Mosaic has been described. Mosaic consists of numerous pieces of Yin/Yang haplotypes and represents an ancestral stage of one of them. Scenarios of possible appearance of large number of frequent human SNPs and their habitual arrangement in Yin/Yang common haplotypes have been evaluated with an advanced genomic simulation algorithm. CONCLUSIONS: Computer modeling demonstrated that the observed arrangement of 2.9 million frequent SNPs could not originate from a sole stand-alone population. A "Great Admixture" event has been proposed that can explain peculiarities with frequent SNP distributions. This Great Admixture presumably occurred 100-300 thousand years ago between two ancestral populations that had been separated from each other about a million years ago. Our programs and algorithms can be applied to other species to perform evolutionary and comparative genomics.


Subject(s)
Genomics , Haplotypes , Polymorphism, Single Nucleotide , Alleles , Computer Simulation , Genetic Loci/genetics , Humans
14.
J Phys Condens Matter ; 28(44): 444002, 2016 11 09.
Article in English | MEDLINE | ID: mdl-27604538

ABSTRACT

In van der Waals heterostructures, the periodic potential from the Moiré superlattice can be used as a control knob to modulate the electronic structure of the constituent materials. Here we present a nanoscale angle-resolved photoemission spectroscopy (nano-ARPES) study of transferred graphene/h-BN heterostructures with two different stacking angles of 2.4° and 4.3° respectively. Our measurements reveal six replicas of graphene Dirac cones at the superlattice Brillouin zone (SBZ) centers. The size of the SBZ and its relative rotation angle to the graphene BZ are in good agreement with Moiré superlattice period extracted from atomic force microscopy (AFM) measurements. Comparison to the epitaxial graphene/h-BN with 0° stacking angles suggests that the interaction between graphene and h-BN decreases with increasing stacking angle.

15.
Genome Biol Evol ; 8(3): 777-90, 2016 Feb 23.
Article in English | MEDLINE | ID: mdl-26907499

ABSTRACT

A novel computational method for detecting identical-by-descent (IBD) chromosomal segments between sequenced genomes is presented. It utilizes the distribution patterns of very rare genetic variants (vrGVs), which have minor allele frequencies <0.2%. Contrary to the existing probabilistic approaches our method is rather deterministic, because it considers a group of very rare events which cannot happen together only by chance. This method has been applied for exhaustive computational search of shared IBD segments among 1,092 sequenced individuals from 14 populations. It demonstrated that clusters of vrGVs are unique and powerful markers of genetic relatedness, that uncover IBD chromosomal segments between and within populations, irrespective of whether divergence was recent or occurred hundreds-to-thousands of years ago. We found that several IBD segments are shared by practically any possible pair of individuals belonging to the same population. Moreover, shared short IBD segments (median size 183 kb) were found in 10% of inter-continental human pairs, each comprising of a person from sub-Saharan Africa and a person from Southern Europe. The shortest shared IBD segments (median size 54 kb) were found in 0.42% of inter-continental pairs composed of individuals from Chinese/Japanese populations and Africans from Kenya and Nigeria. Knowledge of inheritance of IBD segments is important in clinical case-control and cohort studies, since unknown distant familial relationships could compromise interpretation of collected data. Clusters of vrGVs should be useful markers for familial relationship and common multifactorial disorders.


Subject(s)
Gene Frequency/genetics , Genetic Variation , Genetics, Population , Genome, Human , Asian People/genetics , Black People/genetics , Chromosomes/genetics , High-Throughput Nucleotide Sequencing , Humans
16.
Nano Lett ; 15(3): 2031-6, 2015 Mar 11.
Article in English | MEDLINE | ID: mdl-25710329

ABSTRACT

The interaction between magnetic impurities and the gapless surface state is of critical importance for realizing novel quantum phenomena and new functionalities in topological insulators. By combining angle-resolved photoemission spectroscopic experiments with density functional theory calculations, we show that surface deposition of Cr atoms on Bi2Se3 does not lead to gap opening of the surface state at the Dirac point, indicating the absence of long-range out-of-plane ferromagnetism down to our measurement temperature of 15 K. This is in sharp contrast to bulk Cr doping, and the origin is attributed to different Cr occupation sites. These results highlight the importance of nanoscale configuration of doped magnetic impurities in determining the electronic and magnetic properties of topological insulators.

17.
Genomics ; 105(5-6): 282-7, 2015 May.
Article in English | MEDLINE | ID: mdl-25708667

ABSTRACT

In 1974, Takeo Maruyama deduced that neutral mutations should, on average, be older than deleterious or beneficial ones. This theory is based on the diffusion approximation for a branching process, which considers mutations independently of one another and not as multiple groups of interconnected mutations with strong linkage disequilibrium (haplotypes). However, mammalian genomes contain thousands of haplotypes, in which beneficial, neutral, and deleterious mutations are tightly linked to each other. This complex haplotype organization should not be ignored for estimation of allelic ages. We employed our GEMA computer simulation program for genome evolution to re-evaluate Maruyama's phenomenon in modeled populations that include haplotypes approximating real genomes. We determined that only under specific conditions (high recombination rates and abundance of neutral mutations), the deleterious and beneficial mutations are younger than neutral ones as predicted by Maruyama. Under other conditions, the ages of negative, neutral, and beneficial mutations were almost the same.


Subject(s)
Alleles , Computer Simulation , Genome, Human , Mutation , Age Factors , Genetic Linkage , Haplotypes , Humans , Models, Genetic
18.
Genome Biol Evol ; 7(2): 481-92, 2015 Jan 07.
Article in English | MEDLINE | ID: mdl-25573959

ABSTRACT

Nucleotide sequence differences on the whole-genome scale have been computed for 1,092 people from 14 populations publicly available by the 1000 Genomes Project. Total number of differences in genetic variants between 96,464 human pairs has been calculated. The distributions of these differences for individuals within European, Asian, or African origin were characterized by narrow unimodal peaks with mean values of 3.8, 3.5, and 5.1 million, respectively, and standard deviations of 0.1-0.03 million. The total numbers of genomic differences between pairs of all known relatives were found to be significantly lower than their respective population means and in reverse proportion to the distance of their consanguinity. By counting the total number of genomic differences it is possible to infer familial relations for people that share down to 6% of common loci identical-by-descent. Detection of familial relations can be radically improved when only very rare genetic variants are taken into account. Counting of total number of shared very rare single nucleotide polymorphisms (SNPs) from whole-genome sequences allows establishing distant familial relations for persons with eighth and ninth degrees of relationship. Using this analysis we predicted 271 distant familial pairwise relations among 1,092 individuals that have not been declared by 1000 Genomes Project. Particularly, among 89 British and 97 Chinese individuals we found three British-Chinese pairs with distant genetic relationships. Individuals from these pairs share identical-by-descent DNA fragments that represent 0.001%, 0.004%, and 0.01% of their genomes. With affordable whole-genome sequencing techniques, very rare SNPs should become important genetic markers for familial relationships and population stratification.


Subject(s)
Genetic Variation , Genome, Human , Phylogeny , Chromosomes, Human/genetics , Genetics, Population , Humans
19.
Gene ; 548(1): 81-90, 2014 Sep 10.
Article in English | MEDLINE | ID: mdl-25014137

ABSTRACT

Orthologous introns have identical positions relative to the coding sequence in orthologous genes of different species. By analyzing the complete genomes of five plants we generated a database of 40,512 orthologous intron groups of dicotyledonous plants, 28,519 orthologous intron groups of angiosperms, and 15,726 of land plants (moss and angiosperms). Multiple sequence alignments of each orthologous intron group were obtained using the Mafft algorithm. The number of conserved regions in plant introns appeared to be hundreds of times fewer than that in mammals or vertebrates. Approximately three quarters of conserved intronic regions among angiosperms and dicots, in particular, correspond to alternatively-spliced exonic sequences. We registered only a handful of conserved intronic ncRNAs of flowering plants. However, the most evolutionarily conserved intronic region, which is ubiquitous for all plants examined in this study, including moss, possessed multiple structural features of tRNAs, which caused us to classify it as a putative tRNA-like ncRNA. Intronic sequences encoding tRNA-like structures are not unique to plants. Bioinformatics examination of the presence of tRNA inside introns revealed an unusually long-term association of four glycine tRNAs inside the Vac14 gene of fish, amniotes, and mammals.


Subject(s)
Introns , Magnoliopsida/genetics , RNA, Plant , Algorithms , Animals , Arabidopsis/genetics , Arabidopsis/growth & development , Base Sequence , Bryophyta/genetics , Computational Biology/methods , Conserved Sequence , Databases, Genetic , Flowers/genetics , Genome, Plant , Humans , Mice , Molecular Sequence Data , Oryza/genetics , Phylogeny , Populus/genetics , RNA, Plant/chemistry , RNA, Transfer/genetics , Vitis/genetics
20.
Genome Biol Evol ; 6(4): 988-99, 2014 Apr.
Article in English | MEDLINE | ID: mdl-24723728

ABSTRACT

Mammalian genomes are replete with millions of polymorphic sites, among which those genetic variants that are colocated on the same chromosome and exist close to one another form blocks of closely linked mutations known as haplotypes. The linkage within haplotypes is constantly disrupted due to meiotic recombination events. Whole ensembles of such numerous haplotypes are subjected to evolutionary pressure, where mutations influence each other and should be considered as a whole entity-a gigantic matrix, unique for each individual genome. This idea was implemented into a computational approach, named Genome Evolution by Matrix Algorithms (GEMA) to model genomic changes taking into account all mutations in a population. GEMA has been tested for modeling of entire human chromosomes. The program can precisely mimic real biological processes that have influence on genome evolution such as: 1) Authentic arrangements of genes and functional genomic elements, 2) frequencies of various types of mutations in different nucleotide contexts, and 3) nonrandom distribution of meiotic recombination events along chromosomes. Computer modeling with GEMA has demonstrated that the number of meiotic recombination events per gamete is among the most crucial factors influencing population fitness. In humans, these recombinations create a gamete genome consisting on an average of 48 pieces of corresponding parental chromosomes. Such highly mosaic gamete structure allows preserving fitness of population under the intense influx of novel mutations (40 per individual) even when the number of mutations with deleterious effects is up to ten times more abundant than those with beneficial effects.


Subject(s)
Algorithms , Chromosomes, Human/genetics , Evolution, Molecular , Genome, Human/physiology , Models, Genetic , Sequence Analysis, DNA/methods , Software , Humans , Recombination, Genetic/physiology
SELECTION OF CITATIONS
SEARCH DETAIL