Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 7 de 7
Filter
Add more filters











Database
Language
Publication year range
1.
Int J Mol Sci ; 23(7)2022 Mar 29.
Article in English | MEDLINE | ID: mdl-35409113

ABSTRACT

In the course of evolution, both flowering plants and some gymnosperms have developed such an adaptation to winter and unfavorable living conditions as deciduousness. Of particular interest is Siberian larch (Larix sibirica Ledeb.), which is the only species in the pine family (Pinaceae) with a seasonal deciduousness. New generation sequencing technologies make it possible to study this phenomenon at the genomic level and to reveal the genetic mechanisms of leaf and needle aging in angiosperms and gymnosperms. Using a comparative analysis of the genomes of evergreen and deciduous trees, it was found that the genes that control EXORDIUM LIKE 2 (EXL2) and DORMANCY-ASSOCIATED PROTEIN 1 (DRM1) proteins are most represented in Siberian larch, while an excess of genes that control proteins acting as immune receptors were found in evergreens. Orthologs from the family of genes that control leucine-rich repeat receptor-like kinases (LRR-RLK) contributed mostly to the distinction between evergreens and deciduous plants.


Subject(s)
Larix , Magnoliopsida , Tracheophyta , Forests , Genomics , Larix/genetics , Seasons , Trees/genetics
2.
PLoS One ; 17(3): e0264640, 2022.
Article in English | MEDLINE | ID: mdl-35259178

ABSTRACT

The SARS-CoV-2 is the third coronavirus in addition to SARS-CoV and MERS-CoV that causes severe respiratory syndrome in humans. All of them likely crossed the interspecific barrier between animals and humans and are of zoonotic origin, respectively. The origin and evolution of viruses and their phylogenetic relationships are of great importance for study of their pathogenicity and development of antiviral drugs and vaccines. The main objective of the presented study was to compare two methods for identifying relationships between coronavirus genomes: phylogenetic one based on the whole genome alignment followed by molecular phylogenetic tree inference and alignment-free clustering of triplet frequencies, respectively, using 69 coronavirus genomes selected from two public databases. Both approaches resulted in well-resolved robust classifications. In general, the clusters identified by the first approach were in good agreement with the classes identified by the second using K-means and the elastic map method, but not always, which still needs to be explained. Both approaches demonstrated also a significant divergence of genomes on a taxonomic level, but there was less correspondence between genomes regarding the types of diseases they caused, which may be due to the individual characteristics of the host. This research showed that alignment-free methods are efficient in combination with alignment-based methods. They have a significant advantage in computational complexity and provide valuable additional alternative information on the genomes relationships.


Subject(s)
Comparative Genomic Hybridization/methods , Coronavirus/genetics , Genome, Viral , Chromosome Mapping , Cluster Analysis , Coronavirus/classification , Humans , Phylogeny , SARS-CoV-2/classification , SARS-CoV-2/genetics , Sequence Alignment
3.
BMC Genomics ; 21(1): 654, 2020 Sep 23.
Article in English | MEDLINE | ID: mdl-32972367

ABSTRACT

BACKGROUND: Plant mitochondrial genomes (mitogenomes) can be structurally complex while their size can vary from ~ 222 Kbp in Brassica napus to 11.3 Mbp in Silene conica. To date, in comparison with the number of plant species, only a few plant mitogenomes have been sequenced and released, particularly for conifers (the Pinaceae family). Conifers cover an ancient group of land plants that includes about 600 species, and which are of great ecological and economical value. Among them, Siberian larch (Larix sibirica Ledeb.) represents one of the keystone species in Siberian boreal forests. Yet, despite its importance for evolutionary and population studies, the mitogenome of Siberian larch has not yet been assembled and studied. RESULTS: Two sources of DNA sequences were used to search for mitochondrial DNA (mtDNA) sequences: mtDNA enriched samples and nucleotide reads generated in the de novo whole genome sequencing project, respectively. The assembly of the Siberian larch mitogenome contained nine contigs, with the shortest and the largest contigs being 24,767 bp and 4,008,762 bp, respectively. The total size of the genome was estimated at 11.7 Mbp. In total, 40 protein-coding, 34 tRNA, and 3 rRNA genes and numerous repetitive elements (REs) were annotated in this mitogenome. In total, 864 C-to-U RNA editing sites were found for 38 out of 40 protein-coding genes. The immense size of this genome, currently the largest reported, can be partly explained by variable numbers of mobile genetic elements, and introns, but unlikely by plasmid-related sequences. We found few plasmid-like insertions representing only 0.11% of the entire Siberian larch mitogenome. CONCLUSIONS: Our study showed that the size of the Siberian larch mitogenome is much larger than in other so far studied Gymnosperms, and in the same range as for the annual flowering plant Silene conica (11.3 Mbp). Similar to other species, the Siberian larch mitogenome contains relatively few genes, and despite its huge size, the repeated and low complexity regions cover only 14.46% of the mitogenome sequence.


Subject(s)
Genome Size , Genome, Mitochondrial , Larix/genetics , Contig Mapping , Molecular Sequence Annotation , Plant Proteins/genetics , RNA, Ribosomal/genetics , RNA, Transfer/genetics , Repetitive Sequences, Nucleic Acid
4.
BMC Bioinformatics ; 21(Suppl 2): 83, 2020 Mar 11.
Article in English | MEDLINE | ID: mdl-32164552

ABSTRACT

BACKGROUND: Previously, a seven-cluster pattern claiming to be a universal one in bacterial genomes has been reported. Keeping in mind the most popular theory of chloroplast origin, we checked whether a similar pattern is observed in chloroplast genomes. RESULTS: Surprisingly, eight cluster structure has been found, for chloroplasts. The pattern observed for chloroplasts differs rather significantly, from bacterial one, and from that latter observed for cyanobacteria. The structure is provided by clustering of the fragments of equal length isolated within a genome so that each fragment is converted in triplet frequency dictionary with non-overlapping triplets with no gaps in frame tiling. The points in 63-dimensional space were clustered due to elastic map technique. The eight cluster found in chloroplasts comprises the fragments of a genome bearing tRNA genes and exhibiting excessively high GC-content, in comparison to the entire genome. CONCLUSION: Chloroplasts exhibit very specific symmetry type in distribution of coding and non-coding fragments of a genome in the space of triplet frequencies: this is mirror symmetry. Cyanobacteria may have both mirror symmetry, and the rotational symmetry typical for other bacteria.


Subject(s)
Chloroplasts/genetics , Genome, Chloroplast , Base Composition , Cluster Analysis , Cyanobacteria/genetics , RNA, Transfer/genetics
5.
Theory Biosci ; 127(2): 69-78, 2008 Jun.
Article in English | MEDLINE | ID: mdl-18443840

ABSTRACT

Information capacity of nucleotide sequences measures the unexpectedness of a continuation of a given string of nucleotides, thus having a sound relation to a variety of biological issues. A continuation is defined in a way maximizing the entropy of the ensemble of such continuations. The capacity is defined as a mutual entropy of real frequency dictionary of a sequence with respect to the one bearing the most expected continuations; it does not depend on the length of strings contained in a dictionary. Various genomes exhibit a multi-minima pattern of the dependence of information capacity on the string length, thus reflecting an order within a sequence. The strings with significant deviation of an expected frequency from the real one are the words of increased information value. Such words exhibit a non-random distribution alongside a sequence, thus making it possible to retrieve the correlation between a structure, and a function encoded within a sequence.


Subject(s)
Algorithms , Chromosome Mapping/methods , Computational Biology/methods , DNA/genetics , Information Storage and Retrieval/methods , Models, Genetic , Sequence Analysis, DNA/methods , Base Sequence , Computer Simulation , Molecular Sequence Data , Structure-Activity Relationship
6.
Bull Math Biol ; 65(2): 309-22, 2003 Mar.
Article in English | MEDLINE | ID: mdl-12675334

ABSTRACT

A new method to compare two (or several) symbol sequences is developed. The method is based on the comparison of the frequencies of the small fragments of the compared sequences; it requires neither string editing, nor other transformations of the compared objects. The comparison is executed through a calculation of the specific entropy of a frequency dictionary against the special dictionary called the hybrid one; this latter is the statistical ancestor of the group of sequences under comparison. Some applications of the developed method in the fields of genetics and bioinformatics are discussed.


Subject(s)
Nucleotides/chemistry , Sequence Analysis/methods , Animals , Base Sequence , Databases, Factual , Entropy , Sequence Alignment/methods , Sequence Homology, Nucleic Acid , Statistics as Topic
7.
J Biol Phys ; 29(1): 23-38, 2003 Mar.
Article in English | MEDLINE | ID: mdl-23345817

ABSTRACT

The information capacity of nucleotide sequences is defined through the calculation of specific entropy of their frequency dictionary. The specificentropy of the frequency dictionary is calculated against the reconstructeddictionary; this latter bears the most probable continuations of the shorterstrings. This developed measure allows to distinguish the sequences both from the randons ones, and from those with high level of (rather simple) order. Some implications of the developed methodology in the fields of genetics,bioinformatics, and molecular biology are discussed.

SELECTION OF CITATIONS
SEARCH DETAIL