Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 3 de 3
1.
Pac Symp Biocomput ; : 254-65, 1999.
Article En | MEDLINE | ID: mdl-10380202

Suppose that a biologist wishes to study some local property P of genetic sequences. If he can design (with a computer scientist) an algorithm C which efficiently compresses parts of the sequence which satisfy P, then our algorithm TurboOptLift locates very quickly where property P occurs by chance on a sequence, and where it occurs as a result of a significant process. Under some conditions, the time complexity of TurboOptLift is O(n log n). We illustrate its use on the practical problem of locating approximate tandem repeats in DNA sequences.


Base Sequence , DNA/chemistry , Repetitive Sequences, Nucleic Acid , Sequence Alignment , Algorithms , Computational Biology/methods , Computer Simulation , Computing Methodologies , DNA/genetics , Software
2.
Comput Appl Biosci ; 13(2): 131-6, 1997 Apr.
Article En | MEDLINE | ID: mdl-9146959

MOTIVATION: Compression algorithms can be used to analyse genetic sequences. A compression algorithm tests a given property on the sequence and uses it to encode the sequence: if the property is true, it reveals some structure of the sequence which can be described briefly, this yields a description of the sequence which is shorter than the sequence of nucleotides given in extenso. The more a sequence is compressed by the algorithm, the more significant is the property for that sequence. RESULTS: We present a compression algorithm that tests the presence of a particular type of dosDNA (defined ordered sequence-DNA): approximate tandem repeats of small motifs (i.e. of lengths < 4). This algorithm has been experimented with on four yeast chromosomes. The presence of approximate tandem repeats seems to be a uniform structural property of yeast chromosomes.


Algorithms , DNA/genetics , Repetitive Sequences, Nucleic Acid , Base Sequence , Chromosomes, Fungal/genetics , DNA, Fungal/genetics , Evaluation Studies as Topic , Molecular Sequence Data , Saccharomyces cerevisiae/genetics , Sequence Analysis, DNA/methods , Sequence Analysis, DNA/statistics & numerical data , Software
3.
Biochimie ; 78(5): 315-22, 1996.
Article En | MEDLINE | ID: mdl-8905150

A novel approach to genetic sequence analysis is presented. This approach, based on compression of algorithms, has been launched simultaneously by Grumbach and Tahi, Milosavljevic and Rivals. To reduce the description of an object, a compression algorithm replaces some regularities in the description by special codes. Thus a compression algorithm can be applied to a sequence in order to study the presence of those regularities all over the sequence. This paper explains this ability, gives examples of compression algorithms already developed and mentions their applications. Finally, the theoretical foundations of the approach are presented in an overview of the algorithmic theory of information.


Sequence Analysis/methods , Algorithms , Information Systems , Information Theory , Repetitive Sequences, Nucleic Acid
...