Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
Mais filtros

Base de dados
Tipo de documento
País de afiliação
Intervalo de ano de publicação
1.
J Biomed Inform ; 43(5): 800-4, 2010 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-20546935

RESUMO

Single Nucleotide Polymorphisms (SNPs) provide valuable information on human evolutionary history and may lead us to identify genetic variants responsible for human complex diseases. Unfortunately, molecular haplotyping methods are costly, laborious, and time consuming; therefore, algorithms for constructing full haplotype patterns from small available data through computational methods, Tag SNP selection problem, are convenient and attractive. This problem is proved to be an NP-hard problem, so heuristic methods may be useful. In this paper we present a heuristic method based on genetic algorithm to find reasonable solution within acceptable time. The algorithm was tested on a variety of simulated and experimental data. In comparison with the exact algorithm, based on brute force approach, results show that our method can obtain optimal solutions in almost all cases and runs much faster than exact algorithm when the number of SNP sites is large. Our software is available upon request to the corresponding author.


Assuntos
Algoritmos , Biologia Computacional/métodos , Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Software , Simulação por Computador , Doença/genética , Predisposição Genética para Doença/epidemiologia , Predisposição Genética para Doença/genética , Haplótipos , Humanos
2.
BMC Bioinformatics ; 10: 93, 2009 Mar 20.
Artigo em Inglês | MEDLINE | ID: mdl-19302709

RESUMO

BACKGROUND: Pattern discovery in DNA sequences is one of the most fundamental problems in molecular biology with important applications in finding regulatory signals and transcription factor binding sites. An important task in this problem is to search (or predict) known binding sites in a new DNA sequence. For this reason, all subsequences of the given DNA sequence are scored based on an scoring function and the prediction is done by selecting the best score. By assuming no dependency between binding site base positions, most of the available tools for known binding site prediction are designed. Recently Tomovic and Oakeley investigated the statistical basis for either a claim of dependence or independence, to determine whether such a claim is generally true, and they presented a scoring function for binding site prediction based on the dependency between binding site base positions. Our primary objective is to investigate the scoring functions which can be used in known binding site prediction based on the assumption of dependency or independency in binding site base positions. RESULTS: We propose a new scoring function based on the dependency between all positions in biding site base positions. This scoring function uses joint information content and mutual information as a measure of dependency between positions in transcription factor binding site. Our method for modeling dependencies is simply an extension of position independency methods. We evaluate our new scoring function on the real data sets extracted from JASPAR and TRANSFAC data bases, and compare the obtained results with two other well known scoring functions. CONCLUSION: The results demonstrate that the new approach improves known binding site discovery and show that the joint information content and mutual information provide a better and more general criterion to investigate the relationships between positions in the TFBS. Our scoring function is formulated by simple mathematical calculations. By implementing our method on several biological data sets, it can be induced that this method performs better than methods that do not consider dependencies.


Assuntos
Biologia Computacional/métodos , DNA/química , Análise de Sequência de DNA/métodos , Sequência de Bases , Sítios de Ligação
3.
BMC Bioinformatics ; 10: 318, 2009 Oct 04.
Artigo em Inglês | MEDLINE | ID: mdl-19799800

RESUMO

BACKGROUND: Complex networks are studied across many fields of science and are particularly important to understand biological processes. Motifs in networks are small connected sub-graphs that occur significantly in higher frequencies than in random networks. They have recently gathered much attention as a useful concept to uncover structural design principles of complex networks. Existing algorithms for finding network motifs are extremely costly in CPU time and memory consumption and have practically restrictions on the size of motifs. RESULTS: We present a new algorithm (Kavosh), for finding k-size network motifs with less memory and CPU time in comparison to other existing algorithms. Our algorithm is based on counting all k-size sub-graphs of a given graph (directed or undirected). We evaluated our algorithm on biological networks of E. coli and S. cereviciae, and also on non-biological networks: a social and an electronic network. CONCLUSION: The efficiency of our algorithm is demonstrated by comparing the obtained results with three well-known motif finding tools. For comparison, the CPU time, memory usage and the similarities of obtained motifs are considered. Besides, Kavosh can be employed for finding motifs of size greater than eight, while most of the other algorithms have restriction on motifs with size greater than eight. The Kavosh source code and help files are freely available at: http://Lbb.ut.ac.ir/Download/LBBsoft/Kavosh/.


Assuntos
Algoritmos , Biologia Computacional/métodos , Software , Escherichia coli/genética , Redes Neurais de Computação , Saccharomyces cerevisiae/genética
4.
Genes Genet Syst ; 84(1): 81-93, 2009 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-19420804

RESUMO

In this paper a novel genetic algorithm is presented for the dyad motif finding problem. The genetic algorithm uses a multi-objective fitness function based on the sum of pairs, the number of matches, and the information content. The individuals required for the population pool in the genetic algorithm are optimized by Gibbs sampling method. Also, new crossover and mutation operators are designed. The algorithm is implemented and tested on the different types of real datasets. The results are compared with other well-known algorithms and the effectiveness of our algorithm is shown.


Assuntos
Algoritmos , Bases de Dados Genéticas , Análise de Sequência de DNA/métodos
5.
Biosystems ; 82(1): 52-60, 2005 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-15982801

RESUMO

In this paper, we present a new DNA-based evaluation algorithm for a Boolean circuit that employs standard bio-molecular techniques. The algorithm operates on an unbounded fan-in Boolean circuit consisting of AND and OR gates. The whole simulation of our algorithm is proposed in a single test tube in O(1) time complexity and is much easier to implement in the laboratory than previously described models. Furthermore, the algorithm allows for evaluating any number of Boolean circuits in parallel in a single test tube.


Assuntos
Algoritmos , Computadores Moleculares , DNA/química , DNA/genética , Modelos Logísticos , Modelos Genéticos , Processamento de Sinais Assistido por Computador , Simulação por Computador
6.
Int J Bioinform Res Appl ; 9(6): 584-94, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-24084239

RESUMO

One of the fundamental problems in computational biology is the construction of physical maps of chromosomes from the hybridisation experiments between unique probes and clones of chromosome fragments. Before introducing the shotgun sequencing method, Partial Digest Problem (PDP) was an intractable problem used to construct the physical maps of DNA sequence in molecular biology. In this paper, we develop a novel Genetic Algorithm (GA) for solving the PDP. This algorithm is implemented and compared with well-known existing algorithms on different types of random and real instances data, and the obtained results show the efficiency of our algorithm. Also, our GA is adapted to handle the erroneous data and their efficiency is presented for the large instances of this problem.


Assuntos
Algoritmos , Sequência de Bases , Genômica/métodos , Mapeamento Cromossômico/métodos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA