Your browser doesn't support javascript.
loading
Novel algorithms for LDD motif search.
Xiao, Peng; Schiller, Martin; Rajasekaran, Sanguthevar.
Afiliação
  • Xiao P; Department of Computer Science and Engineering, University of Connecticut, 371 Fairfield Road, Storrs, 06269, CT, USA.
  • Schiller M; School of Life Sciences, University of Nevada, Las Vegas, NV, USA.
  • Rajasekaran S; Department of Computer Science and Engineering, University of Connecticut, 371 Fairfield Road, Storrs, 06269, CT, USA. rajasek@engr.uconn.edu.
BMC Genomics ; 20(Suppl 5): 424, 2019 Jun 06.
Article em En | MEDLINE | ID: mdl-31167665
ABSTRACT

BACKGROUND:

Motifs are crucial patterns that have numerous applications including the identification of transcription factors and their binding sites, composite regulatory patterns, similarity between families of proteins, etc. Several motif models have been proposed in the literature. The (l,d)-motif model is one of these that has been studied widely. However, this model will sometimes report too many spurious motifs than expected. We interpret a motif as a biologically significant entity that is evolutionarily preserved within some distance. It may be highly improbable that the motif undergoes the same number of changes in each of the species. To address this issue, in this paper, we introduce a new model which is more general than (l,d)-motif model. This model is called (l,d1,d2)-motif model (LDDMS) and is NP-hard as well. We present three elegant as well as efficient algorithms to solve the LDDMS problem, i.e., LDDMS1, LDDMS2 and LDDMS3. They are all exact algorithms.

RESULTS:

We did both theoretical analyses and empirical tests on these algorithms. Theoretical analyses demonstrate that our algorithms have less computational cost than the pattern driven approach. Empirical results on both simulated datasets and real datasets show that each of the three algorithms has some advantages on some (l,d1,d2) instances.

CONCLUSIONS:

We proposed LDDMS model which is more practically relevant. We also proposed three exact efficient algorithms to solve the problem. Besides, our algorithms can be nicely parallelized. We believe that the idea in this new model can also be extended to other motif search problems such as Edit-distance-based Motif Search (EMS) and Simple Motif Search (SMS).
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos / Motivos de Aminoácidos / Motivos de Nucleotídeos Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2019 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos / Motivos de Aminoácidos / Motivos de Nucleotídeos Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2019 Tipo de documento: Article