Pesquisa | Portal de Pesquisa da BVS Enfermagem

DIALIGN-TX and multiple protein alignment using secondary structure information at GOBICS.

Subramanian, Amarendran R; Hiran, Suvrat; Steinkamp, Rasmus; Meinicke, Peter; Corel, Eduardo; Morgenstern, Burkhard.

Nucleic Acids Res ; 38(Web Server issue): W19-22, 2010 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-20497995

RESUMO

We introduce web interfaces for two recent extensions of the multiple-alignment program DIALIGN. DIALIGN-TX combines the greedy heuristic previously used in DIALIGN with a more traditional 'progressive' approach for improved performance on locally and globally related sequence sets. In addition, we offer a version of DIALIGN that uses predicted protein secondary structures together with primary sequence information to construct multiple protein alignments. Both programs are available through 'Göttingen Bioinformatics Compute Server' (GOBICS).

Assuntos

Estrutura Secundária de Proteína , Alinhamento de Sequência/métodos , Análise de Sequência de Proteína , Software , Internet

DIALIGN-T: an improved algorithm for segment-based multiple sequence alignment.

Subramanian, Amarendran R; Weyer-Menkhoff, Jan; Kaufmann, Michael; Morgenstern, Burkhard.

BMC Bioinformatics ; 6: 66, 2005 Mar 22.

Artigo em Inglês | MEDLINE | ID: mdl-15784139

RESUMO

BACKGROUND: We present a complete re-implementation of the segment-based approach to multiple protein alignment that contains a number of improvements compared to the previous version 2.2 of DIALIGN. This previous version is superior to Needleman-Wunsch-based multi-alignment programs on locally related sequence sets. However, it is often outperformed by these methods on data sets with global but weak similarity at the primary-sequence level. RESULTS: In the present paper, we discuss strengths and weaknesses of DIALIGN in view of the underlying objective function. Based on these results, we propose several heuristics to improve the segment-based alignment approach. For pairwise alignment, we implemented a fragment-chaining algorithm that favours chains of low-scoring local alignments over isolated high-scoring fragments. For multiple alignment, we use an improved greedy procedure that is less sensitive to spurious local sequence similarities. To evaluate our method on globally related protein families, we used the well-known database BAliBASE. For benchmarking tests on locally related sequences, we created a new reference database called IRMBASE which consists of simulated conserved motifs implanted into non-related random sequences. CONCLUSION: On BAliBASE, our new program performs significantly better than the previous version of DIALIGN and is comparable to the standard global aligner CLUSTAL W, though it is outperformed by some newly developed programs that focus on global alignment. On the locally related test sets in IRMBASE, our method outperforms all other programs that we evaluated.

Assuntos

Biologia Computacional/métodos , Alinhamento de Sequência , Algoritmos , Motivos de Aminoácidos , Sequência de Aminoácidos , Inteligência Artificial , Análise por Conglomerados , Simulação por Computador , Sequência Conservada , Interpretação Estatística de Dados , Bases de Dados Genéticas , Bases de Dados de Proteínas , Genoma , Modelos Estatísticos , Técnicas de Sonda Molecular , Dados de Sequência Molecular , Reconhecimento Automatizado de Padrão , Proteínas/química , Análise de Sequência , Análise de Sequência de DNA , Análise de Sequência de Proteína , Homologia de Sequência de Aminoácidos , Software , Design de Software , Validação de Programas de Computador

DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignment.

Subramanian, Amarendran R; Kaufmann, Michael; Morgenstern, Burkhard.

Algorithms Mol Biol ; 3: 6, 2008 May 27.

Artigo em Inglês | MEDLINE | ID: mdl-18505568

RESUMO

BACKGROUND: DIALIGN-T is a reimplementation of the multiple-alignment program DIALIGN. Due to several algorithmic improvements, it produces significantly better alignments on locally and globally related sequence sets than previous versions of DIALIGN. However, like the original implementation of the program, DIALIGN-T uses a a straight-forward greedy approach to assemble multiple alignments from local pairwise sequence similarities. Such greedy approaches may be vulnerable to spurious random similarities and can therefore lead to suboptimal results. In this paper, we present DIALIGN-TX, a substantial improvement of DIALIGN-T that combines our previous greedy algorithm with a progressive alignment approach. RESULTS: Our new heuristic produces significantly better alignments, especially on globally related sequences, without increasing the CPU time and memory consumption exceedingly. The new method is based on a guide tree; to detect possible spurious sequence similarities, it employs a vertex-cover approximation on a conflict graph. We performed benchmarking tests on a large set of nucleic acid and protein sequences For protein benchmarks we used the benchmark database BALIBASE 3 and an updated release of the database IRMBASE 2 for assessing the quality on globally and locally related sequences, respectively. For alignment of nucleic acid sequences, we used BRAliBase II for global alignment and a newly developed database of locally related sequences called DIRM-BASE 1. IRMBASE 2 and DIRMBASE 1 are constructed by implanting highly conserved motives at random positions in long unalignable sequences. CONCLUSION: On BALIBASE3, our new program performs significantly better than the previous program DIALIGN-T and outperforms the popular global aligner CLUSTAL W, though it is still outperformed by programs that focus on global alignment like MAFFT, MUSCLE and T-COFFEE. On the locally related test sets in IRMBASE 2 and DIRM-BASE 1, our method outperforms all other programs while MAFFT E-INSi is the only method that comes close to the performance of DIALIGN-TX.

Multiple sequence alignment with user-defined constraints at GOBICS.

Morgenstern, Burkhard; Werner, Nadine; Prohaska, Sonja J; Steinkamp, Rasmus; Schneider, Isabelle; Subramanian, Amarendran R; Stadler, Peter F; Weyer-Menkhoff, Jan.

Bioinformatics ; 21(7): 1271-3, 2005 Apr 01.

Artigo em Inglês | MEDLINE | ID: mdl-15546937

RESUMO

Most multi-alignment methods are fully automated, i.e. they are based on a fixed set of mathematical rules. For various reasons, such methods may fail to produce biologically meaningful alignments. Herein, we describe a semi-automatic approach to multiple sequence alignment where biological expert knowledge can be used to influence the alignment procedure. The user can specify parts of the sequences that are biologically related to each other; our software program uses these sites as anchor points and creates a multiple alignment respecting these user-defined constraints. By using known functionally, structurally or evolutionarily related positions of the input sequences as anchor points, our method can produce alignments that reflect the true biological relationships among the input sequences more accurately than fully automated procedures can do.

Assuntos

Algoritmos , Proteínas/química , Proteínas/classificação , Alinhamento de Sequência/métodos , Análise de Sequência de Proteína/métodos , Software , Interface Usuário-Computador , Sequência de Aminoácidos , Dados de Sequência Molecular , Proteínas/análise , Homologia de Sequência de Aminoácidos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA