Pattern recognition in DNA sequences and its application to consensus foot-printing.

Lefèvre, C; Ikeda, J E

Lefèvre, C; Ikeda, J E.

Afiliação

Lefèvre C; Genosphere Project, ERATO, JRDC, Tokai University School of Medicine, Kanagawa, Japan.

Comput Appl Biosci ; 9(3): 349-54, 1993 Jun.

Article em En | MEDLINE | ID: mdl-8324636

ABSTRACT

ABSTRACT

We consider the problem of comparing several nucleic acid sequences to identify words occurring imperfectly (patterns with no gap) with unusual frequency. Methods for computing, representing, and inspecting interactively the structure of such repeating motifs in nucleic acids and more generally any text are described. Multiple sequences are treated as one large concatenate. In a preprocessing step, a lexical index is created to provide rapid string matching for the enumeration of the words matching a pattern. For given word features (word length, minimal frequency), a sequence profile is displayed. The profile can be inspected interactively with on-line algorithms. Applications to the identification of regulatory elements in DNA regions involved in the control of gene expression are presented. Our program ('DNA-Lexemics') runs on the Macintosh.

Assuntos

Reconhecimento Automatizado de Padrão; Análise de Sequência de DNA; Software; Algoritmos; Sequência de Aminoácidos; Dados de Sequência Molecular; Sistemas On-Line

Buscar no Google

Imprimir

XML

PubMed Links

Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Software / Reconhecimento Automatizado de Padrão / Análise de Sequência de DNA Idioma: En Revista: Comput Appl Biosci Assunto da revista: BIOLOGIA / ENGENHARIA BIOMEDICA Ano de publicação: 1993 Tipo de documento: Article País de afiliação: Japão

Buscar no Google

Imprimir

XML

PubMed Links