Comprehensive aligned sequence construction for automated design of effective probes (CASCADE-P) using 16S rDNA.
Bioinformatics
; 19(12): 1461-8, 2003 Aug 12.
Article
en En
| MEDLINE
| ID: mdl-12912825
MOTIVATION: Prokaryotic organisms have been identified utilizing the sequence variation of the 16S rRNA gene. Variations steer the design of DNA probes for the detection of taxonomic groups or specific organisms. The long-term goal of our project is to create probe arrays capable of identifying 16S rDNA sequences in unknown samples. This necessitated the authentication, categorization and alignment of the >75 000 publicly available '16S' sequences. Preferably, the entire process should be computationally administrated so the aligned collection could periodically absorb 16S rDNA sequences from the public records. A complete multiple sequence alignment would provide a foundation for computational probe selection and facilitates microbial taxonomy and phylogeny. RESULTS: Here we report the alignment and similarity clustering of 62 662 16S rDNA sequences and an approach for designing effective probes for each cluster. A novel alignment compression algorithm, NAST (Nearest Alignment Space Termination), was designed to produce the uniform multiple sequence alignment referred to as the prokMSA. From the prokMSA, 9020 Operational Taxonomic Units (OTUs) were found based on transitive sequence similarities. An automated approach to probe design was straightforward using the prokMSA clustered into OTUs. As a test case, multiple probes were computationally picked for each of the 27 OTUs that were identified within the Staphylococcus Group. The probes were incorporated into a customized microarray and were able to correctly categorize Staphylococcus aureus and Bacillus anthracis into their correct OTUs. Although a successful probe picking strategy is outlined, the main focus of creating the prokMSA was to provide a comprehensive, categorized, updateable 16S rDNA collection useful as a foundation for any probe selection algorithm.
Buscar en Google
Colección:
01-internacional
Base de datos:
MEDLINE
Asunto principal:
Algoritmos
/
ARN Ribosómico 16S
/
Sondas de ADN
/
Alineación de Secuencia
/
Análisis de Secuencia de ADN
/
Análisis de Secuencia por Matrices de Oligonucleótidos
/
Diseño de Equipo
Tipo de estudio:
Evaluation_studies
Idioma:
En
Revista:
Bioinformatics
Asunto de la revista:
INFORMATICA MEDICA
Año:
2003
Tipo del documento:
Article
País de afiliación:
Estados Unidos
Pais de publicación:
Reino Unido