Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
Más filtros

Bases de datos
Tipo del documento
Asunto de la revista
País de afiliación
Intervalo de año de publicación
1.
Nucleic Acids Res ; 37(3): 939-44, 2009 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-19106141

RESUMEN

To represent the sequence specificity of transcription factors, the position weight matrix (PWM) is widely used. In most cases, each element is defined as a log likelihood ratio of a base appearing at a certain position, which is estimated from a finite number of known binding sites. To avoid bias due to this small sample size, a certain numeric value, called a pseudocount, is usually allocated for each position, and its fraction according to the background base composition is added to each element. So far, there has been no consensus on the optimal pseudocount value. In this study, we simulated the sampling process by artificially generating binding sites based on observed nucleotide frequencies in a public PWM database, and then the generated matrix with an added pseudocount value was compared to the original frequency matrix using various measures. Although the results were somewhat different between measures, in many cases, we could find an optimal pseudocount value for each matrix. These optimal values are independent of the sample size and are clearly correlated with the entropy of the original matrices, meaning that larger pseudocount vales are preferable for less conserved binding sites. As a simple representative, we suggest the value of 0.8 for practical uses.


Asunto(s)
Elementos Reguladores de la Transcripción , Análisis de Secuencia de ADN/métodos , Factores de Transcripción/metabolismo , Sitios de Unión , Tamaño de la Muestra
2.
Nature ; 428(6983): 653-7, 2004 Apr 08.
Artículo en Inglés | MEDLINE | ID: mdl-15071595

RESUMEN

Small, compact genomes of ultrasmall unicellular algae provide information on the basic and essential genes that support the lives of photosynthetic eukaryotes, including higher plants. Here we report the 16,520,305-base-pair sequence of the 20 chromosomes of the unicellular red alga Cyanidioschyzon merolae 10D as the first complete algal genome. We identified 5,331 genes in total, of which at least 86.3% were expressed. Unique characteristics of this genomic structure include: a lack of introns in all but 26 genes; only three copies of ribosomal DNA units that maintain the nucleolus; and two dynamin genes that are involved only in the division of mitochondria and plastids. The conserved mosaic origin of Calvin cycle enzymes in this red alga and in green plants supports the hypothesis of the existence of single primary plastid endosymbiosis. The lack of a myosin gene, in addition to the unexpressed actin gene, suggests a simpler system of cytokinesis. These results indicate that the C. merolae genome provides a model system with a simple gene composition for studying the origin, evolution and fundamental mechanisms of eukaryotic cells.


Asunto(s)
Genoma , Rhodophyta/genética , Actinas/genética , Proteínas Algáceas/clasificación , Proteínas Algáceas/genética , Núcleo Celular/genética , Cromosomas/genética , ADN Mitocondrial/genética , ADN Ribosómico/genética , Evolución Molecular , Genómica , Intrones/genética , Datos de Secuencia Molecular , Plastidios/genética , Plastidios/fisiología , Rhodophyta/citología , Análisis de Secuencia de ADN
3.
BMC Genomics ; 10: 358, 2009 Aug 05.
Artículo en Inglés | MEDLINE | ID: mdl-19656368

RESUMEN

BACKGROUND: Streptococcus mutans is the major pathogen of dental caries, and it occasionally causes infective endocarditis. While the pathogenicity of this species is distinct from other human pathogenic streptococci, the species-specific evolution of the genus Streptococcus and its genomic diversity are poorly understood. RESULTS: We have sequenced the complete genome of S. mutans serotype c strain NN2025, and compared it with the genome of UA159. The NN2025 genome is composed of 2,013,587 bp, and the two strains show highly conserved core-genome. However, comparison of the two S. mutans strains showed a large genomic inversion across the replication axis producing an X-shaped symmetrical DNA dot plot. This phenomenon was also observed between other streptococcal species, indicating that streptococcal genetic rearrangements across the replication axis play an important role in Streptococcus genetic shuffling. We further confirmed the genomic diversity among 95 clinical isolates using long-PCR analysis. Genomic diversity in S. mutans appears to occur frequently between insertion sequence (IS) elements and transposons, and these diversity regions consist of restriction/modification systems, antimicrobial peptide synthesis systems, and transporters. S. mutans may preferentially reject the phage infection by clustered regularly interspaced short palindromic repeats (CRISPRs). In particular, the CRISPR-2 region, which is highly divergent between strains, in NN2025 has long repeated spacer sequences corresponding to the streptococcal phage genome. CONCLUSION: These observations suggest that S. mutans strains evolve through chromosomal shuffling and that phage infection is not needed for gene acquisition. In contrast, S. pyogenes tolerates phage infection for acquisition of virulence determinants for niche adaptation.


Asunto(s)
Cromosomas Bacterianos , Genoma Bacteriano , Streptococcus mutans/genética , Bacitracina/biosíntesis , Secuencia de Bases , Familia de Multigenes , Especificidad de la Especie , Streptococcus mutans/metabolismo
4.
J Mol Evol ; 58(3): 291-303, 2004 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-15045484

RESUMEN

The ancestors of plastids and mitochondria were once free-living bacteria that became organelles as a result of endosymbiosis. According to this theory, a key bacterial division protein, FtsZ, plays a role in plastid division in algae and plants as well as in mitochondrial division in lower eukaryotes. Recent studies have shown that organelle division is a process that combines features derived from the bacterial division system with features contributed by host eukaryotic cells. Two nonredundant versions of FtsZ, FtsZ1 and FtsZ2, have been identified in green-lineage plastids, whereas most bacteria have a single ftsZ gene. To examine whether there is also more than one type of FtsZ in red-lineage chloroplasts (red algal chloroplasts and chloroplasts that originated from the secondary endosymbiosis of red algae) and in mitochondria, we obtained FtsZ sequences from the complete sequence of the primitive red alga Cyanidioschyzon merolae and the draft sequence of the stramenopile (heterokont) Thalassiosira pseudonana. Phylogenetic analyses that included known FtsZ proteins identified two types of chloroplast FtsZ in red algae (FtsZA and FtsZB) and stramenopiles (FtsZA and FtsZC). These analyses also showed that FtsZB emerged after the red and green lineages diverged, while FtsZC arose by the duplication of an ftsZA gene that in turn descended from a red alga engulfed by the ancestor of stramenopiles. A comparison of the predicted proteins showed that like bacterial FtsZ and green-lineage FtsZ2, FtsZA has a short conserved C-termmal sequence (the C-terminal core domain), whereas FtsZB and FtsZC, like the green-lineage FtsZ1, lack this sequence. In addition, the Cyanidioschyzon and Dictyostelium genomes encode two types of mitochondrial FtsZ proteins, one of which lacks the C-terminal variable domain. These results suggest that the acquisition of an additional FtsZ protein with a modified C terminus was common to the primary and secondary endosymbioses that produced plastids and that this also occurred during the establishment of mitochondria, presumably to regulate the multiplication of these organelles.


Asunto(s)
Proteínas Algáceas/genética , ADN de Cloroplastos/genética , Diatomeas/genética , Filogenia , Rhodophyta/genética , Secuencia de Aminoácidos , Proteínas Bacterianas/genética , Secuencia de Bases , Análisis por Conglomerados , Proteínas del Citoesqueleto/genética , Cartilla de ADN , ADN Mitocondrial/genética , Evolución Molecular , Componentes del Gen , Funciones de Verosimilitud , Modelos Genéticos , Datos de Secuencia Molecular , Familia de Multigenes/genética , Orgánulos/genética , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa , Alineación de Secuencia , Análisis de Secuencia de ADN
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA