RESUMO
The genetic code is degenerate. Each amino acid is encoded by up to six synonymous codons; the choice between these codons influences gene expression. Here, we show that in coding sequences, once a particular codon has been used, subsequent occurrences of the same amino acid do not use codons randomly, but favor codons that use the same tRNA. The effect is pronounced in rapidly induced genes, involves both frequent and rare codons and diminishes only slowly as a function of the distance between subsequent synonymous codons. Furthermore, we found that in S. cerevisiae codon correlation accelerates translation relative to the translation of synonymous yet anticorrelated sequences. The data suggest that tRNA diffusion away from the ribosome is slower than translation, and that some tRNA channeling takes place at the ribosome. They also establish that the dynamics of translation leave a significant signature at the level of the genome.
Assuntos
Códon/metabolismo , Biossíntese de Proteínas , RNA de Transferência/metabolismo , Saccharomyces cerevisiae/genética , Aminoácidos/metabolismo , RNA Mensageiro/metabolismo , Ribossomos/metabolismo , Saccharomyces cerevisiae/citologia , Saccharomyces cerevisiae/metabolismoRESUMO
We present an algorithm for predicting transcription factor binding sites based on ChIP-chip and phylogenetic footprinting data. Our algorithm is robust against low promoter sequence similarity and motif rearrangements, because it does not depend on multiple sequence alignments. This, in turn, allows us to incorporate information from more distant species. Representative random data sets are used to estimate the score significance. Our algorithm is fully automatic, and does not require human intervention. On a recent S. cerevisiae data set, it achieves higher accuracy than the previously best algorithms. Adaptive ChIP-chip threshold and the modular positional bias score are two general features of our algorithm that increase motif prediction accuracy and could be implemented in other algorithms as well. In addition, since our algorithm works partly orthogonally to other algorithms, combining several algorithms can increase prediction accuracy even further. Specifically, our method finds 6 motifs not found by the 2nd best algorithm.
Assuntos
Algoritmos , DNA/química , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Mapeamento de Peptídeos/métodos , Análise de Sequência de DNA/métodos , Fatores de Transcrição/química , Fatores de Transcrição/genética , Sequência de Bases , Sítios de Ligação , DNA/genética , Dados de Sequência Molecular , Filogenia , Ligação ProteicaRESUMO
In this paper, we wanted to test whether it is possible to use genetical genomics information such as expression quantitative trait loci (eQTL) mapping results as input to a transcription factor binding site (TFBS) prediction algorithm. Furthermore, this new approach was compared to the more traditional cluster based TFBS prediction. The results of eQTL mapping are used as input to one of the top ranking TFBS prediction algorithms. Genes with observed expression profiles showing the same eQTL region are collected into eQTL groups. The promoter sequences of all the genes within the same eQTL group are used as input in the transcription factor binding site search. This approach is tested with a real data set of a recombinant inbred line population of Arabidopsis thaliana. The predicted motifs are compared to results obtained from the conventional approach of first clustering the gene expression values and then using the promoter sequences of the genes within the same cluster as input for the transcription factor binding site prediction. Our eQTL based approach produced different motifs compared to the cluster based method. Furthermore the score of the eQTL based motifs was higher than the score of the cluster based motifs. In a comparison to already predicted motifs from the AtcisDB database, the eQTL based and the cluster based method produced about the same number of hits with binding sites from AtcisDB. In conclusion, the results of this study clearly demonstrate the usefulness of eQTL to predict transcription factor binding sites.