Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
FEBS Lett ; 579(14): 3175-82, 2005 Jun 06.
Artículo en Inglés | MEDLINE | ID: mdl-15922334

RESUMEN

A large number of sequences in each newly sequenced genome correspond to lineage and species-specific proteins, also known as ORFans. Amongst these ORFans, a large number are sequences with unknown structures and functions. We have identified a family of sequences, annotated as hypothetical proteins, which are specific to Bacillus and have carried out a computational study aimed at characterizing this family. Fold-recognition methods predict that these sequences belong to the alpha/beta hydrolase fold. We suggest possible catalytic triads for the ORFans and propose a hypothesis regarding the possible families within the alpha/beta hydrolase superfamily to which they may belong.


Asunto(s)
Bacillus/enzimología , Hidrolasas/química , Hidrolasas/clasificación , Secuencia de Aminoácidos , Bacillus/genética , Catálisis , Biología Computacional , Hidrolasas/genética , Hidrolasas/metabolismo , Modelos Moleculares , Datos de Secuencia Molecular , Pliegue de Proteína , Estructura Terciaria de Proteína , Alineación de Secuencia , Especificidad de la Especie , Relación Estructura-Actividad
2.
J Mol Biol ; 342(2): 369-73, 2004 Sep 10.
Artículo en Inglés | MEDLINE | ID: mdl-15327940

RESUMEN

Genomic ORFans are orphan open reading frames (ORFs) with no significant sequence similarity to other ORFs. ORFans comprise 20-30% of the ORFs of most completely sequenced genomes. Because nothing can be learnt about ORFans via sequence homology, the functions and evolutionary origins of ORFans remain a mystery. Furthermore, because relatively few ORFans have been experimentally characterized, it has been suggested that most ORFans are not likely to correspond to functional, expressed proteins, but rather to spurious ORFs, pseudo-genes or to rapidly evolving proteins with non-essential roles. As a snapshot view of current ORFan structural studies, we searched for ORFans among proteins whose three-dimensional structures have been recently determined. We find that functional and structural studies of ORFans are not as underemphasized as previously suggested. These recently determined structures correspond to ORFans from all Kingdoms of life, and include proteins that have previously been functionally characterized, as well as structural genomics targets of unknown function labeled as "hypothetical proteins". This suggests that many of the ORFans in the databases are likely to correspond to expressed, functional (and even essential) proteins. Furthermore, the recently determined structures include examples of the various types of ORFans, suggesting that the functions and evolutionary origins of ORFans are diverse. Although this survey sheds some light on the ORFan mystery, further experimental studies are required to gain a better understanding of the role and origins of the tens of thousands of ORFans awaiting characterization.


Asunto(s)
Evolución Molecular , Sistemas de Lectura Abierta/genética , Estructura Terciaria de Proteína , Análisis de Secuencia de ADN
3.
Nucleic Acids Res ; 32(Database issue): D281-3, 2004 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-14681413

RESUMEN

As each newly sequenced genome contains a significant number of protein-coding ORFs that are species-, family- or lineage-specific, many interesting questions arise about the evolution and role of these ORFs and of the genomes they are part of. We refer to these poorly conserved ORFs as singleton or paralogous ORFans if they are unique to one genome, or as orthologous ORFans if they appear only in a family of closely related organisms and have no homolog in other genomes. In order to study and classify ORFans we have constructed the ORFanage, an ORFan database. This database consists of the predicted ORFs in fully sequenced microbial genomes, and enables searching for the three types of ORFans in any subset of the genomes chosen by the user. The ORFanage could help in choosing interesting targets for further genomic and evolutionary studies. The ORFanage is accessible via http://www.bioinformatics.buffalo. edu/ORFanage.


Asunto(s)
Bases de Datos Genéticas , Sistemas de Lectura Abierta , Biología Computacional , Evolución Molecular , Genes Bacterianos/genética , Internet , Homología de Secuencia , Especificidad de la Especie
4.
Proteins ; 53(2): 241-51, 2003 Nov 01.
Artículo en Inglés | MEDLINE | ID: mdl-14517975

RESUMEN

Singleton sequence ORFans are orphan ORFs (open reading frames) that have no detectable sequence similarity to any other sequence in the databases. ORFans are of particular interest not only as evolutionary puzzles but also because we can learn little about them using bioinformatics tools. Here, we present a first systematic analysis of singleton ORFans in the first 60 fully sequenced microbial genomes. We show that although ORFans have been underemphasized, the number of ORFans is steadily growing, currently accounting for 23,634 sequences. At the same time, the percentage of ORFans as a fraction of all sequences is slowly diminishing, and is currently about 14%. Short ORFans comprise about 61% of all ORFans. The abundance of short ORFans may be due to a yet unexplained artifact. The data also suggest that the number of longer ORFans may soon diminish as more genomes of closely related organisms become available. To better address the questions about the functions and origins of ORFans, we propose to focus further studies on the longer ORFans, with emphasis on three new types of ORFans: ORFan modules, paralogous ORFans, and orthologous ORFans. We conclude that the large number of ORFans reflects an intrinsic property of the genetic material not yet fully understood. Further computational and experimental studies aimed at understanding Nature's protein diversity should also include ORFans.


Asunto(s)
Genoma , Sistemas de Lectura Abierta , Genoma Arqueal , Genoma Bacteriano , Genoma Fúngico , Saccharomyces cerevisiae/genética , Análisis de Secuencia de ADN
5.
Structure ; 11(1): 7-9, 2003 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-12517334

RESUMEN

The genomes of most newly sequenced organisms contain a significant fraction of ORFs (open reading frames) that match no other sequence in the databases. We refer to these singleton ORFs as sequence ORFans. Because little can be learned about ORFans by homology, the origin and functions of ORFans remain a mystery. However, in this era of full genome sequencing, it seems that ORFans have been underemphasized. In this minireview, we draw attention to the increasing number of ORFans and to the consequences of this growth to biological research in the postgenomic era.


Asunto(s)
Genoma Bacteriano , Genoma Fúngico , Sistemas de Lectura Abierta , Animales , Proteínas/clasificación , Proteínas/genética
6.
Comp Funct Genomics ; 4(4): 432-41, 2003.
Artículo en Inglés | MEDLINE | ID: mdl-18629076

RESUMEN

ORFans are open reading frames (ORFs) with no detectable sequence similarity to any other sequence in the databases. Each newly sequenced genome contains a significant number of ORFans. Therefore, ORFans entail interesting evolutionary puzzles. However, little can be learned about them using bioinformatics tools, and their study seems to have been underemphasized. Here we present some of the questions that the existence of so many ORFans have raised and review some of the studies aimed at understanding ORFans, their functions and their origins. These works have demonstrated that ORFans are an untapped source of research, requiring further computational and experimental studies.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA