RESUMEN
To verify the genome annotation and to create a resource to functionally characterize the proteome, we attempted to Gateway-clone all predicted protein-encoding open reading frames (ORFs), or the 'ORFeome,' of Caenorhabditis elegans. We successfully cloned approximately 12,000 ORFs (ORFeome 1.1), of which roughly 4,000 correspond to genes that are untouched by any cDNA or expressed-sequence tag (EST). More than 50% of predicted genes needed corrections in their intron-exon structures. Notably, approximately 11,000 C. elegans proteins can now be expressed under many conditions and characterized using various high-throughput strategies, including large-scale interactome mapping. We suggest that similar ORFeome projects will be valuable for other organisms, including humans.
Asunto(s)
Caenorhabditis elegans/genética , Genoma , Empalme Alternativo , Animales , Clonación Molecular , ADN Complementario/genética , ADN de Helmintos/genética , Bases de Datos Genéticas , Exones , Etiquetas de Secuencia Expresada , Expresión Génica , Genes de Helminto , Genómica , Proteínas del Helminto/genética , Humanos , Intrones , Sistemas de Lectura Abierta , Proteoma , ProteómicaRESUMEN
The laboratory rat (Rattus norvegicus) is an indispensable tool in experimental medicine and drug development, having made inestimable contributions to human health. We report here the genome sequence of the Brown Norway (BN) rat strain. The sequence represents a high-quality 'draft' covering over 90% of the genome. The BN rat sequence is the third complete mammalian genome to be deciphered, and three-way comparisons with the human and mouse genomes resolve details of mammalian evolution. This first comprehensive analysis includes genes and proteins and their relation to human disease, repeated sequences, comparative genome-wide studies of mammalian orthologous chromosomal regions and rearrangement breakpoints, reconstruction of ancestral karyotypes and the events leading to existing species, rates of variation, and lineage-specific and lineage-independent evolutionary events such as expansion of gene families, orthology relations and protein evolution.
Asunto(s)
Evolución Molecular , Genoma , Genómica , Ratas Endogámicas BN/genética , Animales , Composición de Base , Centrómero/genética , Cromosomas de los Mamíferos/genética , Islas de CpG/genética , Elementos Transponibles de ADN/genética , ADN Mitocondrial/genética , Duplicación de Gen , Humanos , Intrones/genética , Masculino , Ratones , Modelos Moleculares , Mutagénesis , Polimorfismo de Nucleótido Simple/genética , Sitios de Empalme de ARN/genética , ARN no Traducido/genética , Ratas , Secuencias Reguladoras de Ácidos Nucleicos/genética , Retroelementos/genética , Análisis de Secuencia de ADN , Telómero/genéticaRESUMEN
By integrating functional genomic and proteomic mapping approaches, biological hypotheses should be formulated with increasing levels of confidence. For example, yeast interactome and transcriptome data can be correlated in biologically meaningful ways. Here, we combine interactome mapping data generated for a multicellular organism with data from both large-scale phenotypic analysis ("phenome mapping") and transcriptome profiling. First, we generated a two-hybrid interactome map of the Caenorhabditis elegans germline by using 600 transcripts enriched in this tissue. We compared this map to a phenome map of the germline obtained by RNA interference (RNAi) and to a transcriptome map obtained by clustering worm genes across 553 expression profiling experiments. In this dataset, we find that essential proteins have a tendency to interact with each other, that pairs of genes encoding interacting proteins tend to exhibit similar expression profiles, and that, for approximately 24% of germline interactions, both partners show overlapping embryonic lethal or high incidence of males RNAi phenotypes and similar expression profiles. We propose that these interactions are most likely to be relevant to germline biology. Similar integration of interactome, phenome, and transcriptome data should be possible for other biological processes in the nematode and for other organisms, including humans.
Asunto(s)
Proteínas de Caenorhabditis elegans/genética , Caenorhabditis elegans/genética , Técnicas Genéticas , Transcripción Genética , Animales , Mapeo Cromosómico , Genoma , Sistemas de Lectura Abierta , ProteomaRESUMEN
Aspergillus fumigatus is one of the causes of invasive lung disease in immunocompromised individuals. To rapidly identify genes in this fungus, including potential targets for chemotherapy, diagnostics, and vaccine development, we constructed cDNA libraries. We began with non-normalized libraries, then to improve this approach we constructed a normalized cDNA library using direct cDNA selection. Normalization resulted in a reduction of the frequency of clones with highly expressed genes and an enrichment of underrepresented cDNAs. Expressed sequence tags generated from both the original and the normalized libraries were compared with the genomes of Saccharomyces cerevisiae, Schizosaccharomyces pombe, and Candida albicans, indicating that a large proportion of A. fumigatus genes do not have orthologs in these fungal species. This method allowed the expeditious identification of genes in a fungal pathogen. The same approach can be applied to other human or plant pathogens to rapidly identify genes without the need for genomic sequence information.