Search | BVS Enfermagem Research Portal

Insights into evolution of multicellular fungi from the assembled chromosomes of the mushroom Coprinopsis cinerea (Coprinus cinereus).

Stajich, Jason E; Wilke, Sarah K; Ahrén, Dag; Au, Chun Hang; Birren, Bruce W; Borodovsky, Mark; Burns, Claire; Canbäck, Björn; Casselton, Lorna A; Cheng, C K; Deng, Jixin; Dietrich, Fred S; Fargo, David C; Farman, Mark L; Gathman, Allen C; Goldberg, Jonathan; Guigó, Roderic; Hoegger, Patrick J; Hooker, James B; Huggins, Ashleigh; James, Timothy Y; Kamada, Takashi; Kilaru, Sreedhar; Kodira, Chinnapa; Kües, Ursula; Kupfer, Doris; Kwan, H S; Lomsadze, Alexandre; Li, Weixi; Lilly, Walt W; Ma, Li-Jun; Mackey, Aaron J; Manning, Gerard; Martin, Francis; Muraguchi, Hajime; Natvig, Donald O; Palmerini, Heather; Ramesh, Marilee A; Rehmeyer, Cathy J; Roe, Bruce A; Shenoy, Narmada; Stanke, Mario; Ter-Hovhannisyan, Vardges; Tunlid, Anders; Velagapudi, Rajesh; Vision, Todd J; Zeng, Qiandong; Zolan, Miriam E; Pukkila, Patricia J.

Proc Natl Acad Sci U S A ; 107(26): 11889-94, 2010 Jun 29.

Article in English | MEDLINE | ID: mdl-20547848

ABSTRACT

The mushroom Coprinopsis cinerea is a classic experimental model for multicellular development in fungi because it grows on defined media, completes its life cycle in 2 weeks, produces some 10^8 synchronized meiocytes, and can be manipulated at all stages in development by mutation and transformation. The 37-megabase genome of C. cinerea was sequenced and assembled into 13 chromosomes. Meiotic recombination rates vary greatly along the chromosomes, and retrotransposons are absent in large regions of the genome with low levels of meiotic recombination. Single-copy genes with identifiable orthologs in other basidiomycetes are predominant in low-recombination regions of the chromosome. In contrast, paralogous multicopy genes are found in the highly recombining regions, including a large family of protein kinases (FunK1) unique to multicellular fungi. Analyses of P450 and hydrophobin gene families confirmed that local gene duplications drive the expansions of paralogous copies and the expansions occur in independent lineages of Agaricomycotina fungi. Gene-expression patterns from microarrays were used to dissect the transcriptional program of dikaryon formation (mating). Several members of the FunK1 kinase family are differentially regulated during sexual morphogenesis, and coordinate regulation of adjacent duplications is rare. The genomes of C. cinerea and Laccaria bicolor, a symbiotic basidiomycete, share extensive regions of synteny. The largest syntenic blocks occur in regions with low meiotic recombination rates, no transposable elements, and tight gene spacing, where orthologous single-copy genes are overrepresented. The chromosome assembly of C. cinerea is an essential resource in understanding the evolution of multicellularity in the fungi.

Subjects

Fungal Chromosomes/genetics, Coprinus/genetics, Molecular Evolution, Base Sequence, Chromosome Mapping, Coprinus/cytology, Coprinus/growth & development, Cytochrome P-450 Enzyme System/genetics, DNA Primers/genetics, Fungal Proteins/genetics, Gene Duplication, Fungal Genome, Meiosis/genetics, Molecular Sequence Data, Multigene Family, Phylogeny, Protein Kinases/genetics, Fungal RNA/genetics, Genetic Recombination, Retroelements/genetics

Gene identification in novel eukaryotic genomes by self-training algorithm.

Lomsadze, Alexandre; Ter-Hovhannisyan, Vardges; Chernoff, Yury O; Borodovsky, Mark.

Nucleic Acids Res ; 33(20): 6494-506, 2005.

Article in English | MEDLINE | ID: mdl-16314312

ABSTRACT

Finding new protein-coding genes is one of the most important goals of eukaryotic genome sequencing projects. However, genomic organization of novel eukaryotic genomes is diverse, and ab initio gene finding tools tuned for previously studied species are rarely suitable for efficacious gene hunting in DNA sequences of a new genome. Gene identification methods based on cDNA and expressed sequence tag (EST) mapping to genomic DNA, or those using alignments to closely related genomes, rely on the existence of abundant cDNA and EST data and/or the availability of reference genomes. Conventional statistical ab initio methods require large training sets of validated genes for estimating gene model parameters. In practice, none of these types of data may be available in sufficient amount until rather late stages of novel genome sequencing. Nevertheless, we have shown that gene finding in eukaryotic genomes can be carried out in parallel with estimation of statistical models directly from as yet anonymous genomic DNA. The suggested method of parallelizing gene prediction with model parameter estimation follows the path of iterative Viterbi training. Rounds of genomic sequence labeling into coding and non-coding regions are followed by rounds of model parameter estimation. Several dynamically changing restrictions on the possible range of model parameters are added to filter out fluctuations in the initial steps of the algorithm that could redirect the iteration process away from the biologically relevant point in parameter space. Tests on well-studied eukaryotic genomes have shown that the new method performs comparably to or better than conventional methods in which supervised model training precedes the gene prediction step. Several novel genomes have been analyzed and biologically interesting findings are discussed. Thus, a self-training algorithm that had been assumed feasible only for prokaryotic genomes has now been developed for ab initio eukaryotic gene identification.
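
The alternation described in the abstract, labeling the sequence and then re-estimating the model from that labeling, can be illustrated with a toy example. The sketch below is not the published method: it assumes a two-state HMM (coding, noncoding) with single-nucleotide emissions, and it omits the dynamically changing parameter restrictions the abstract mentions. All names (`viterbi`, `reestimate`, `viterbi_training`) and the pseudocount are hypothetical choices made for illustration.

```python
import math

STATES = ("coding", "noncoding")
ALPHABET = "ACGT"


def viterbi(seq, emit, trans, start):
    """Most likely state path for seq; all parameters are log-probabilities."""
    score = [{s: start[s] + emit[s][seq[0]] for s in STATES}]
    back = []
    for ch in seq[1:]:
        row, ptr = {}, {}
        for s in STATES:
            # Best predecessor state for landing in state s at this position.
            prev = max(STATES, key=lambda p: score[-1][p] + trans[p][s])
            row[s] = score[-1][prev] + trans[prev][s] + emit[s][ch]
            ptr[s] = prev
        score.append(row)
        back.append(ptr)
    # Trace back from the best final state.
    state = max(STATES, key=lambda s: score[-1][s])
    path = [state]
    for ptr in reversed(back):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path


def _log_normalize(counts):
    total = sum(counts.values())
    return {k: math.log(v / total) for k, v in counts.items()}


def reestimate(seq, path, pseudo=1.0):
    """Re-estimate log emissions and transitions from a hard labeling."""
    emit_counts = {s: {c: pseudo for c in ALPHABET} for s in STATES}
    trans_counts = {s: {t: pseudo for t in STATES} for s in STATES}
    for ch, s in zip(seq, path):
        emit_counts[s][ch] += 1
    for a, b in zip(path, path[1:]):
        trans_counts[a][b] += 1
    emit = {s: _log_normalize(emit_counts[s]) for s in STATES}
    trans = {s: _log_normalize(trans_counts[s]) for s in STATES}
    return emit, trans


def viterbi_training(seq, emit, trans, start, max_rounds=20):
    """Alternate labeling and re-estimation until the labeling stops changing."""
    path = None
    for _ in range(max_rounds):
        new_path = viterbi(seq, emit, trans, start)
        if new_path == path:
            break
        path = new_path
        emit, trans = reestimate(seq, path)
    return path, emit, trans
```

Starting from weakly informative initial parameters (for example, a slight GC preference in the coding state), each round sharpens the model using only the unlabeled sequence itself. A realistic gene finder would use a much richer state structure (exons, introns, splice sites) than this two-state toy.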

Subjects

Algorithms, Genes, Genomics/methods, Animals, Exons, Genome, Markov Chains, Phylogeny, Reproducibility of Results

Gene prediction in novel fungal genomes using an ab initio algorithm with unsupervised training.

Ter-Hovhannisyan, Vardges; Lomsadze, Alexandre; Chernoff, Yury O; Borodovsky, Mark.

Genome Res ; 18(12): 1979-90, 2008 Dec.

Article in English | MEDLINE | ID: mdl-18757608

ABSTRACT

We describe a new ab initio algorithm, GeneMark-ES version 2, that identifies protein-coding genes in fungal genomes. The algorithm does not require a predetermined training set to estimate parameters of the underlying hidden Markov model (HMM). Instead, the anonymous genomic sequence in question is used as an input for iterative unsupervised training. The algorithm extends our previously developed method tested on genomes of Arabidopsis thaliana, Caenorhabditis elegans, and Drosophila melanogaster. To better reflect features of fungal gene organization, we enhanced the intron submodel to accommodate sequences with and without branch point sites. This design enables the algorithm to work equally well for species with the kinds of variations in splicing mechanisms seen in the fungal phyla Ascomycota, Basidiomycota, and Zygomycota. Upon self-training, the intron submodel switches on in several steps to reach its full complexity. We demonstrate that the algorithm's accuracy, at both the exon and the whole-gene level, compares favorably with the accuracy of gene finders that employ supervised training. Application of the new method to known fungal genomes indicates substantial improvement over existing annotations. By eliminating the effort necessary to build comprehensive training sets, the new algorithm can streamline and accelerate the process of annotation in a large number of fungal genome sequencing projects.

Subjects

Algorithms, Fungal Genes, Fungal Genome, Predictive Value of Tests, Teaching/methods, Expressed Sequence Tags, Introns, DNA Sequence Analysis
