Your browser doesn't support javascript.
loading
Identifying genes within pathways in unannotated genomes with PaGeSearch.
Won, Sohyoung; Yu, Jaewoong; Kim, Heebal.
Afiliação
  • Won S; Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea, 08826.
  • Yu J; eGnome, Incorporated, Seoul, Republic of Korea, 05836.
  • Kim H; eGnome, Incorporated, Seoul, Republic of Korea, 05836.
Genome Res ; 34(5): 784-795, 2024 06 25.
Article em En | MEDLINE | ID: mdl-38858086
ABSTRACT
In biological research, the identification and comparison of genes within specific pathways across the genomes of various species are invaluable. However, annotating the entire genome is resource intensive, and sequence similarity searches often yield results that are not actually genes. To address these limitations, we introduce Pathway Gene Search (PaGeSearch), a tool designed to identify genes from predefined lists, especially those in specific pathways, within genomes. The tool uses an initial sequence similarity search to identify relevant genomic regions, followed by targeted gene prediction and neural network-based result filtering. PaGeSearch suggests the regions that are most likely the orthologs of the genes in the query and is designed to be applicable for species within five classes mammals, fish, birds, eudicotyledons, and Liliopsida. Compared with GeMoMa and miniprot, PaGeSearch generally outperforms in terms of sensitivity and positive predictive value, as well as negative predictive value. Also, the exon coverage of gene models from PaGeSearch is higher compared with those in GeMoMa and miniprot. Although its performance shows increased variability when applied to actual biological pathways, it nonetheless maintains an acceptable level of accuracy. Evaluating PaGeSearch across different assembly levels, chromosome, scaffold, and contig shows minimal variation in outcomes, indicating that PaGeSearch is resilient to variations in assembly quality.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Genoma Limite: Animals / Humans Idioma: En Revista: Genome Res Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Genoma Limite: Animals / Humans Idioma: En Revista: Genome Res Ano de publicação: 2024 Tipo de documento: Article