Your browser doesn't support javascript.
loading
IPEV: identification of prokaryotic and eukaryotic virus-derived sequences in virome using deep learning.
Yin, Hengchuang; Wu, Shufang; Tan, Jie; Guo, Qian; Li, Mo; Guo, Jinyuan; Wang, Yaqi; Jiang, Xiaoqing; Zhu, Huaiqiu.
Afiliação
  • Yin H; Department of Biomedical Engineering, College of Future Technology, and Center for Quantitative Biology, Peking University, Beijing 100871, China.
  • Wu S; Department of Biomedical Engineering, College of Future Technology, and Center for Quantitative Biology, Peking University, Beijing 100871, China.
  • Tan J; Department of Biomedical Engineering, College of Future Technology, and Center for Quantitative Biology, Peking University, Beijing 100871, China.
  • Guo Q; Department of Biomedical Engineering, College of Future Technology, and Center for Quantitative Biology, Peking University, Beijing 100871, China.
  • Li M; Department of Biomedical Engineering, College of Future Technology, and Center for Quantitative Biology, Peking University, Beijing 100871, China.
  • Guo J; School of Life Sciences, Peking University, Beijing 100871, China.
  • Wang Y; Department of Biomedical Engineering, College of Future Technology, and Center for Quantitative Biology, Peking University, Beijing 100871, China.
  • Jiang X; Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA 30332, USA.
  • Zhu H; Department of Biomedical Engineering, College of Future Technology, and Center for Quantitative Biology, Peking University, Beijing 100871, China.
Gigascience ; 132024 01 02.
Article em En | MEDLINE | ID: mdl-38649300
ABSTRACT

BACKGROUND:

The virome obtained through virus-like particle enrichment contains a mixture of prokaryotic and eukaryotic virus-derived fragments. Accurate identification and classification of these elements are crucial to understanding their roles and functions in microbial communities. However, the rapid mutation rates of viral genomes pose challenges in developing high-performance tools for classification, potentially limiting downstream analyses.

FINDINGS:

We present IPEV, a novel method to distinguish prokaryotic and eukaryotic viruses in viromes, with a 2-dimensional convolutional neural network combining trinucleotide pair relative distance and frequency. Cross-validation assessments of IPEV demonstrate its state-of-the-art precision, significantly improving the F1-score by approximately 22% on an independent test set compared to existing methods when query viruses share less than 30% sequence similarity with known viruses. Furthermore, IPEV outperforms other methods in accuracy on marine and gut virome samples based on annotations by sequence alignments. IPEV reduces runtime by at most 1,225 times compared to existing methods under the same computing configuration. We also utilized IPEV to analyze longitudinal samples and found that the gut virome exhibits a higher degree of temporal stability than previously observed in persistent personal viromes, providing novel insights into the resilience of the gut virome in individuals.

CONCLUSIONS:

IPEV is a high-performance, user-friendly tool that assists biologists in identifying and classifying prokaryotic and eukaryotic viruses within viromes. The tool is available at https//github.com/basehc/IPEV.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Vírus / Aprendizado Profundo / Viroma Limite: Humans Idioma: En Revista: Gigascience Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Vírus / Aprendizado Profundo / Viroma Limite: Humans Idioma: En Revista: Gigascience Ano de publicação: 2024 Tipo de documento: Article País de afiliação: China