Your browser doesn't support javascript.
loading
Virus classification for viral genomic fragments using PhaGCN2.
Jiang, Jing-Zhe; Yuan, Wen-Guang; Shang, Jiayu; Shi, Ying-Hui; Yang, Li-Ling; Liu, Min; Zhu, Peng; Jin, Tao; Sun, Yanni; Yuan, Li-Hong.
Affiliation
  • Jiang JZ; Key Laboratory of South China Sea Fishery Resources Exploitation & Utilization, Ministry of Agriculture and Rural Affairs, South China Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Guangzhou 510300, Guangdong, China.
  • Yuan WG; Guangdong Province Key Laboratory for Biotechnology Drug Candidates, School of Biosciences and Biopharmaceutics, Guangdong Pharmaceutical University, Guangzhou 510006, Guangdong, China.
  • Shang J; College of Fisheries and Life Science, Shanghai Ocean University, Shanghai 201306, China.
  • Shi YH; Tianjin Agricultural University, Tianjin 300384, China.
  • Yang LL; Guangdong Province Key Laboratory for Biotechnology Drug Candidates, School of Biosciences and Biopharmaceutics, Guangdong Pharmaceutical University, Guangzhou 510006, Guangdong, China.
  • Liu M; Department of Electrical Engineering, City University of Hong Kong, Hong Kong (SAR), China.
  • Zhu P; Guangdong Province Key Laboratory for Biotechnology Drug Candidates, School of Biosciences and Biopharmaceutics, Guangdong Pharmaceutical University, Guangzhou 510006, Guangdong, China.
  • Jin T; Tianjin Agricultural University, Tianjin 300384, China.
  • Sun Y; College of Fisheries and Life Science, Shanghai Ocean University, Shanghai 201306, China.
  • Yuan LH; College of Fisheries and Life Science, Shanghai Ocean University, Shanghai 201306, China.
Brief Bioinform ; 24(1)2023 01 19.
Article in En | MEDLINE | ID: mdl-36464489
ABSTRACT
Viruses are the most ubiquitous and diverse entities in the biome. Due to the rapid growth of newly identified viruses, there is an urgent need for accurate and comprehensive virus classification, particularly for novel viruses. Here, we present PhaGCN2, which can rapidly classify the taxonomy of viral sequences at the family level and supports the visualization of the associations of all families. We evaluate the performance of PhaGCN2 and compare it with the state-of-the-art virus classification tools, such as vConTACT2, CAT and VPF-Class, using the widely accepted metrics. The results show that PhaGCN2 largely improves the precision and recall of virus classification, increases the number of classifiable virus sequences in the Global Ocean Virome dataset (v2.0) by four times and classifies more than 90% of the Gut Phage Database. PhaGCN2 makes it possible to conduct high-throughput and automatic expansion of the database of the International Committee on Taxonomy of Viruses. The source code is freely available at https//github.com/KennthShang/PhaGCN2.0.
Subject(s)
Key words

Full text: 1 Database: MEDLINE Main subject: Viruses Language: En Journal: Brief Bioinform Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2023 Type: Article Affiliation country: China

Full text: 1 Database: MEDLINE Main subject: Viruses Language: En Journal: Brief Bioinform Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2023 Type: Article Affiliation country: China