Your browser doesn't support javascript.
loading
EzBioCloud: a genome-driven database and platform for microbiome identification and discovery.
Chalita, Mauricio; Kim, Yeong Ouk; Park, Sein; Oh, Hyun-Seok; Cho, Jae Hyoung; Moon, Jeongsup; Baek, Nuga; Moon, Changsik; Lee, Kihyun; Yang, Junwon; Nam, Gi Gyun; Jung, Yeonjae; Na, Seong-In; Bailey, Michael James; Chun, Jongsik.
Afiliación
  • Chalita M; CJ Bioscience Inc, Seoul, 04527, Republic of Korea.
  • Kim YO; CJ Bioscience Inc, Seoul, 04527, Republic of Korea.
  • Park S; Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 08826, Republic of Korea.
  • Oh HS; CJ Bioscience Inc, Seoul, 04527, Republic of Korea.
  • Cho JH; Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 08826, Republic of Korea.
  • Moon J; CJ Bioscience Inc, Seoul, 04527, Republic of Korea.
  • Baek N; CJ Bioscience Inc, Seoul, 04527, Republic of Korea.
  • Moon C; CJ Bioscience Inc, Seoul, 04527, Republic of Korea.
  • Lee K; CJ Bioscience Inc, Seoul, 04527, Republic of Korea.
  • Yang J; CJ Bioscience Inc, Seoul, 04527, Republic of Korea.
  • Nam GG; CJ Bioscience Inc, Seoul, 04527, Republic of Korea.
  • Jung Y; CJ Bioscience Inc, Seoul, 04527, Republic of Korea.
  • Na SI; Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, 08826, Republic of Korea.
  • Bailey MJ; CJ Bioscience Inc, Seoul, 04527, Republic of Korea.
  • Chun J; CJ Bioscience Inc, Seoul, 04527, Republic of Korea.
Article en En | MEDLINE | ID: mdl-38888585
ABSTRACT
With the continued evolution of DNA sequencing technologies, the role of genome sequence data has become more integral in the classification and identification of Bacteria and Archaea. Six years after introducing EzBioCloud, an integrated platform representing the taxonomic hierarchy of Bacteria and Archaea through quality-controlled 16S rRNA gene and genome sequences, we present an updated version, that further refines and expands its capabilities. The current update recognizes the growing need for accurate taxonomic information as defining a species increasingly relies on genome sequence comparisons. We also incorporated an advanced strategy for addressing underrepresented or less studied lineages, bolstering the comprehensiveness and accuracy of our database. Our rigorous quality control protocols remain, where whole-genome assemblies from the NCBI Assembly Database undergo stringent screening to remove low-quality sequence data. These are then passed through our enhanced identification bioinformatics pipeline which initiates a 16S rRNA gene similarity search and then calculates the average nucleotide identity (ANI). For genome sequences lacking a 16S rRNA sequence and without a closely related genomic representative for ANI calculation, we apply a different ANI approach using bacterial core genes for improved taxonomic placement (core gene ANI, cgANI). Because of the increase in genome sequences available in NCBI and our newly introduced cgANI method, EzBioCloud now encompasses a total of 109 835 species, of which 21 964 have validly published names. 47 896 are candidate species identified either through 16S rRNA sequence similarity (phylotypes) or through whole genome ANI (genomospecies), and the remaining 39 975 were positioned in the taxonomic tree by cgANI (species clusters). Our EzBioCloud database is accessible at www.ezbiocloud.net/db.
Asunto(s)
Palabras clave

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Bacterias / ARN Ribosómico 16S / Genoma Bacteriano / Archaea / Microbiota Idioma: En Revista: Int J Syst Evol Microbiol Asunto de la revista: MICROBIOLOGIA Año: 2024 Tipo del documento: Article

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Bacterias / ARN Ribosómico 16S / Genoma Bacteriano / Archaea / Microbiota Idioma: En Revista: Int J Syst Evol Microbiol Asunto de la revista: MICROBIOLOGIA Año: 2024 Tipo del documento: Article