Your browser doesn't support javascript.
loading
Reference-based read clustering improves the de novo genome assembly of microbial strains.
Sim, Mikang; Lee, Jongin; Kwon, Daehong; Lee, Daehwan; Park, Nayoung; Wy, Suyeon; Ko, Younhee; Kim, Jaebum.
Afiliação
  • Sim M; Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Republic of Korea.
  • Lee J; Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Republic of Korea.
  • Kwon D; Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Republic of Korea.
  • Lee D; Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Republic of Korea.
  • Park N; Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Republic of Korea.
  • Wy S; Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Republic of Korea.
  • Ko Y; Division of Biomedical Engineering, Hankuk University of Foreign Studies, Gyeonggi-do 17035, Republic of Korea.
  • Kim J; Department of Biomedical Science and Engineering, Konkuk University, Seoul 05029, Republic of Korea.
Comput Struct Biotechnol J ; 21: 444-451, 2023.
Article em En | MEDLINE | ID: mdl-36618978
Constructing accurate microbial genome assemblies is necessary to understand genetic diversity in microbial genomes and its functional consequences. However, it still remains as a challenging task especially when only short-read sequencing technologies are used. Here, we present a new read-clustering algorithm, called RBRC, for improving de novo microbial genome assembly, by accurately estimating read proximity using multiple reference genomes. The performance of RBRC was confirmed by simulation-based evaluation in terms of assembly contiguity and the number of misassemblies, and was successfully applied to existing fungal and bacterial genomes by improving the quality of the assemblies without using additional sequencing data. RBRC is a very useful read-clustering algorithm that can be used (i) for generating high-quality genome assemblies of microbial strains when genome assemblies of related strains are available, and (ii) for upgrading existing microbial genome assemblies when the generation of additional sequencing data, such as long reads, is difficult.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Comput Struct Biotechnol J Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Comput Struct Biotechnol J Ano de publicação: 2023 Tipo de documento: Article