Constructing telomere-to-telomere diploid genome by polishing haploid nanopore-based assembly.
Nat Methods
; 21(4): 574-583, 2024 Apr.
Article
em En
| MEDLINE
| ID: mdl-38459383
ABSTRACT
Draft genomes generated from Oxford Nanopore Technologies (ONT) long reads are known to have a higher error rate. Although existing genome polishers can enhance their quality, the error rate (including mismatches, indels and switching errors between paternal and maternal haplotypes) can be significant. Here, we develop two polishers, hypo-short and hypo-hybrid to address this issue. Hypo-short utilizes Illumina short reads to polish an ONT-based draft assembly, resulting in a high-quality assembly with low error rates and switching errors. Expanding on this, hypo-hybrid incorporates ONT long reads to further refine the assembly into a diploid representation. Leveraging on hypo-hybrid, we have created a diploid genome assembly pipeline called hypo-assembler. Hypo-assembler automates the generation of highly accurate, contiguous and nearly complete diploid assemblies using ONT long reads, Illumina short reads and optionally Hi-C reads. Notably, our solution even allows for the production of telomere-to-telomere diploid genomes with additional manual steps. As a proof of concept, we successfully assembled a fully phased telomere-to-telomere diploid genome of HG00733, achieving a quality value exceeding 50.
Texto completo:
1
Base de dados:
MEDLINE
Assunto principal:
Nanoporos
Idioma:
En
Revista:
Nat Methods
Assunto da revista:
TECNICAS E PROCEDIMENTOS DE LABORATORIO
Ano de publicação:
2024
Tipo de documento:
Article
País de afiliação:
Singapura