Your browser doesn't support javascript.
loading
GradHC: highly reliable gradual hash-based clustering for DNA storage systems.
Ben Shabat, Dvir; Hadad, Adar; Boruchovsky, Avital; Yaakobi, Eitan.
Afiliação
  • Ben Shabat D; Department of Computer Science, Technion, Haifa 320003, Israel.
  • Hadad A; Department of Computer Science, Technion, Haifa 320003, Israel.
  • Boruchovsky A; Department of Computer Science, Technion, Haifa 320003, Israel.
  • Yaakobi E; Department of Computer Science, Technion, Haifa 320003, Israel.
Bioinformatics ; 40(5)2024 May 02.
Article em En | MEDLINE | ID: mdl-38648049
ABSTRACT
MOTIVATION As data storage challenges grow and existing technologies approach their limits, synthetic DNA emerges as a promising storage solution due to its remarkable density and durability advantages. While cost remains a concern, emerging sequencing and synthetic technologies aim to mitigate it, yet introduce challenges such as errors in the storage and retrieval process. One crucial task in a DNA storage system is clustering numerous DNA reads into groups that represent the original input strands.

RESULTS:

In this paper, we review different methods for evaluating clustering algorithms and introduce a novel clustering algorithm for DNA storage systems, named Gradual Hash-based clustering (GradHC). The primary strength of GradHC lies in its capability to cluster with excellent accuracy various types of designs, including varying strand lengths, cluster sizes (including extremely small clusters), and different error ranges. Benchmark analysis demonstrates that GradHC is significantly more stable and robust than other clustering algorithms previously proposed for DNA storage, while also producing highly reliable clustering results. AVAILABILITY AND IMPLEMENTATION https//github.com/bensdvir/GradHC.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos / DNA / Análise de Sequência de DNA Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos / DNA / Análise de Sequência de DNA Idioma: En Ano de publicação: 2024 Tipo de documento: Article