DNA Bloom Filter enables anti-contamination and file version control for DNA-based data storage.
Li, Yiming; Zhang, Haoling; Chen, Yuxin; Shen, Yue; Ping, Zhi.
Affiliations
  • Li Y; BGI Research, Shenzhen, 518083, China.
  • Zhang H; BGI Research, Changzhou, 213299, China.
  • Chen Y; BGI Research, Shenzhen, 518083, China.
  • Shen Y; Living Systems Lab, BESE, CEMSE, King Abdullah University of Science and Technology, Thuwal, 23955, Saudi Arabia.
  • Ping Z; BGI Research, Shenzhen, 518083, China.
Brief Bioinform ; 25(3)2024 Mar 27.
Article in English | MEDLINE | ID: mdl-38555478
ABSTRACT
DNA storage is one of the most promising media for future information storage owing to its high storage density, long-term durability and low maintenance cost. However, errors are inevitable during synthesis, storage and sequencing. Many error correction algorithms have been developed to ensure accurate information retrieval, but they reduce storage density or increase computational complexity. Here, we apply the Bloom Filter, a space-efficient probabilistic data structure, to DNA storage to provide an anti-error, or anti-contamination, function. The method needs only the original correct DNA sequences (referred to as target sequences) to build the corresponding data structure, which then filters out almost all incorrect sequences (referred to as non-target sequences) during sequencing data analysis. Experimental results demonstrate the universal and efficient filtering capabilities of our method. Furthermore, we employ the Counting Bloom Filter to provide file version control, which significantly reduces synthesis costs when modifying DNA-form files. To make file version control cost-efficient, a modified system based on the Yin-Yang codec is developed.
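The filtering and versioning ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the class name `CountingBloomFilter`, the helper `filter_reads`, the use of salted BLAKE2b as the hash family, and the sizes `num_counters` and `num_hashes` are all assumptions chosen for illustration. The structure is built from target sequences only, membership tests discard reads that are not (probably) targets, and counters rather than bits allow a sequence to be withdrawn when a file is modified.

```python
import hashlib
from typing import Iterable, Iterator, List


class CountingBloomFilter:
    """Illustrative counting Bloom filter over DNA strings (hypothetical design)."""

    def __init__(self, num_counters: int = 1 << 16, num_hashes: int = 4) -> None:
        # Both sizes are assumed values, not parameters taken from the paper.
        self.num_counters = num_counters
        self.num_hashes = num_hashes
        self.counters = [0] * num_counters

    def _positions(self, sequence: str) -> Iterator[int]:
        # One counter index per hash function; the hash index is used as a salt.
        for i in range(self.num_hashes):
            digest = hashlib.blake2b(
                sequence.encode("ascii"),
                digest_size=8,
                salt=i.to_bytes(2, "big"),
            ).digest()
            yield int.from_bytes(digest, "big") % self.num_counters

    def add(self, sequence: str) -> None:
        """Register a target sequence."""
        for pos in self._positions(sequence):
            self.counters[pos] += 1

    def remove(self, sequence: str) -> bool:
        """Withdraw a sequence (e.g. one replaced in a newer file version)."""
        if sequence not in self:
            return False  # never decrement counters for an absent sequence
        for pos in self._positions(sequence):
            self.counters[pos] -= 1
        return True

    def __contains__(self, sequence: str) -> bool:
        # True means "probably a target"; False means "definitely not a target".
        return all(self.counters[pos] > 0 for pos in self._positions(sequence))


def filter_reads(reads: Iterable[str], targets: CountingBloomFilter) -> List[str]:
    """Keep only the sequenced reads that pass the membership test."""
    return [read for read in reads if read in targets]


# Build the filter from the original (target) sequences only.
cbf = CountingBloomFilter()
for target in ["ACGTACGTAC", "TTGACCGTAA"]:
    cbf.add(target)

# A read with a single substitution is screened out with high probability.
kept = filter_reads(["ACGTACGTAC", "ACGTACGTAG", "TTGACCGTAA"], cbf)

# Version update: withdraw one sequence and register its replacement.
cbf.remove("TTGACCGTAA")
cbf.add("TTGACCGTCC")
```

A plain Bloom filter would store one bit per position and suffice for filtering alone; counters are what make removal possible, at the cost of more memory per position. False positives remain possible in both variants, which is why the abstract speaks of filtering out almost all non-target sequences.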

Full text: 1 Collections: 01-international Database: MEDLINE Main subject: Algorithms / DNA Language: En Journal: Brief Bioinform Journal subject: BIOLOGY / MEDICAL INFORMATICS Year of publication: 2024 Document type: Article Country of affiliation: China
