Your browser doesn't support javascript.
loading
Multiple errors correction for position-limited DNA sequences with GC balance and no homopolymer for DNA-based data storage.
Li, Xiayang; Chen, Moxuan; Wu, Huaming.
Afiliación
  • Li X; School of Mathematics, Tianjin University, Tianjin, 300372, China.
  • Chen M; School of Mathematics, Tianjin University, Tianjin, 300372, China.
  • Wu H; Center for Applied Mathematics, Tianjin University, Tianjin, 300372, China.
Brief Bioinform ; 24(1)2023 01 19.
Article en En | MEDLINE | ID: mdl-36410731
ABSTRACT
Deoxyribonucleic acid (DNA) is an attractive medium for long-term digital data storage due to its extremely high storage density, low maintenance cost and longevity. However, during the process of synthesis, amplification and sequencing of DNA sequences with homopolymers of large run-length, three different types of errors, namely, insertion, deletion and substitution errors frequently occur. Meanwhile, DNA sequences with large imbalances between GC and AT content exhibit high dropout rates and are prone to errors. These limitations severely hinder the widespread use of DNA-based data storage. In order to reduce and correct these errors in DNA storage, this paper proposes a novel coding schema called DNA-LC, which converts binary sequences into DNA base sequences that satisfy both the GC balance and run-length constraints. Furthermore, our coding mode is able to detect and correct multiple errors with a higher error correction capability than the other methods targeting single error correction within a single strand. The decoding algorithm has been implemented in practice. Simulation results indicate that our proposed coding scheme can offer outstanding error protection to DNA sequences. The source code is freely accessible at https//github.com/XiayangLi2301/DNA.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Programas Informáticos / ADN Idioma: En Revista: Brief Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2023 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Programas Informáticos / ADN Idioma: En Revista: Brief Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2023 Tipo del documento: Article País de afiliación: China