Your browser doesn't support javascript.
loading
Highly reliable and efficient encoding systems for hexadecimal polypeptide-based data storage.
Ren, Yubin; Zhang, Yi; Liu, Yawei; Wu, Qinglin; Hu, Hong-Gang; Li, Jingjing; Fan, Chunhai; Chen, Dong; Liu, Kai; Zhang, Hongjie.
Afiliação
  • Ren Y; Department of Chemistry, Tsinghua University, Beijing 100084, China.
  • Zhang Y; State Key Laboratory of Rare Earth Resource Utilization, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun 130022, China.
  • Liu Y; State Key Laboratory of Rare Earth Resource Utilization, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun 130022, China.
  • Wu Q; Institute of Process Equipment, College of Energy Engineering and State Key Laboratory of Fluid Power and Mechatronic Systems, Zhejiang University, Hangzhou 310027, China.
  • Hu HG; Institute of Translational Medicine, Shanghai University, Shanghai 200444, China.
  • Li J; State Key Laboratory of Rare Earth Resource Utilization, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun 130022, China.
  • Fan C; Frontiers Science Center for Transformative Molecules, School of Chemistry and Chemical Engineering, and Institute of Molecular Medicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200240, China.
  • Chen D; Institute of Process Equipment, College of Energy Engineering and State Key Laboratory of Fluid Power and Mechatronic Systems, Zhejiang University, Hangzhou 310027, China.
  • Liu K; Department of Chemistry, Tsinghua University, Beijing 100084, China.
  • Zhang H; Department of Chemistry, Tsinghua University, Beijing 100084, China.
Fundam Res ; 3(2): 298-304, 2023 Mar.
Article em En | MEDLINE | ID: mdl-38932929
ABSTRACT
Polypeptides consisting of amino acid (AA) sequences are suitable for high-density information storage. However, the lack of suitable encoding systems, which accommodate the characteristics of polypeptide synthesis, storage and sequencing, impedes the application of polypeptides for large-scale digital data storage. To address this, two reliable and highly efficient encoding systems, i.e. RaptorQ-Arithmetic-Base64-Shuffle-RS (RABSR) and RaptorQ-Arithmetic-Huffman-Rotary-Shuffle-RS (RAHRSR) systems, are developed for polypeptide data storage. The two encoding systems realized the advantages of compressing data, correcting errors of AA chain loss, correcting errors within AA chains, eliminating homopolymers, and pseudo-randomized encrypting. The coding efficiency without arithmetic compression and error correction of audios, pictures and texts by the RABSR system was 3.20, 3.12 and 3.53 Bits/AA, respectively. While that using the RAHRSR system reached 4.89, 4.80 and 6.84 Bits/AA, respectively. When implemented with redundancy for error correction and arithmetic compression to reduce redundancy, the coding efficiency of audios, pictures and texts by the RABSR system was 4.43, 4.36 and 5.22 Bits/AA, respectively. This efficiency further increased to 7.24, 7.11 and 9.82 Bits/AA by the RAHRSR system, respectively. Therefore, the developed hexadecimal polypeptide-based systems may provide a new scenario for highly reliable and highly efficient data storage.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2023 Tipo de documento: Article