RESUMO
Harnessing DNA as a high-density storage medium for information storage and molecular recording of signals has been of increasing interest in the biotechnology field. Recently, progress in enzymatic DNA synthesis, DNA digital data storage, and DNA-based molecular recording has been made by leveraging the activity of the template-independent DNA polymerase, terminal deoxynucleotidyl transferase (TdT). TdT adds deoxyribonucleotides to the 3' end of single-stranded DNA, generating random sequences of single-stranded DNA. TdT can use several divalent cations for its enzymatic activity and exhibits shifts in deoxyribonucleotide incorporation frequencies in response to changes in its reaction environment. However, there is limited understanding of sequence-structure-function relationships regarding these properties, which in turn limits our ability to modulate TdT to further advance TdT-based tools. Most TdT literature to-date explores the activity of murine, bovine or human TdTs; studies probing TdT sequence and structure largely focus on strictly conserved residues that are functionally critical to TdT activity. Here, we explore non-conserved TdT sequence space by surveying the natural diversity of TdT. We characterize a diverse set of TdT homologs from different organisms and identify several TdT residues/regions that confer differences in TdT behavior between homologs. The observations in this study can design rules for targeted TdT libraries, in tandem with a screening assay, to modulate TdT properties. Moreover, the data can be useful in guiding further studies of potential residues of interest. Overall, we characterize TdTs that have not been previously studied in the literature, and we provide new insights into TdT sequence-function relationships.
RESUMO
Enzymatic DNA writing technologies based on the template-independent DNA polymerase terminal deoxynucleotidyl transferase (TdT) have the potential to advance DNA information storage. TdT is unique in its ability to synthesize single-stranded DNA de novo but has limitations, including catalytic inhibition by ribonucleotide presence and slower incorporation rates compared to replicative polymerases. We anticipate that protein engineering can improve, modulate, and tailor the enzyme's properties, but there is limited information on TdT sequence-structure-function relationships to facilitate rational approaches. Therefore, we developed an easily modifiable screening assay that can measure the TdT activity in high-throughput to evaluate large TdT mutant libraries. We demonstrated the assay's capabilities by engineering TdT mutants that exhibit both improved catalytic efficiency and improved activity in the presence of an inhibitor. We screened for and identified TdT variants with greater catalytic efficiency in both selectively incorporating deoxyribonucleotides and in the presence of deoxyribonucleotide/ribonucleotide mixes. Using this information from the screening assay, we rationally engineered other TdT homologues with the same properties. The emulsion-based assay we developed is, to the best of our knowledge, the first high-throughput screening assay that can measure TdT activity quantitatively and without the need for protein purification.