Your browser doesn't support javascript.
loading
Transfer learning for cross-context prediction of protein expression from 5'UTR sequence.
Gilliot, Pierre-Aurélien; Gorochowski, Thomas E.
Afiliação
  • Gilliot PA; School of Biological Sciences, University of Bristol, 24 Tyndall Avenue, Bristol BS8 1TQ, UK.
  • Gorochowski TE; School of Biological Sciences, University of Bristol, 24 Tyndall Avenue, Bristol BS8 1TQ, UK.
Nucleic Acids Res ; 52(13): e58, 2024 Jul 22.
Article em En | MEDLINE | ID: mdl-38864396
ABSTRACT
Model-guided DNA sequence design can accelerate the reprogramming of living cells. It allows us to engineer more complex biological systems by removing the need to physically assemble and test each potential design. While mechanistic models of gene expression have seen some success in supporting this goal, data-centric, deep learning-based approaches often provide more accurate predictions. This accuracy, however, comes at a cost - a lack of generalization across genetic and experimental contexts that has limited their wider use outside the context in which they were trained. Here, we address this issue by demonstrating how a simple transfer learning procedure can effectively tune a pre-trained deep learning model to predict protein translation rate from 5' untranslated region (5'UTR) sequence for diverse contexts in Escherichia coli using a small number of new measurements. This allows for important model features learnt from expensive massively parallel reporter assays to be easily transferred to new settings. By releasing our trained deep learning model and complementary calibration procedure, this study acts as a starting point for continually refined model-based sequence design that builds on previous knowledge and future experimental efforts.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Regiões 5' não Traduzidas / Escherichia coli / Aprendizado Profundo Idioma: En Revista: Nucleic Acids Res Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Regiões 5' não Traduzidas / Escherichia coli / Aprendizado Profundo Idioma: En Revista: Nucleic Acids Res Ano de publicação: 2024 Tipo de documento: Article