Deep representation learning improves prediction of LacI-mediated transcriptional repression.
Garruss, Alexander S; Collins, Katherine M; Church, George M.
Affiliation
  • Garruss AS; Department of Genetics, Harvard Medical School, Boston, MA 02115; garruss@fas.harvard.edu.
  • Collins KM; Wyss Institute for Biologically Inspired Engineering, Harvard University, Cambridge, MA 02138.
  • Church GM; Program in Bioinformatics and Integrative Genomics, Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115.
Proc Natl Acad Sci U S A; 118(27), 2021 07 06.
Article in English | MEDLINE | ID: mdl-34187888
ABSTRACT
Recent progress in DNA synthesis and sequencing technology has enabled systematic studies of protein function at a massive scale. We explore a deep mutational scanning study that measured the transcriptional repression function of 43,669 variants of the Escherichia coli LacI protein. We analyze structural and evolutionary aspects that relate to how the function of this protein is maintained, including an in-depth look at the C-terminal domain. We develop a deep neural network to predict transcriptional repression mediated by the lac repressor of Escherichia coli using experimental measurements of variant function. When measured across 10 separate training and validation splits using 5,009 single mutations of the lac repressor, our best-performing model achieved a median Pearson correlation of 0.79, exceeding any previous model. We demonstrate that deep representation learning approaches, first trained in an unsupervised manner across millions of diverse proteins, can be fine-tuned in a supervised fashion using lac repressor experimental datasets to more effectively predict a variant's effect on repression. These findings suggest a deep representation learning model may improve the prediction of other important properties of proteins.
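The evaluation protocol mentioned in the abstract (a median Pearson correlation over 10 training and validation splits) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the data are synthetic, the 80/20 split fraction is arbitrary, and an ordinary least-squares fit stands in for the deep neural network. The function names `pearson` and `median_pearson_over_splits` are hypothetical and are not taken from the paper.

```python
import numpy as np


def pearson(x, y):
    """Pearson correlation coefficient between two 1-D sequences."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    xc = x - x.mean()
    yc = y - y.mean()
    return float((xc @ yc) / np.sqrt((xc @ xc) * (yc @ yc)))


def median_pearson_over_splits(features, targets, n_splits=10,
                               val_fraction=0.2, seed=0):
    """Score a stand-in model on repeated random train/validation splits.

    Returns the median validation Pearson correlation and the per-split scores.
    """
    rng = np.random.default_rng(seed)
    n = len(targets)
    n_val = int(round(val_fraction * n))
    scores = []
    for _ in range(n_splits):
        order = rng.permutation(n)
        val_idx, train_idx = order[:n_val], order[n_val:]
        # Stand-in model (assumption): least squares with an intercept column.
        x_train = np.column_stack([features[train_idx], np.ones(len(train_idx))])
        coef, *_ = np.linalg.lstsq(x_train, targets[train_idx], rcond=None)
        x_val = np.column_stack([features[val_idx], np.ones(len(val_idx))])
        scores.append(pearson(x_val @ coef, targets[val_idx]))
    return float(np.median(scores)), scores


# Synthetic stand-in for variant features and measured repression values.
data_rng = np.random.default_rng(42)
features = data_rng.normal(size=(500, 8))
targets = features @ data_rng.normal(size=8) + data_rng.normal(scale=0.5, size=500)

median_r, per_split = median_pearson_over_splits(features, targets)
print(f"median Pearson r over {len(per_split)} splits: {median_r:.3f}")
```

Reporting the median rather than a single split's score makes the summary less sensitive to one unusually favorable or unfavorable partition of the variants.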

Full text: 1 Collection: 01-international Database: MEDLINE Main subject: Transcription, Genetic / Escherichia coli Proteins / Lac Repressors / Deep Learning Type of study: Prognostic_studies / Risk_factors_studies Language: En Journal: Proc Natl Acad Sci U S A Year: 2021 Document type: Article
