Your browser doesn't support javascript.
loading
Interpretable pairwise distillations for generative protein sequence models.
Feinauer, Christoph; Meynard-Piganeau, Barthelemy; Lucibello, Carlo.
Afiliação
  • Feinauer C; Department of Computing Sciences, Bocconi University, Milan, Italy.
  • Meynard-Piganeau B; Bocconi Institute for Data Science and Analytics (BIDSA), Milan, Italy.
  • Lucibello C; Laboratory of Computational and Quantitative Biology (LCQB) UMR 7238 CNRS, Sorbonne Université, Paris, France.
PLoS Comput Biol ; 18(6): e1010219, 2022 06.
Article em En | MEDLINE | ID: mdl-35737722
ABSTRACT
Many different types of generative models for protein sequences have been proposed in literature. Their uses include the prediction of mutational effects, protein design and the prediction of structural properties. Neural network (NN) architectures have shown great performances, commonly attributed to the capacity to extract non-trivial higher-order interactions from the data. In this work, we analyze two different NN models and assess how close they are to simple pairwise distributions, which have been used in the past for similar problems. We present an approach for extracting pairwise models from more complex ones using an energy-based modeling framework. We show that for the tested models the extracted pairwise models can replicate the energies of the original models and are also close in performance in tasks like mutational effect prediction. In addition, we show that even simpler, factorized models often come close in performance to the original models.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Destilação / Redes Neurais de Computação Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Destilação / Redes Neurais de Computação Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article