Insight into the protein solubility driving forces with neural attention.
PLoS Comput Biol
; 16(4): e1007722, 2020 04.
Article
em En
| MEDLINE
| ID: mdl-32352965
Protein solubility is a key aspect for many biotechnological, biomedical and industrial processes, such as the production of active proteins and antibodies. In addition, understanding the molecular determinants of the solubility of proteins may be crucial to shed light on the molecular mechanisms of diseases caused by aggregation processes such as amyloidosis. Here we present SKADE, a novel Neural Network protein solubility predictor and we show how it can provide novel insight into the protein solubility mechanisms, thanks to its neural attention architecture. First, we show that SKADE positively compares with state of the art tools while using just the protein sequence as input. Then, thanks to the neural attention mechanism, we use SKADE to investigate the patterns learned during training and we analyse its decision process. We use this peculiarity to show that, while the attention profiles do not correlate with obvious sequence aspects such as biophysical properties of the aminoacids, they suggest that N- and C-termini are the most relevant regions for solubility prediction and are predictive for complex emergent properties such as aggregation-prone regions involved in beta-amyloidosis and contact density. Moreover, SKADE is able to identify mutations that increase or decrease the overall solubility of the protein, allowing it to be used to perform large scale in-silico mutagenesis of proteins in order to maximize their solubility.
Texto completo:
1
Base de dados:
MEDLINE
Assunto principal:
Solubilidade
/
Biologia Computacional
/
Rede Nervosa
Tipo de estudo:
Prognostic_studies
Limite:
Animals
/
Humans
Idioma:
En
Revista:
PLoS Comput Biol
Assunto da revista:
BIOLOGIA
/
INFORMATICA MEDICA
Ano de publicação:
2020
Tipo de documento:
Article
País de afiliação:
Bélgica