Your browser doesn't support javascript.
loading
Application of artificial intelligence and machine learning techniques to the analysis of dynamic protein sequences.
Kombo, David C; LaMarche, Matthew J; Konkankit, Chilaluck C; Rackovsky, S.
Afiliação
  • Kombo DC; Department of Medicinal Chemistry, Integrated Drug Discovery, Cambridge, Massachusetts, USA.
  • LaMarche MJ; Department of Medicinal Chemistry, Integrated Drug Discovery, Cambridge, Massachusetts, USA.
  • Konkankit CC; Department of Chemistry and Chemical Biology, Baker Laboratory, Cornell University, Ithaca, New York, USA.
  • Rackovsky S; Department of Chemistry and Chemical Biology, Baker Laboratory, Cornell University, Ithaca, New York, USA.
Proteins ; 2024 May 29.
Article em En | MEDLINE | ID: mdl-38808365
ABSTRACT
We apply methods of Artificial Intelligence and Machine Learning to protein dynamic bioinformatics. We rewrite the sequences of a large protein data set, containing both folded and intrinsically disordered molecules, using a representation developed previously, which encodes the intrinsic dynamic properties of the naturally occurring amino acids. We Fourier analyze the resulting sequences. It is demonstrated that classification models built using several different supervised learning methods are able to successfully distinguish folded from intrinsically disordered proteins from sequence alone. It is further shown that the most important sequence property for this discrimination is the sequence mobility, which is the sequence averaged value of the residue-specific average alpha carbon B factor. This is in agreement with previous work, in which we have demonstrated the central role played by the sequence mobility in protein dynamic bioinformatics and biophysics. This finding opens a path to the application of dynamic bioinformatics, in combination with machine learning algorithms, to a range of significant biomedical problems.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Proteins Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Proteins Ano de publicação: 2024 Tipo de documento: Article