Pesquisa | BVS - MINISTÉRIO DA SAÚDE

Robust principal component analysis-based prediction of protein-protein interaction hot spots.

Sitani, Divya; Giorgetti, Alejandro; Alfonso-Prieto, Mercedes; Carloni, Paolo.

Proteins ; 89(6): 639-647, 2021 06.

Artigo em Inglês | MEDLINE | ID: mdl-33458895

RESUMO

Proteins often exert their function by binding to other cellular partners. The hot spots are key residues for protein-protein binding. Their identification may shed light on the impact of disease associated mutations on protein complexes and help design protein-protein interaction inhibitors for therapy. Unfortunately, current machine learning methods to predict hot spots, suffer from limitations caused by gross errors in the data matrices. Here, we present a novel data pre-processing pipeline that overcomes this problem by recovering a low rank matrix with reduced noise using Robust Principal Component Analysis. Application to existing databases shows the predictive power of the method.

Assuntos

Aprendizado de Máquina , Análise de Componente Principal , Mapeamento de Interação de Proteínas/estatística & dados numéricos , Proteínas/química , Sítios de Ligação , Biologia Computacional/métodos , Bases de Dados de Proteínas , Humanos , Modelos Moleculares , Ligação Proteica , Domínios e Motivos de Interação entre Proteínas , Mapeamento de Interação de Proteínas/métodos , Proteínas/metabolismo , Curva ROC

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA