Analyzing domain features of small proteins using a machine-learning method.
Proteomics; 24(16): e2300302, 2024 Aug.
Article in English | MEDLINE | ID: mdl-38258387
ABSTRACT
Small proteins (SPs) are a unique group of proteins that play crucial roles in many important biological processes. Exploring the biological function of SPs is necessary. In this study, the InterPro tool and the maximum correlation method were utilized to analyze functional domains of SPs. The purpose was to identify important functional domains that can indicate the essential differences between small and large protein sequences. First, the small and large proteins were represented by their functional domains via a one-hot scheme. Then, the MaxRel method was adopted to evaluate the relationships between each domain and the target variable, indicating small or large protein. The top 36 domain features were selected for further investigation. Among them, 14 were deemed to be highly related to SPs because they were annotated to SPs more frequently than large proteins. We found the involvement of functional domains, such as ubiquitin-conjugating enzyme/RWD-like, nuclear transport factor 2 domain, and alpha subunit of guanine nucleotide-binding protein (G-protein) in regulating the biological function of SPs. The involvement of these domains has been confirmed by other recent studies. Our findings indicate that protein functional domains may regulate small protein-related functions and predict their biological activity.
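The two steps described in the abstract (one-hot domain representation, then relevance ranking against the small/large label) can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes that MaxRel scores each domain by its mutual information with the class label, which the abstract does not state explicitly, and the domain identifiers (DOM_A, DOM_B, DOM_C), the toy data, and all function names are hypothetical.

```python
import math
from collections import Counter


def one_hot(protein_domains, vocabulary):
    """Encode each protein as a 0/1 vector over the domain vocabulary."""
    return [[1 if d in domains else 0 for d in vocabulary]
            for domains in protein_domains]


def mutual_information(xs, ys):
    """Mutual information (in bits) between two discrete sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px, py = Counter(xs), Counter(ys)
    return sum((c / n) * math.log2(c * n / (px[x] * py[y]))
               for (x, y), c in joint.items())


def rank_domains(protein_domains, labels, top_k):
    """Rank domains by relevance to the label (assumed: mutual information)."""
    vocabulary = sorted(set().union(*protein_domains))
    matrix = one_hot(protein_domains, vocabulary)
    scores = []
    for j, domain in enumerate(vocabulary):
        column = [row[j] for row in matrix]
        scores.append((domain, mutual_information(column, labels)))
    scores.sort(key=lambda item: item[1], reverse=True)
    return scores[:top_k]


def more_frequent_in_small(domain, protein_domains, labels):
    """True if the domain is annotated to small proteins (label 1)
    at a higher rate than to large proteins (label 0)."""
    small = [domain in d for d, y in zip(protein_domains, labels) if y == 1]
    large = [domain in d for d, y in zip(protein_domains, labels) if y == 0]
    return sum(small) / len(small) > sum(large) / len(large)


# Hypothetical toy data: 1 = small protein, 0 = large protein.
proteins = [{"DOM_A"}, {"DOM_A", "DOM_C"}, {"DOM_B"}, {"DOM_B", "DOM_C"}]
labels = [1, 1, 0, 0]

ranking = dict(rank_domains(proteins, labels, top_k=3))
```

In this toy set, DOM_A and DOM_B separate the two classes perfectly, while DOM_C occurs equally often in both and carries no information about the label. The final filter mirrors the abstract's criterion for calling a selected domain "highly related to SPs": it is annotated to small proteins more frequently than to large ones.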
Keywords
Full text:
1
Collection:
01-internacional
Database:
MEDLINE
Main subject:
Machine Learning
/
Protein Domains
Type of study:
Prognostic_studies
Limits:
Humans
Language:
En
Journal:
Proteomics
Journal subject:
BIOCHEMISTRY
Year:
2024
Document type:
Article
Affiliation country:
China