Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
Mais filtros










Intervalo de ano de publicação
1.
Protein J ; 43(2): 171-186, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38427271

RESUMO

Proteomics is a field dedicated to the analysis of proteins in cells, tissues, and organisms, aiming to gain insights into their structures, functions, and interactions. A crucial aspect within proteomics is protein family prediction, which involves identifying evolutionary relationships between proteins by examining similarities in their sequences or structures. This approach holds great potential for applications such as drug discovery and functional annotation of genomes. However, current methods for protein family prediction have certain limitations, including limited accuracy, high false positive rates, and challenges in handling large datasets. Some methods also rely on homologous sequences or protein structures, which introduce biases and restrict their applicability to specific protein families or structures. To overcome these limitations, researchers have turned to machine learning (ML) approaches that can identify connections between protein features and simplify complex high-dimensional datasets. This paper presents a comprehensive survey of articles that employ various ML techniques for predicting protein families. The primary objective is to explore and improve ML techniques specifically for protein family prediction, thus advancing future research in the field. Through qualitative and quantitative analyses of ML techniques, it is evident that multiple methods utilizing a range of classifiers have been applied for protein family prediction. However, there has been limited focus on developing novel classifiers for protein family classification, highlighting the urgent need for improved approaches in this area. By addressing these challenges, this research aims to enhance the accuracy and effectiveness of protein family prediction, ultimately facilitating advancements in proteomics and its diverse applications.


Assuntos
Aprendizado de Máquina , Proteínas , Proteínas/química , Proteômica/métodos , Bases de Dados de Proteínas , Biologia Computacional/métodos , Humanos
2.
Curr Drug Metab ; 24(12): 817-834, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-38270152

RESUMO

BACKGROUND: Drug-Protein Interaction (DPI) identification is crucial in drug discovery. The high dimensionality of drug and protein features poses challenges for accurate interaction prediction, necessitating the use of computational techniques. Docking-based methods rely on 3D structures, while ligand-based methods have limitations such as reliance on known ligands and neglecting protein structure. Therefore, the preferred approach is the chemogenomics-based approach using machine learning, which considers both drug and protein characteristics for DPI prediction. METHODS: In machine learning, feature selection plays a vital role in improving model performance, reducing overfitting, enhancing interpretability, and making the learning process more efficient. It helps extract meaningful patterns from drug and protein data while eliminating irrelevant or redundant information, resulting in more effective machine-learning models. On the other hand, classification is of great importance as it enables pattern recognition, decision-making, predictive modeling, anomaly detection, data exploration, and automation. It empowers machines to make accurate predictions and facilitates efficient decision-making in DPI prediction. For this research work, protein data was sourced from the KEGG database, while drug data was obtained from the DrugBank data machine-learning base. RESULTS: To address the issue of imbalanced Drug Protein Pairs (DPP), different balancing techniques like Random Over Sampling (ROS), Synthetic Minority Over-sampling Technique (SMOTE), and Adaptive SMOTE were employed. Given the large number of features associated with drugs and proteins, feature selection becomes necessary. Various feature selection methods were evaluated: Correlation, Information Gain (IG), Chi-Square (CS), and Relief. Multiple classification methods, including Support Vector Machines (SVM), Random Forest (RF), Adaboost, and Logistic Regression (LR), were used to predict DPI. Finally, this research identifies the best balancing, feature selection, and classification methods for accurate DPI prediction. CONCLUSION: This comprehensive approach aims to overcome the limitations of existing methods and provide more reliable and efficient predictions in drug-protein interaction studies.


Assuntos
Descoberta de Drogas , Aprendizado de Máquina , Humanos , Bases de Dados Factuais , Interações Medicamentosas
3.
J Med Syst ; 42(11): 220, 2018 Oct 03.
Artigo em Inglês | MEDLINE | ID: mdl-30280254

RESUMO

Feature extraction and classification are considered to be the major tasks in image processing applications. This paper proposes a novel method to extract the features of a color image for classification. The proposed method, Dominant Local Texture-Color Patterns (DLTCP) is based on the Dominant Texture and Dominant Color channels in a RGB color space. The dominant texture pattern represents a channel among RGB with maximum variations in the texture and the dominant color pattern represents the color channel with the maximum pixel intensity. The combination of channels with dominant texture pattern and dominant color pattern is assigned a unique value which is used to extract the features of an image. The proposed texture-color features is tested for rotational, illumination and scale invariance property using the color images taken from Outex and Vistex databases. It is experimentally shown that the proposed method achieves the highest accuracy in classification using K-Nearest Neighbor (KNN) classifier under various challenges.


Assuntos
Cor , Processamento de Imagem Assistida por Computador , Algoritmos , Software
4.
Braz. arch. biol. technol ; 59(spe2): e16161057, 2016. tab, graf
Artigo em Inglês | LILACS | ID: biblio-839049

RESUMO

ABSTRACT Texture is one of the chief characteristics of an image. In recent years, local texture descriptors have garnered attention among researchers in describing effective texture patterns to demarcate facial images. A feature descriptor titled Local Texture Description Framework-based Modified Local Directional Number pattern (LTDF_MLDN), capable of encoding texture patterns with pixels that lie at dissimilar regions, has been proposed recently to describe effective features for face images. However, the role of the descriptor can differ with different classifiers and distance metrics for diverse issues in face recognition. Hence, in this paper, an extensive evaluation of the LTDF_MLDN is carried out with an Extreme Learning Machine (ELM), a Support Vector Machine (SVM) and a Nearest Neighborhood Classifier (NNC) which uses Euclidian, Manhattan, Minkowski, G-statistics and chi-square dissimilarity metrics to illustrate differences in performance with respect to assorted issues in face recognition using six benchmark databases. Experimental results depict that the proposed descriptor is best suited with NNC for general case and expression variation, whereas, for the other facial variations ELM is found to produce better results.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...