Pesquisa | BVS - MINISTÉRIO DA SAÚDE

Hyperspectral indices data fusion-based machine learning enhanced by MRMR algorithm for estimating maize chlorophyll content.

Nagy, Attila; Szabó, Andrea; Elbeltagi, Ahmed; Nxumalo, Gift Siphiwe; Bódi, Erika Budayné; Tamás, János.

Front Plant Sci ; 15: 1419316, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-39479550

RESUMO

Accurate estimation of chlorophyll is essential for monitoring maize health and growth, for which hyperspectral imaging provides rich data. In this context, this paper presents an innovative method to estimate maize chlorophyll by combining hyperspectral indices and advanced machine learning models. The methodology of this study focuses on the development of machine learning models using proprietary hyperspectral indices to estimate corn chlorophyll content. Six advanced machine learning models were used, including robust linear stepwise regression, support vector machines (SVM), fine Gaussian SVM, Matern 5/2 Gaussian stepwise regression, and three-layer neural network. The MRMR algorithm was integrated into the process to improve feature selection by identifying the most informative spectral bands, thereby reducing data redundancy and improving model performance. The results showed significant differences in the performance of the six machine learning models applied to chlorophyll estimation. Among the models, the Matern 5/2 Gaussian process regression model showed the highest prediction accuracy. The model achieved R2 = 0.71 for the training set, RMSE = 338.46 µg/g and MAE = 264.30 µg/g. In the case of the validation set, the Matern 5/2 Gaussian process regression model further improved its performance, reaching R2 =0.79, RMSE=296.37 µg/g, MAE=237.12 µg/g. These metrics show that Matern's 5/2 Gaussian process regression model combined with the MRMR algorithm to select optimal traits is highly effective in predicting corn chlorophyll content. This research has important implications for precision agriculture, particularly for real-time monitoring and management of crop health. Accurate estimation of chlorophyll allows farmers to take timely and targeted action.

Optimizing Gene Selection and Cancer Classification with Hybrid Sine Cosine and Cuckoo Search Algorithm.

Yaqoob, Abrar; Verma, Navneet Kumar; Aziz, Rabia Musheer.

J Med Syst ; 48(1): 10, 2024 Jan 09.

Artigo em Inglês | MEDLINE | ID: mdl-38193948

RESUMO

Gene expression datasets offer a wide range of information about various biological processes. However, it is difficult to find the important genes among the high-dimensional biological data due to the existence of redundant and unimportant ones. Numerous Feature Selection (FS) techniques have been created to get beyond this obstacle. Improving the efficacy and precision of FS methodologies is crucial in order to identify significant genes amongst complicated complex biological data. In this work, we present a novel approach to gene selection called the Sine Cosine and Cuckoo Search Algorithm (SCACSA). This hybrid method is designed to work with well-known machine learning classifiers Support Vector Machine (SVM). Using a dataset on breast cancer, the hybrid gene selection algorithm's performance is carefully assessed and compared to other feature selection methods. To improve the quality of the feature set, we use minimum Redundancy Maximum Relevance (mRMR) as a filtering strategy in the first step. The hybrid SCACSA method is then used to enhance and optimize the gene selection procedure. Lastly, we classify the dataset according to the chosen genes by using the SVM classifier. Given the pivotal role gene selection plays in unraveling complex biological datasets, SCACSA stands out as an invaluable tool for the classification of cancer datasets. The findings help medical practitioners make well-informed decisions about cancer diagnosis and provide them with a valuable tool for navigating the complex world of gene expression data.

Assuntos

Algoritmos , Neoplasias da Mama , Humanos , Feminino , Neoplasias da Mama/genética , Pessoal de Saúde , Aprendizado de Máquina , Máquina de Vetores de Suporte

A novel and innovative cancer classification framework through a consecutive utilization of hybrid feature selection.

Mahto, Rajul; Ahmed, Saboor Uddin; Rahman, Rizwan Ur; Aziz, Rabia Musheer; Roy, Priyanka; Mallik, Saurav; Li, Aimin; Shah, Mohd Asif.

BMC Bioinformatics ; 24(1): 479, 2023 Dec 15.

Artigo em Inglês | MEDLINE | ID: mdl-38102551

RESUMO

Cancer prediction in the early stage is a topic of major interest in medicine since it allows accurate and efficient actions for successful medical treatments of cancer. Mostly cancer datasets contain various gene expression levels as features with less samples, so firstly there is a need to eliminate similar features to permit faster convergence rate of classification algorithms. These features (genes) enable us to identify cancer disease, choose the best prescription to prevent cancer and discover deviations amid different techniques. To resolve this problem, we proposed a hybrid novel technique CSSMO-based gene selection for cancer classification. First, we made alteration of the fitness of spider monkey optimization (SMO) with cuckoo search algorithm (CSA) algorithm viz., CSSMO for feature selection, which helps to combine the benefit of both metaheuristic algorithms to discover a subset of genes which helps to predict a cancer disease in early stage. Further, to enhance the accuracy of the CSSMO algorithm, we choose a cleaning process, minimum redundancy maximum relevance (mRMR) to lessen the gene expression of cancer datasets. Next, these subsets of genes are classified using deep learning (DL) to identify different groups or classes related to a particular cancer disease. Eight different benchmark microarray gene expression datasets of cancer have been utilized to analyze the performance of the proposed approach with different evaluation matrix such as recall, precision, F1-score, and confusion matrix. The proposed gene selection method with DL achieves much better classification accuracy than other existing DL and machine learning classification models with all large gene expression dataset of cancer.

Assuntos

Algoritmos , Neoplasias , Humanos , Análise em Microsséries , Neoplasias/genética , Técnicas Genéticas , Aprendizado de Máquina

Automatic Life Detection Based on Efficient Features of Ground-Penetrating Rescue Radar Signals.

Shi, Di; Gidion, Gunnar; Reindl, Leonhard M; Rupitsch, Stefan J.

Sensors (Basel) ; 23(15)2023 Jul 28.

Artigo em Inglês | MEDLINE | ID: mdl-37571552

RESUMO

Good feature engineering is a prerequisite for accurate classification, especially in challenging scenarios such as detecting the breathing of living persons trapped under building rubble using bioradar. Unlike monitoring patients' breathing through the air, the measuring conditions of a rescue bioradar are very complex. The ultimate goal of search and rescue is to determine the presence of a living person, which requires extracting representative features that can distinguish measurements with the presence of a person and without. To address this challenge, we conducted a bioradar test scenario under laboratory conditions and decomposed the radar signal into different range intervals to derive multiple virtual scenes from the real one. We then extracted physical and statistical quantitative features that represent a measurement, aiming to find those features that are robust to the complexity of rescue-radar measuring conditions, including different rubble sites, breathing rates, signal strengths, and short-duration disturbances. To this end, we utilized two methods, Analysis of Variance (ANOVA), and Minimum Redundancy Maximum Relevance (MRMR), to analyze the significance of the extracted features. We then trained the classification model using a linear kernel support vector machine (SVM). As the main result of this work, we identified an optimal feature set of four features based on the feature ranking and the improvement in the classification accuracy of the SVM model. These four features are related to four different physical quantities and independent from different rubble sites.

Assuntos

Radar , Taxa Respiratória , Humanos , Máquina de Vetores de Suporte

Covid-19 detection in chest X-ray through random forest classifier using a hybridization of deep CNN and DWT optimized features.

Mostafiz, Rafid; Uddin, Mohammad Shorif; Alam, Nur-A-; Mahfuz Reza, Md; Rahman, Mohammad Motiur.

J King Saud Univ Comput Inf Sci ; 34(6): 3226-3235, 2022 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-38620614

RESUMO

Chest X-ray image contains sufficient information that finds wide-spread applications in diverse disease diagnosis and decision making to assist the medical experts. This paper has proposed an intelligent approach to detect Covid-19 from the chest X-ray image using the hybridization of deep convolutional neural network (CNN) and discrete wavelet transform (DWT) features. At first, the X-ray image is enhanced and segmented through preprocessing tasks, and then deep CNN and DWT features are extracted. The optimum features are extracted from these hybridized features through minimum redundancy and maximum relevance (mRMR) along with recursive feature elimination (RFE). Finally, the random forest-based bagging approach is used for doing the detection task. An extensive experiment is performed, and the results confirm that our approach gives satisfactory performance compare to the existing methods with an overall accuracy of more than 98.5%.

Minimum redundancy maximal relevance gene selection of apoptosis pathway genes in peripheral blood mononuclear cells of HIV-infected patients with antiretroviral therapy-associated mitochondrial toxicity.

Bose, Eliezer; Paintsil, Elijah; Ghebremichael, Musie.

BMC Med Genomics ; 14(1): 285, 2021 12 01.

Artigo em Inglês | MEDLINE | ID: mdl-34852799

RESUMO

BACKGROUND: We previously identified differentially expressed genes on the basis of false discovery rate adjusted P value using empirical Bayes moderated tests. However, that approach yielded a subset of differentially expressed genes without accounting for redundancy between the selected genes. METHODS: This study is a secondary analysis of a case-control study of the effect of antiretroviral therapy on apoptosis pathway genes comprising of 16 cases (HIV infected with mitochondrial toxicity) and 16 controls (uninfected). We applied the maximum relevance minimum redundancy (mRMR) algorithm on the genes that were differentially expressed between the cases and controls. The mRMR algorithm iteratively selects features (genes) that are maximally relevant for class prediction and minimally redundant. We implemented several machine learning classifiers and tested the prediction accuracy of the two mRMR genes. We next used network analysis to estimate and visualize the association among the differentially expressed genes. We employed Markov Random Field or undirected network models to identify gene networks related to mitochondrial toxicity. The Spinglass model was used to identify clusters of gene communities. RESULTS: The mRMR algorithm ranked DFFA and TNFRSF1A, two of the upregulated proapoptotic genes, on the top. The overall prediction accuracy was 86%, the two mRMR genes correctly classified 86% of the participants into their respective groups. The estimated network models showed different patterns of gene networks. In the network of the cases, FASLG was the most central gene. However, instead of FASLG, ABL1 and LTBR had the highest centrality in controls. CONCLUSION: The mRMR algorithm and network analysis revealed a new correlation of genes associated with mitochondrial toxicity.

Assuntos

Infecções por HIV , Leucócitos Mononucleares , Algoritmos , Apoptose , Teorema de Bayes , Estudos de Casos e Controles , Infecções por HIV/tratamento farmacológico , Infecções por HIV/genética , Humanos

Bearing Fault Diagnosis Using a Particle Swarm Optimization-Least Squares Wavelet Support Vector Machine Classifier.

Van, Mien; Hoang, Duy Tang; Kang, Hee Jun.

Sensors (Basel) ; 20(12)2020 Jun 17.

Artigo em Inglês | MEDLINE | ID: mdl-32560493

RESUMO

Bearing is one of the key components of a rotating machine. Hence, monitoring health condition of the bearing is of paramount importace. This paper develops a novel particle swarm optimization (PSO)-least squares wavelet support vector machine (PSO-LSWSVM) classifier, which is designed based on a combination between a PSO, a least squares procedure, and a new wavelet kernel function-based support vector machine (SVM), for bearing fault diagnosis. In this work, bearing fault classification is transformed into a pattern recognition problem, which consists of three stages of data processing. Firstly, a rich information dataset is built by extracting the features from the signals, which are decomposed by the nonlocal means (NLM) and empirical mode decomposition (EMD). Secondly, a minimum-redundancy maximum-relevance (mRMR) method is employed to determine a subset of feature that can provide an optimal performance. Thirdly, a novel classifier, namely LSWSVM, is proposed with the aid of a PSO, to provide higher classification accuracy. The key innovative science of this work is to propropose a new classifier with the aid of an new wavelet kernel type to increase the classification precision of bearing fault diagnosis. The merit features of the proposed approach are demonstrated based on a benchmark bearing dataset and a comprehensive comparison procedure.

Identification of molecular biomarkers for pancreatic cancer with mRMR shortest path method.

Shen, Shuhua; Gui, Tuantuan; Ma, Chengcheng.

Oncotarget ; 8(25): 41432-41439, 2017 Jun 20.

Artigo em Inglês | MEDLINE | ID: mdl-28611293

RESUMO

The high mortality rate of pancreatic cancer makes it one of the most studied diseases among all cancer types. Many researches have been conducted to understand the mechanism underlying its emergence and pathogenesis of this disease. Here, by using minimum-redundancy-maximum-relevance (mRMR) method, we studied a set of transcriptome data of pancreatic cancer. As we gradually added features to achieve the most accurate classification results of Jackknife, a gene set of 9 genes was identified. They were NHS, SCML2, LAMC2, S100P, COL17A1, AMIGO2, PTPRR, KPNA7 and KCNN4. Through STRING 2.0 protein-protein interactions (PPIs) analysis, 40 proteins were identified in the shortest paths between genes in the gene set, 30 of them passed the permutation test, which indicated they were hubs in the background network. Those genes in the protein-protein interaction network were enriched to 37 functional modules, such as: negative regulation of transcription from RNA polymerase II promoter, negative regulation of ERK1 and ERK2 cascade and BMP signaling pathway. Our study indicated new mechanism of pancreatic cancer, suggesting potential therapeutic targets for further study.

Assuntos

Biomarcadores Tumorais/genética , Perfilação da Expressão Gênica/métodos , Regulação Neoplásica da Expressão Gênica , Neoplasias Pancreáticas/genética , Algoritmos , Biomarcadores Tumorais/metabolismo , Ontologia Genética , Redes Reguladoras de Genes , Humanos , Neoplasias Pancreáticas/diagnóstico , Neoplasias Pancreáticas/metabolismo , Mapas de Interação de Proteínas/genética , Transdução de Sinais/genética

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA