RESUMO
Silkworm seed production is vital for silk farming, requiring precise breeding techniques to optimize yields. In silkworm seed production, precise sex classification is crucial for optimizing breeding and boosting silk yields. A non-destructive approach for sex classification addresses these challenges, offering an efficient alternative that enhances both yield and environmental responsibility. Southern India is a hub for mulberry silk and cocoon farming, with the high-yielding double-hybrid varieties FC1 (foundation cross 1) and FC2 (foundation cross 2) being popular. Traditional methods of silkworm pupae sex classification involve manual sorting by experts, necessitating the cutting of cocoons - a practice with a high risk of damaging the cocoon and affecting yield. To address this issue, this study introduces an accelerated histogram of oriented gradients (HOG) feature extraction technique that is enhanced by block-level dimensionality reduction. This non-destructive method allows for efficient and accurate silkworm pupae classification. The modified HOG features are then fused with weight features and processed through a machine learning classification model that incorporates recursive feature elimination (RFE). Performance evaluation shows that an RFE-hybridized XGBoost model attained the highest classification accuracy, achieving 97.2% for FC1 and 97.1% for FC2. The model further optimized with a novel teaching learning-based population selection genetic algorithm (TLBPSGA) achieved a remarkable accuracy of 98.5% for FC1 and 98.2% for FC2. These findings have far-reaching implications for improving both the ecological sustainability and economic efficiency of silkworm seed production.
Assuntos
Bombyx , Bombyx/genética , Animais , Morus/genética , Aprendizado de Máquina , Algoritmos , Masculino , FemininoRESUMO
BACKGROUND: Breathomics is an emerging area focusing on monitoring and diagnosing pulmonary diseases, especially lung cancer. This research aims to employ metabolomic methods to create a breathprint in human-expelled air to rapidly identify lung cancer and its stages. RESEARCH DESIGN AND METHODS: An electronic nose (e-nose) system with five metal oxide semiconductor (MOS) gas sensors, a microcontroller, and machine learning algorithms was designed and developed for this application. The volunteers in this study include 114 patients with lung cancer and 147 healthy controls to understand the clinical potential of the e-nose system to detect lung cancer and its stages. RESULTS: In the training phase, in discriminating lung cancer from controls, the XGBoost classifier model with 10-fold cross-validation gave an accuracy of 91.67%. In the validation phase, the XGBoost classifier model correctly identified 35 out of 42 patients with lung cancer samples and 44 out of 51 healthy control samples providing an overall sensitivity of 83.33% and specificity of 86.27%. CONCLUSIONS: These results indicate that the exhaled breath VOC analysis method may be developed as a new diagnostic tool for lung cancer detection. The advantages of e-nose based diagnostics, such as an easy and painless method of sampling, and low-cost procedures, will make it an excellent diagnostic method in the future.
RESUMO
As the amount and complexity of biomedical data continue to increase, machine learning methods are becoming a popular tool in creating prediction models for the underlying biomedical processes. Although all machine learning methods aim to fit models to data, the methodologies used can vary greatly and may seem daunting at first. A comprehensive review of various machine learning algorithms per biomedical applications is presented. The key concepts of machine learning are supervised and unsupervised learning, feature selection, and evaluation metrics. Technical insights on the major machine learning methods such as decision trees, random forests, support vector machines, and k-nearest neighbors are analyzed. Next, the dimensionality reduction methods like principal component analysis and t-distributed stochastic neighbor embedding methods, and their applications in biomedical data analysis were reviewed. Moreover, in biomedical applications predominantly feedforward neural networks, convolutional neural networks, and recurrent neural networks are utilized. In addition, the identification of emerging directions in machine learning methodology will serve as a useful reference for individuals involved in biomedical research, clinical practice, and related professions who are interested in understanding and applying machine learning algorithms in their research or practice.