Your browser doesn't support javascript.
loading
Application of the combination method based on RF and LE in near infrared spectral modeling.
Zhang, Xiao-Wen; Chen, Zheng-Guang; Jiao, Feng.
Affiliation
  • Zhang XW; College of Information and Electrical Engineering, Heilongjiang Bayi Agricultural University, Daqing 163319, China.
  • Chen ZG; College of Information and Electrical Engineering, Heilongjiang Bayi Agricultural University, Daqing 163319, China. Electronic address: ruzee@byau.edu.cn.
  • Jiao F; College of Agriculture, Heilongjiang Bayi Agricultural University, Daqing 163319, China.
Spectrochim Acta A Mol Biomol Spectrosc ; 289: 122247, 2023 Mar 15.
Article in En | MEDLINE | ID: mdl-36549073
ABSTRACT
The dimensionality of near-infrared (NIR) spectral data is often extremely large. Dimensionality reduction of spectral data can effectively reduce the redundant information and correlation between spectral variables and simplify the model, which is crucial to increasing the model's performance. As a nonlinear feature extraction method, Laplacian Eigenmaps (LE) may preserve the local neighborhood information of the dataset, has high robustness, and is simple to compute. However, when the LE algorithm maps the data from high-dimensional space to low-dimensional space, it is often disturbed by irrelevant information and multicollinearity in the spectral data, which lowers the model's prediction performance. Random Frog (RF) algorithm can eliminate noise and collinearity in the spectrum. Therefore, before using the LE algorithm, we use the RF algorithm to eliminate irrelevant information in the spectrum and reduce the correlation between the spectra variables to increase the efficiency of the LE algorithm. We used the RF + LE algorithm to reduce the dimensionality of two public NIRS datasets (soil datasets and pharmaceutical tablets datasets) and compared it with RF and LE algorithms alone. We utilized Partial Least Squares Regression (PLSR) and Support Vector Regression (SVR) to establish regression models. The experimental findings demonstrate that compared with the RF algorithm and LE algorithm, the RF + LE combination method can reduce the dimension of spectral variables and model complexity, and improve regression models' prediction accuracy and stability. It is an effective dimensionality reduction method for the near-infrared spectrum.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Algorithms / Spectroscopy, Near-Infrared Type of study: Prognostic_studies Language: En Journal: Spectrochim Acta A Mol Biomol Spectrosc Journal subject: BIOLOGIA MOLECULAR Year: 2023 Document type: Article Affiliation country: China

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Algorithms / Spectroscopy, Near-Infrared Type of study: Prognostic_studies Language: En Journal: Spectrochim Acta A Mol Biomol Spectrosc Journal subject: BIOLOGIA MOLECULAR Year: 2023 Document type: Article Affiliation country: China