화학공학소재연구정보센터
Journal of Process Control, Vol.67, 83-93, 2018
An improved variable selection method for support vector regression in NIR spectral modeling
Support vector regression (SVR) has become increasingly popular in analyzing near-infrared spectroscopic data. As an alternative to the conventional partial least squares (PLS) methods, the advantages of SVR include better generalization performance and its robustness against nonlinear input-output relationships. Since most variable selection methods in literature focus on identification of linear input-output relationships, these methods are suboptimal when nonlinear SVR models are constructed. In this paper, we propose a variable selection method using recursive feature elimination with mutual information-based bias correction (RFE-MICBC). The proposed method is able to handle nonlinearity and correlations within the feature space. Through two case studies (one with simulated data and one with diesel NIR spectra), the performance of SVR-RFE-MICBC is compared against variable importance in projection (VIP) based feature selection method for PLS and a probabilistic prediction method for SVR [1]. The results demonstrate that the new method is superior in performance especially when the data set contains a high level of nonlinearities. Lastly, we also show that variable selection is beneficial in improving model efficiency and interpretability. The resulting wavelength bands considered to be important provide valuable guidance for future spectral analysis of diesel fuels. Published by Elsevier Ltd.